Daitch-Mokotoff Soundex (D-M Soundex) is a phonetic algorithm invented in 1985 by genealogist Gary Mokotoff, and later improved by Randy Daitch, both of the Jewish Genealogical Society. It is a refinement of the Russell and American Soundex algorithms designed to allow matching of Slavic and Yiddish surnames with similar pronunciation but differences in spelling.
Daitch-Mokotoff Soundex is sometimes referred to as "Jewish Soundex" and "Eastern European Soundex", although the authors discourage use of these nicknames for the algorithm.
Improvements[edit | edit source]
Improvements over the older Soundex algorithms include:
- Coded names are six digits long, resulting in greater search precision (traditional Soundex uses four characters)
- Coded names can be stored as numeric values, which can save space in some applications (regular Soundex encodes values as alphanumeric text)
- Several rules in the algorithm encode multiple character n-grams as single digits (American and Russell Soundex do not handle multi-character n-grams)
- Multiple possible encodings can be returned for a single name (traditional Soundex returns only one encoding, even if the spelling of a name could potentially have multiple pronunciations)
Examples[edit | edit source]
|Surname||American Soundex||D-M Soundex|
|Jackson||J250||154600, 454600, 145460, 445460|
|Jackson-Jackson||J252||154664, 454664, 145466, 445466, 154646, 454646, 145464, 445464|
See also[edit | edit source]
[edit | edit source]
- Mokotoff, Gary. "Soundexing and Genealogy." Describes the history and the motivations behind D-M Soundex.
- JewishGen. "Soundex Coding." Describes both Russel and D-M Soundex.
- Project Dedupe http://dedupe.sourceforge.net
- Coles, Michael. "SQL 2000 DBA Toolkit, Part 3: Phonetic Matching" SQL Server-based implementation of the D-M Soundex algorithm w/source.
|This page uses content from the English language Wikipedia. The original content was at Daitch-Mokotoff Soundex. The list of authors can be seen in the page history. As with this Familypedia wiki, the content of Wikipedia is available under the Creative Commons License.|