Re: Help needed implementing fuzzy logic

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance

From: John Nurick (j.mapSoN.nurick_at_dial.pipex.com)
Date: 03/22/04


Date: Mon, 22 Mar 2004 07:17:49 +0000

Soundex wasn't designed for quite this purpose and might give you a lot
of false matches (IIRC it will match Robert Smith and Robin Smythe).
Also and it puts a lot of weight on the first character and doesn't (as
far as I can remember) match Rob and Bob.

The Metaphone algorithm might get you a bit closer. There used to be a
DLL downloadable from
http://www.programmersheaven.com/zone15/cat161/2902.htm

Another approach would be to use the Levenshtein distances (the distance
between two strings is the number of individual character edits needed
to convert one into the other):
        Robert Smith
        Rob Smith (3)
        Bob Smith (4)
        Robt. Smith (2 if you strip out the punctuation)
There's an algorithm with VB implementation at
http://www.merriampark.com/ld.htm#VB

On Sat, 20 Mar 2004 18:18:11 GMT, masprog@attglobal.net.invalid (John
Moore) wrote:

>
>I would like to provide the user with the ability to identify
>duplicate records based on a person's first and last name. This works
>OK as long as the spelling of both names matches. However, if the user
>types a person's name slightly different it gets around my duplicate
>checking code. For example, I would like to catch that Robert Smith
>may be the same person as Rob Smith or Bob Smith or Robt. Smith.
>
>I have recently investigated using a Soundex type algorithm to do this
>fuzzy logic. There are numerous algorithms available but I have been
>unable to find any instructions and/or code that describes how you
>actually implement this process.
>
>Any and all help or guidance is greatly appreciated. If you wish to
>respond to me via email, remove the .invaild from the end.
>
>John Moore
>DSI

--
John Nurick [Microsoft Access MVP]
Please respond in the newgroup and not by email.


Relevant Pages

  • Re: Help needed implementing fuzzy logic
    ... of false matches (IIRC it will match Robert Smith and Robin Smythe). ... The Metaphone algorithm might get you a bit closer. ... I would like to catch that Robert Smith ...
    (microsoft.public.access.modulesdaovba)
  • Re: Help needed implementing fuzzy logic
    ... > of false matches (IIRC it will match Robert Smith and Robin Smythe). ... > The Metaphone algorithm might get you a bit closer. ... I would like to catch that Robert Smith ...
    (microsoft.public.access.modulesdaovba)
  • Re: Help needed implementing fuzzy logic
    ... > of false matches (IIRC it will match Robert Smith and Robin Smythe). ... > The Metaphone algorithm might get you a bit closer. ... I would like to catch that Robert Smith ...
    (microsoft.public.access.forms)