Fuzzy logic search to find international characters.
April 13, 2006 2:57 AM
I've got a database of names, many of which have international characters (e-acute, c-cedilla, o-umlaut, etc). I want the search routine to be clever enough that if I search for "Celik" it'll find c-cedilla-elik, even though "c" and "c-cedilla" are entirely different.
Does a look-up table exist that matches the whole range of such non-English letters with their nearest-looking English equivalents? Or can anyone here help me construct one?
I'm thinking o and u umlaut, c and s cedilla, o circumflex, Turkish g and undotted i, Scandinavian o with a line through it, Spanish n, e with a grave and acute, accented a, the diphthongs.
Any more for any more?
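For what it's worth, here is a rough sketch of the sort of thing I have in mind, in Python. It assumes Unicode decomposition (NFKD) can split most accented letters into a base letter plus a combining mark, so only the letters that don't decompose need a hand-made table. The names EXTRA_FOLDS, fold and matches are made up for illustration, and the table is certainly incomplete.

```python
import unicodedata

# Hypothetical hand-built table for letters that have no Unicode
# decomposition into "base letter + accent" (assumption: incomplete).
EXTRA_FOLDS = {
    "ø": "o", "Ø": "O",    # Scandinavian o with a line through it
    "ı": "i",              # Turkish undotted i
    "ß": "ss",
    "æ": "ae", "Æ": "AE",
    "œ": "oe", "Œ": "OE",
    "đ": "d", "Đ": "D",
    "ł": "l", "Ł": "L",
}

def fold(text: str) -> str:
    """Return text with accents stripped and special letters replaced."""
    # NFKD turns e.g. c-cedilla into "c" followed by a combining cedilla.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks, keeping only the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace whatever did not decompose using the hand-built table.
    return "".join(EXTRA_FOLDS.get(ch, ch) for ch in stripped)

def matches(query: str, name: str) -> bool:
    """Case-insensitive substring match on the folded forms."""
    return fold(query).casefold() in fold(name).casefold()
```

With this, matches("Celik", "Ahmet Çelik") comes out true, because both sides are folded before comparing. In a real database the folded form would probably be stored in its own indexed column rather than computed on every search.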
posted by Pericles to computers & internet (16 comments total)
posted by grouse at 3:07 AM on April 13, 2006