I've been using the fuzzy lookup to try to match similarly named source values and have come across a result that surely can't be correct.
For the name "Hopkins W.", I have three potential matches (according to the fuzzy lookup) with the following similarity scores:
Higgins W.    0.632809937000275
Zoins W.      0.618580460548401
Hopkins K.    0.583522439002991
Now, considering that "Hopkins K." matches on the entire surname and the other two match only on the "W." initial, why on earth are the other two rated as more likely matches? Even treating this purely as a token-level match, each of the three candidates has exactly one token in common with the source, so at worst all three should score the same. Can anyone come up with a sound reason behind the result, or is this worth raising with Microsoft?
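
To show what I mean by a token-level comparison, here is a minimal Python sketch. It is not the algorithm the fuzzy lookup uses (I don't know what that does internally); the function names `token_jaccard` and `edit_distance` are my own, and the two measures (Jaccard overlap on whitespace-separated tokens, and plain Levenshtein distance on characters) are just the simplest ones I would expect to behave sensibly here.

```python
def tokens(name):
    """Split a name into lowercase whitespace-separated tokens."""
    return set(name.lower().split())


def token_jaccard(a, b):
    """Shared tokens divided by distinct tokens across both names."""
    ta, tb = tokens(a), tokens(b)
    return len(ta & tb) / len(ta | tb)


def edit_distance(a, b):
    """Levenshtein distance: minimum single-character inserts,
    deletes and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        prev = curr
    return prev[-1]


source = "Hopkins W."
candidates = ["Higgins W.", "Zoins W.", "Hopkins K."]

for candidate in candidates:
    print(candidate,
          round(token_jaccard(source, candidate), 3),
          edit_distance(source, candidate))
```

Under these two measures, all three candidates get the same token score (one shared token out of three distinct tokens), and "Hopkins K." is the closest by character edits, being a single substitution away. Neither measure ranks "Hopkins K." last, which is why the fuzzy lookup scores look wrong to me.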