Hi all, I have 2 tables, a Persons table (alot of records, where the surnames could be enter incorrectly due to human error) and a Surname table (has 10 records with correct surnames that I am looking for in the Persons table). How do I use the Fuzzy lookup to lookup on surnames that look the same.Eg. Petersen could look like peteiren, peterson, pterson. NB. There could be more than one of the same surnames that appear in the Persons table.
Wonderful Data Flow component for cleaning the data. Couple of things to look out for is "Similarity threshold" & "Editable Comparision Flag" one can set between input column and lookup column. Please look into below link for more information: http://www.sqlservercentral.com/articles/SQL Server 2005 - SSIS/3105/ http://msdn.microsoft.com/en-us/library/ms137786.aspx
Check with Mapping Type, It should be "Fuzzy" & not "Exact". ( Right Click on Mapping and then Edit Mapping under Columns Tab ) Also check with Similarity Threshold on "Advanced" tab. ( try with any value between 0.2 to 0.5 ). Fuzzy lookup will return you "_Similarity" & "_Confidence" along with other columns that you have in transformation. For "peteiren" compare to actual value "Petersen", i am getting _Similarity value as 0.7381395