I have used the tFuzzyMatch component to find matching words in a lookup table. tFuzzyMatch only accepts one matching column as its condition, but I am hoping there is a component that can take two matching columns as conditions, each using a different matching method. Is there any component with this kind of function? Thanks a lot.
I don't think so, but you could replicate the input data flow using tReplicate, then apply a suitably configured tFuzzyMatch to each flow.
After that, in a new subjob, you can combine both flows using a tMap to get the result from the two previous tFuzzyMatch components.
It is not as easy as being able to use more than one lookup column in tFuzzyMatch, but it should work.
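To make the combining step concrete, here is a minimal plain-Java sketch of the logic the two tFuzzyMatch flows plus the tMap would implement together. It is not Talend-generated code and does not use any Talend API. The class name, method names, the two-column `{name, city}` row layout, and the choice of a case-insensitive comparison for the second column are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: one matching method per column, combined with AND.
class TwoColumnFuzzySketch {

    // Levenshtein edit distance, computed row by row with two arrays.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // A different, simpler method for the second column (assumed for illustration).
    static boolean sameIgnoringCase(String a, String b) {
        return a.trim().equalsIgnoreCase(b.trim());
    }

    // Keeps the lookup rows {name, city} that satisfy both column conditions,
    // which is what joining the two matched flows in a tMap would achieve.
    static List<String[]> match(String name, String city, List<String[]> lookup, int maxDistance) {
        List<String[]> hits = new ArrayList<>();
        for (String[] row : lookup) {
            boolean nameOk = levenshtein(name, row[0]) <= maxDistance;
            boolean cityOk = sameIgnoringCase(city, row[1]);
            if (nameOk && cityOk) {
                hits.add(row);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String[]> lookup = Arrays.asList(
                new String[] {"Jon Smith", "Paris"},
                new String[] {"John Smith", "Lyon"});
        for (String[] row : match("John Smith", "paris", lookup, 2)) {
            System.out.println(row[0] + " / " + row[1]);
        }
    }
}
```

In a real job each condition would live in its own tFuzzyMatch, and the tMap would join the two result flows on a row key so that only rows matched by both survive.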
There is a tRecordMatching component, which joins two tables by doing a fuzzy match on several columns using a wide variety of comparison algorithms. It compares columns from the main flow with reference columns from the lookup flow and, according to the matching strategy you define, outputs the match data, the possible match data and the rejected data. Let us know if this is what you are looking for.
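As a rough illustration of the three-way split described above, the sketch below combines per-column similarity scores into one weighted score and sorts a record pair into match, possible match, or reject. This is only an assumed model of the idea, not tRecordMatching's actual algorithm or configuration. The class name, the weights, and both thresholds are hypothetical, and the per-column scores are assumed to be already normalised to the range 0 to 1 by whatever comparison algorithm each column uses.

```java
// Hypothetical sketch of a match / possible match / reject decision.
class RecordMatchSketch {

    enum Outcome { MATCH, POSSIBLE_MATCH, REJECT }

    // Weighted average of per-column similarity scores (each assumed in [0, 1]).
    static double combinedScore(double[] columnScores, double[] weights) {
        double total = 0.0;
        double weightSum = 0.0;
        for (int i = 0; i < columnScores.length; i++) {
            total += weights[i] * columnScores[i];
            weightSum += weights[i];
        }
        return total / weightSum;
    }

    // At or above matchThreshold: match; at or above possibleThreshold: possible match;
    // anything lower is rejected.
    static Outcome classify(double score, double matchThreshold, double possibleThreshold) {
        if (score >= matchThreshold) {
            return Outcome.MATCH;
        }
        if (score >= possibleThreshold) {
            return Outcome.POSSIBLE_MATCH;
        }
        return Outcome.REJECT;
    }

    public static void main(String[] args) {
        // Two columns, equal weights, illustrative thresholds.
        double score = combinedScore(new double[] {0.8, 0.6}, new double[] {1.0, 1.0});
        System.out.println(classify(score, 0.9, 0.6));
    }
}
```

The point of the middle band is that borderline pairs can be routed to manual review instead of being silently accepted or dropped.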
I could not find tRecordMatching in Talend Open Studio 6.1. Is it a free component? I cannot find it on Talend Exchange either. Where can I download it?
Thanks a lot.