I'm building a job where I want to identify duplicate values in multiple columns from a DB source, and retrieve those rows for further processing. My first though was to use tFuzzyMatch, but then I need a lookup component. However, I don't understand exactly what I'm supposed to do the lockup against. Any advice is much appreciated!
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Learn how to do cool things with Context Variables
Move from On-Premises to the Cloud by following the advice of experts
Find out about Talend Open Studio for Data Quality