I have followed the instructions given in the manual:
The explanation is however not detailed to the point that you're told how to extract this data.
I too am having trouble doing this.
I have a paid for, full installation of Talend Studio running 7.1.1. I am looking for duplicate entries within two 'near matching' sets of client data.
I have created a Match Analysis.
I have created a Report of this.
What I need to be able to see are the *actual* results of the matching process to pass along to my client so they can see how wide spread the problem is, but also identify which records are true duplicates and which are not.
So, I want to be able to provide my customer with an an excel document that says something along the lines of: Client Code | Forename | Surname | email
1234 | John | Smith | firstname.lastname@example.org |
9876 | Jon | Smith | email@example.com |
I believe I will need to create a "job" to do this, but I do not know how to create a job to do this.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Find out about Talend Open Studio for Data Quality
Learn how to enable Data Governance
Take a peek at the definitive guide to Government Data Quality