Unable to view records from Match Analysis

One Star

Hi All, 
On Talend Open Studio for DQ 5.6, running on OS X 10.10, I am unable to access the "View Rows" option from Analysis Results for a match analysis, although it is available in other analyses. Is this a known bug that has been fixed in TODQ 6, or am I doing something wrong?
I have followed the instructions given in the manual:
The explanation is however not detailed to the point that you're told how to extract this data.
Is there a reason for this (for instance, is the feature only available in the Enterprise edition)?
Thanks
Moderator

Re: Unable to view records from Match Analysis

Hi,
So far, Match Analysis does not support the "view rows" feature, either in Talend Open Studio or in the Talend Enterprise subscription products.
I have followed the instructions given in the manual:
The explanation is however not detailed to the point that you're told how to extract this data.

Could you please indicate which document you are referring to?
Best regards
Sabrina
One Star

Re: Unable to view records from Match Analysis

Hi Sabrina,
I meant the Talend help center guide (can't post URL):
How to show the match results
To collect duplicates from the input flow according to the match types you define, Levenshtein and Jaro-Winkler in this example, do the following:

If you are processing large data sets, select the Store on disk check box in the Analysis parameter view and:

In the Max buffer size field, type in the size of physical memory you want to allocate to processed data.
In the Temporary data directory path field, set the path to the directory where you want to store the temporary file.

Save the settings in the match analysis editor and press F6.
The analysis is executed. The match rule and blocking key are computed against the whole data set and the Analysis Results view opens in the editor.
So, what would be the correct way to access the match data once the operation has run? Nothing from the operation remains in the specified Temporary data directory path.
One Star

Re: Unable to view records from Match Analysis

EJB-alpha now registered as erwanbegoc
I think I just found the answer to my question:
In Talend-DQ, the purpose of the match analysis is really to create a rule that can then be used in Talend-DI. Talend-DQ is not meant to be used as a standalone application for that particular use case (deduplication). Am I correct?
One Star

Re: Unable to view records from Match Analysis

Hi,
I am using the Talend Data Management 6.1.1 subscription version. In Profiling --> Match Analysis, I am unable to use "view rows". Can someone please help me with this?
Kind Regards,
Neeraja.
Four Stars

Re: Unable to view records from Match Analysis

I too am having trouble doing this.

I have a paid-for, full installation of Talend Studio 7.1.1. I am looking for duplicate entries within two "near matching" sets of client data.

I have created a Match Analysis.

I have created a Report of this.

What I need to see are the *actual* results of the matching process, so that I can pass them along to my client. They can then see how widespread the problem is and identify which records are true duplicates and which are not.

So, I want to be able to provide my customer with an Excel document that says something along the lines of:

Client Code | Forename | Surname | email
1234 | John | Smith | johnsmith@myemail.com
9876 | Jon | Smith | johnsmith@myemail.com

I believe I will need to create a "job" to do this, but I do not know how to create one.

Thank you
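
For readers who only need a flat list of candidate duplicates like the one described in the post above, here is a minimal Python sketch that runs outside Talend. It is not how Talend's match analysis is implemented and it does not use any Talend API. It groups records by exact email (a simple blocking key), then compares forename and surname within each group using a normalized Levenshtein similarity. The field names, the 0.7 threshold, and the function names (`levenshtein`, `similarity`, `find_duplicate_pairs`, `pairs_to_csv`) are assumptions chosen for illustration.

```python
import csv
import io
from collections import defaultdict
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1]; 1.0 means identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a.lower(), b.lower()) / longest


def find_duplicate_pairs(records, threshold=0.7):
    """Return (left, right, score) for likely duplicates.

    Blocking key (assumption): the lower-cased email address.
    Score: the weaker of the forename and surname similarities.
    """
    blocks = defaultdict(list)
    for rec in records:
        blocks[rec["email"].strip().lower()].append(rec)

    pairs = []
    for block in blocks.values():
        for left, right in combinations(block, 2):
            score = min(similarity(left["forename"], right["forename"]),
                        similarity(left["surname"], right["surname"]))
            if score >= threshold:
                pairs.append((left, right, score))
    return pairs


def pairs_to_csv(pairs) -> str:
    """Render pairs as CSV text, one group number per candidate pair."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["group", "client_code", "forename", "surname", "email", "score"])
    for group_id, (left, right, score) in enumerate(pairs, start=1):
        for rec in (left, right):
            writer.writerow([group_id, rec["client_code"], rec["forename"],
                             rec["surname"], rec["email"], f"{score:.2f}"])
    return out.getvalue()


if __name__ == "__main__":
    sample = [
        {"client_code": "1234", "forename": "John", "surname": "Smith",
         "email": "johnsmith@myemail.com"},
        {"client_code": "9876", "forename": "Jon", "surname": "Smith",
         "email": "johnsmith@myemail.com"},
    ]
    print(pairs_to_csv(find_duplicate_pairs(sample)))
```

The CSV text can be saved to a file and opened in Excel. Blocking on exact email keeps the number of comparisons small, but it will miss duplicates whose email addresses differ, so a looser blocking key may be needed for real data.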
