Hi All, On Talend Open Studio for DQ 5.6, running on OsX 10.10, i am unable to access the "View Rows" option from Analysis Results, unlike in other analysis. Is this a known bug that has been fixed in TODQ 6, or am I doing something wrong. I have followed the instructions given in the manual: The explanation is however not detailed to the point that you're told how to extract this data. Is there a reason for this (feature available in Enterprise edition for instance?) Thanks
Hi Sabrina, I meant the Talend help center guide (can't post URL): How to show the match results To collect duplicates from the input flow according to the match types you define, Levenshtein and Jaro-Winkler in this example, do the following:
If you are processing large data sets, select the Store on disk check box in the Analysis parameter view and:
In the Max buffer size field, type in the size of physical memory you want to allocate to processed data. In the Temporary data directory path field, set the path to the directory where you want to store the temporary file.
Save the settings in the match analysis editor and press F6. The analysis is executed. The match rule and blocking key are computed against the whole data set and the Analysis Results view is open in the editor So, what would be the correct way to access the match data once the operation has run. There's nothing in the specified Temporary data directory path that remains from the operation.
EJB-alpha now registered as erwanbegoc I think i just found the answer to my question: In Talend-DQ, the purpose of the match analysis is really to create a rule that can then be used in a Talend-DI, Talend-DQ is not meant to be used as a standalone application for that particular use-case (deduplication). Am I correct?