Six Stars

tFuzzymatch not giviing required output in Talend

I have a workflow in which i have 2 files.

First file has below data separated by delimiter

Name;Department
wdci pty ltd1;Sales
talend Open Studiofdger;Marketing
Salesforce ;Design
wdci pty ltd1;Automation
Salesforce1;Marketing
wdci pvt limited;Design
Roger Federer;Tennis
Maradona;Soccer

second file has below data

nam;Sales
wdci pty ltd1;195
talend Open Studiofdger;188
Salesforce ;159
MyName;129

When i create a talend job to do fuzzy matching, its isn't giving me correct output.

Below is my talend workflow

enter image description here

 

Below is output

 

Starting job FuzzyMatch at 11:54 13/04/2017.

[statistics] connecting to socket on port 3777
[statistics] connected
Name|Department|1,2|nam,MyName
wdci pty ltd1|Sales|0|wdci pty ltd1
talend Open Studiofdger|Marketing|0|talend Open Studiofdger
Salesforce |Design|0|Salesforce 
wdci pty ltd1|Automation|0|wdci pty ltd1
Salesforce1|Marketing|1|Salesforce 
wdci pvt limited|Design||
Roger Federer|Tennis||
Maradona|Soccer||
[statistics] disconnected
Job FuzzyMatch ended at 11:54 13/04/2017. [exit code=0]

What i believe is that last two records should not be appearing in the output. Any suggestion?

  • Data Integration
2 REPLIES
One Star

Re: tFuzzymatch not giviing required output in Talend

Hi,

 

In my opinion it's the proper functioning.

 

You have a main with 7 rows, so in the output you keep the 7 rows and the Fuzzy Component says there's nothing in your lookup matching for the last two lines.

 

You just have to put a filter component and it should be ok no ?

 

 

Six Stars

Re: tFuzzymatch not giviing required output in Talend

So you mean when there is no match you don't need those records to your target flow? is it correct?