tMap wildcard join

Five Stars

tMap wildcard join

Dear All,

 

I have a fairly straightforward requirement. I need to allocate the nearest delivery centre based on each customer's postcode. I can generate 2 files:

customers.csv

Name,Address,PostCode

Joe Bloggs, Some Road in Reading, RG1 4XX

Tim John, Some Road in London, NW10 1AA

 

Delivery.csv

PostCode, Centre

RG, Reading

NW, London

EC, London

 

I want a '%PostCode%'-style wildcard match: a customer row should match a delivery row when the PostCode from customers.csv contains the PostCode from Delivery.csv.

 

Many Thanks

 


Accepted Solutions
Nine Stars

Re: tMap wildcard join

You could do something simple if you are sure that the first two characters of the postal code will always contain the region code. Please add a null check to your expression (I left that out).

TalendImage.JPG
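For readers who cannot see the screenshot, here is a minimal sketch of the idea in plain Java. It is not the expression from the attachment. The class and method names (`PostcodeRegionLookup`, `regionKey`, `centreFor`) are hypothetical, and it assumes the region code is always exactly the first two characters, which does not hold for UK postcode areas with a single letter (for example B or E). The sample values in customers.csv have a leading space after the comma, so the sketch trims before taking the key.

```java
import java.util.HashMap;
import java.util.Map;

class PostcodeRegionLookup {

    // Returns the upper-cased first two characters of the postcode,
    // or null when the input is null or shorter than two characters.
    static String regionKey(String postCode) {
        if (postCode == null) {
            return null;
        }
        String trimmed = postCode.trim();
        if (trimmed.length() < 2) {
            return null;
        }
        return trimmed.substring(0, 2).toUpperCase();
    }

    // Looks up the centre for a customer postcode; null means "no match".
    static String centreFor(String postCode, Map<String, String> centres) {
        String key = regionKey(postCode);
        return key == null ? null : centres.get(key);
    }

    public static void main(String[] args) {
        // Lookup data taken from Delivery.csv in the question.
        Map<String, String> centres = new HashMap<>();
        centres.put("RG", "Reading");
        centres.put("NW", "London");
        centres.put("EC", "London");

        System.out.println(centreFor(" RG1 4XX", centres)); // Reading
        System.out.println(centreFor("NW10 1AA", centres)); // London
        System.out.println(centreFor(null, centres));       // null
    }
}
```

The body of `regionKey` is the part that would plausibly serve as the join key on the customer side, with the null and length checks standing in for the null check mentioned above.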

Regards
Abhishek KUMAR
( Please mark the post as resolved if you found your solution )

All Replies
Employee

Re: tMap wildcard join

Hi,

 

    It is a pretty risky proposition to identify the nearest delivery centre just by doing a %POST_CODE% match on the customer postcode. The match will be wrong in many cases, and we would be reinventing the wheel when mature SaaS solutions are already available.

image.png

 

In a nutshell, I would suggest using the standardization and matching features of one of the above components, based on your comfort level. They use Royal Mail's Postcode Address File (PAF) as their source and report accuracy percentages by comparing it with the input addresses.

 

Another problem: even if the postal address matches, what guarantees that you are delivering to exactly the right person if your match is just a hard match on postcode? There are a lot of common names in the UK, like Adam Smith, and people with the same name may live on the same street.

 

Then there is the scenario where a father and son share a name and differ only by a Junior or II suffix. Another scenario is a divorced husband and wife living at the same location. If your address match is wrong and a letter goes to the wrong member of a divorced couple, you could end up paying a large GDPR fine :-)

 

So my suggestion is to use the address standardization features of the Talend Data Quality components rather than building something yourself. Otherwise you will put a lot of effort into reinventing the wheel, when these components already do the job with mature algorithms in a plug-and-play fashion.

 

So do you still want to try this yourself?

 

Warm Regards,
Nikhil Thampi

Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved :-)


Five Stars

Re: tMap wildcard join

I appreciate your advice. However, all our delivery centres have a postcode list, i.e. just the first two letters. So I am really looking for a wildcard match in tMap or similar.

 

Many Thanks

Five Stars

Re: tMap wildcard join

Thanks Abhishek,

That is my fallback plan. Is there any way to do a wildcard match, as the next phase of the load will use 3-digit postcodes? Alternatively, I will upload them to a staging database and use a wildcard match in the join. However, I wanted to see whether anything is available in Talend to do this.

 

Appreciate your answer.
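To make the requirement concrete, here is a minimal sketch in plain Java of a prefix ("wildcard") match that picks the longest delivery prefix the customer postcode starts with, so 2-character and 3-character codes can coexist. It is an illustration under stated assumptions, not a built-in Talend feature. The class name `PostcodePrefixMatcher`, the method `centreFor`, and the "NW1" row are hypothetical, and the prefixes are assumed to be stored trimmed and in upper case.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class PostcodePrefixMatcher {

    // Returns the centre whose prefix is the longest one that the
    // postcode starts with, or null when no prefix matches.
    static String centreFor(String postCode, Map<String, String> centresByPrefix) {
        if (postCode == null) {
            return null;
        }
        String normalised = postCode.trim().toUpperCase();
        String bestPrefix = null;
        for (String prefix : centresByPrefix.keySet()) {
            boolean matches = normalised.startsWith(prefix);
            boolean longer = bestPrefix == null || prefix.length() > bestPrefix.length();
            if (matches && longer) {
                bestPrefix = prefix;
            }
        }
        return bestPrefix == null ? null : centresByPrefix.get(bestPrefix);
    }

    public static void main(String[] args) {
        Map<String, String> centres = new LinkedHashMap<>();
        centres.put("RG", "Reading");
        centres.put("NW", "London");
        centres.put("EC", "London");
        // Hypothetical 3-character row, added only to show the longest-prefix rule.
        centres.put("NW1", "North London (hypothetical)");

        System.out.println(centreFor(" RG1 4XX", centres));
        System.out.println(centreFor("NW10 1AA", centres));
        System.out.println(centreFor("SW1A 1AA", centres));
    }
}
```

Note that a pure string prefix also matches "NW1" against "NW10 1AA", although NW1 and NW10 are different postcode districts. Whether that is acceptable depends on how the centre codes are defined. Comparing against only the outward part of the postcode (the text before the space) would be one way to tighten the match.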

 

 

 

Employee

Re: tMap wildcard join

Hi,

 

   That will be a good fallback method, but add the necessary caveats about the possible data-match risk to your design documents and development handover guide, so that the issue does not snowball onto you later.

 

   If the project architect decides that this is the best method, then the onus of handling any data problems or GDPR issues should ultimately rest with him or her. They should not sling mud at you or the testing team over any setback.

 

Warm Regards,
Nikhil Thampi

Five Stars

Re: tMap wildcard join

Much appreciated, Nikhil.

 

Thanks

 
