CSV file and tUniqueRow

One Star

CSV file and tUniqueRow

Hi,
I have a problem with the component tUniqueRow
I go through a file Excel and I wish to delete all duplicates of this file
My component works correctly ( no error message) but the results are false
There are maybe errors for the treatment of files Excel (.csv )? Bugs?
Fichier_Valide_Temp contains the lines of valid customers => row Uniques
Fichier_Doublons_Nom+Siret contains customers' lines which are in duplicates => row Duplicates
Is it a bug of the component ?


Thx
Community Manager

Re: CSV file and tUniqueRow

Hi
I go through a file Excel and I wish to delete all duplicates of this file

Using this component can get the unique rows and remove the duplicate rows via key attribute.
what's are your input data and what are your expected result ?
Best regards
shong
----------------------------------------------------------
Talend | Data Agility for Modern Business
One Star

Re: CSV file and tUniqueRow

hi Shong ,
Input Data (file Excel) : Customer 's informations of type : Siret Name of the customer
001 toto
002 titi
003 tata
001 toto

Expected result : row Duplicates : Siret Name of the customer
001 toto
row Uniques : Siret Name of the customer
002 titi
003 tata

In my tUniqueRow i checked unique key in key attrinute "Siret" and "Name of customer"
My purpose is to eliminate duplicates on the siret and on the name of the customer .
I want to have in result valid customers in a file Excel and the customers in copies (duplicates) in a other file Excel.
tUniqueRow duplicates are eliminate of the valid file or if it keeps a copy of the doubloon of the valid file.
Example : Result : row Duplicates : Siret Names of the customers
001 toto
001 toto
row Uniques : Siret Names of the customers
002 titi
003 tata
OR
row Duplicates : Siret Names of the customers
001 toto
row Uniques : Siret Names of the customers
001 toto
002 titi
003 tata

Thank you for your help
One Star

Re: CSV file and tUniqueRow

try this
One Star

Re: CSV file and tUniqueRow

I do not see the interest to cut my file in entry to two different files.
Furthermore, which is the purpose of the tMap in your example ?
A small explanation of your screen shots is possible.....
One Star

Re: CSV file and tUniqueRow

If you don't split the file you can't use the two flow in a tMap I've tried to do it.
In my tMap I do an Inner Join to know which column are duplicated. So in the output Duplicates I've put the Values which are in both flow. In the Uniques flow I've put the values which are not. (with the inner join reject)