Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

One Star

Hi,
We are currently encountering an issue with the tAddCRCRow component.
The component is configured to calculate the CRC value from 2 fields: ID_SHOP and ID_PRODUCT (screenshot 1).
However, for several values, as in the example (screenshot 2), it generates the same CRC value for 2 different combinations of ID_SHOP and ID_PRODUCT.
Do you have any explanation for this problem?
Regards
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi,
Could it be the field length that causes this error?
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

How could the field length cause this error?
Moderator

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi vpacquet,
Here is the component reference for tAddCRCRow, with a scenario.
In addition, the "field length" mentioned by @saukema may refer to the length set in the schema of tAddCRCRow (see my picture). Could you give us more information about your workflow and job design? By the way, screenshot 1 is too small to see its inner structure clearly. Thanks a lot.
Best regards
Sabrina
--
Don't forget to give kudos when a reply is helpful and click Accept the solution when you think you're good with it.
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi,
I have added a better-quality screenshot.
The job is fairly simple: I am just adding a CRC row to a data flow, using the fields ID_SHOP and ID_PRODUCT.
ID_PRODUCT has a length of 12, but ID_SHOP has no field length set. Could this be the problem?
Regards,
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hello,
I have the same problem, and I tried several cases:
* with length
* without length
I found just one thing: when I set the CRC type to "CRC16", the result is better and the CRCs are different.
But that is not a good solution for me, since a shorter CRC can produce more identical results.
I am working with Talend 5.6.1.
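The concern that a shorter CRC collides more often can be put into numbers. The following is a minimal sketch, assuming checksum values are spread uniformly over the available range (an idealization, not a statement about how tAddCRCRow computes its values). The function names are hypothetical and only serve the illustration.

```python
import math


def collision_probability(n_values: int, crc_bits: int) -> float:
    """Birthday-bound approximation of the chance that at least two of
    n_values uniformly distributed checksums of width crc_bits are equal."""
    space = 2 ** crc_bits
    return 1.0 - math.exp(-n_values * (n_values - 1) / (2.0 * space))


def values_for_even_odds(crc_bits: int) -> int:
    """Approximate number of values at which a collision is about 50% likely."""
    return math.ceil(math.sqrt(2.0 * (2 ** crc_bits) * math.log(2.0)))


for bits in (16, 32):
    print(bits, values_for_even_odds(bits), collision_probability(10_000, bits))
```

Under this assumption, a 16-bit checksum reaches even odds of a collision after a few hundred rows, and a 32-bit one after tens of thousands of rows, so neither width can guarantee unique values on a table with millions of rows.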
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Here is my example, if you want it. Note the trailing space in the postal code ("3498 "); all the other fields are trimmed.
The CRC is computed on the 6 fields.
.-----------+--------------------+---------------+-----------+-----------+------+----------.
|                                        tLogRow_2                                         |
|=----------+--------------------+---------------+-----------+-----------+------+---------=|
|NUMERO_VOIE|ADRESSE_LIGNE_1     |ADRESSE_LIGNE_2|CODE_POSTAL|COMMUNE    |PAYS  |CRC       |
|=----------+--------------------+---------------+-----------+-----------+------+---------=|
|924        |CHEMIN DE PEYROT    |null           |31620      |BOULOC     |FRANCE|278186796 |
|null       |3498 rue de la pomme|null           |3498       |Commune3498|null  |1853816814|
'-----------+--------------------+---------------+-----------+-----------+------+----------'
Moderator

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi rlegrand,
Can you also reproduce this issue on Talend 6.1?
Best regards
Sabrina
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

I don't have this version.
I will ask my colleagues whether they have Talend 6.1 and whether they can try it.
After I opened an issue with Talend support, they told me not to use tConvertType with the option "Set empty values to Null before converting", which is not possible for me.
So they opened a Jira issue:
https://jira.talendforge.org/browse/TDQ-11547
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

OK, a colleague has version 6.1.
He tried it and got exactly the same result.
Moderator

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi,
We have seen this Jira issue: https://jira.talendforge.org/browse/TDI-35136 and will keep you posted.
Best regards
Sabrina
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

I know, it's mine... but I'm a little disappointed: there is no solution, because this is considered normal.
I am only working with 24 million addresses, so 24 million different CRCs.
I calculate 1.6 million CRCs and compare them to the existing values to know whether each row is an insert or an update.
4374 CRCs are found among the existing values, but the rows are not the same!
I can provide more examples. Do you think 4300 is a small number?
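As a rough sanity check on these figures, the number of accidental matches to expect can be estimated. This is a sketch under stated assumptions: a 32-bit CRC whose values are uniformly distributed, 24 million distinct existing values, and 1.6 million incoming rows that are all genuinely new. The function name is hypothetical.

```python
import math


def expected_false_matches(existing: int, incoming: int, crc_bits: int = 32) -> float:
    """Expected number of incoming rows whose checksum equals the checksum
    of some unrelated existing row, assuming uniformly distributed values."""
    space = 2 ** crc_bits
    # Probability that one incoming checksum hits at least one existing value:
    # 1 - (1 - 1/space) ** existing, computed in a numerically stable way.
    p_hit = -math.expm1(existing * math.log1p(-1.0 / space))
    return incoming * p_hit


estimate = expected_false_matches(24_000_000, 1_600_000)
print(round(estimate))
```

Under these assumptions the estimate comes out at several thousand accidental matches, the same order of magnitude as the 4374 observed. A 32-bit checksum alone is therefore not enough to decide between insert and update at this volume; a CRC match would need to be confirmed by comparing the actual fields.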
One Star

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

New data to inject:
NULL 1010 rue de la pomme NULL 1010 Commune1010 NULL 532135616
NULL 2376 rue de la pomme NULL 2376 Commune2376 NULL 2259114405
NULL 3498 rue de la pomme NULL 3498 Commune3498 NULL 278186796
NULL 3638 rue de la pomme NULL 3638 Commune3638 NULL 1579150529
NULL 3648 rue de la pomme NULL 3648 Commune3648 NULL 1937392661
NULL 3654 rue de la pomme NULL 3654 Commune3654 NULL 2640442085
NULL 3814 rue de la pomme NULL 3814 Commune3814 NULL 1014668805
NULL 3892 rue de la pomme NULL 3892 Commune3892 NULL 4218793342
NULL 4640 rue de la pomme NULL 4640 Commune4640 NULL 3040598754
NULL 5196 rue de la pomme NULL 5196 Commune5196 NULL 2352608391
Existing data:
7 Rue PELISSIER 63000 CLERMONT-FERRAND FR 532135616
18 RUE DES CITES 70250 RONCHAMP FR 2259114405
924 CHEMIN DE PEYROT 31620 BOULOC FRANCE 278186796
25 rue st pierre 57935 LUTTANGE FRANCE 1579150529

Result when I calculate the CRC:
.-----------+--------------------+---------------+-----------+----------------+------+----------.
|                                           tLogRow_4                                           |
|=----------+--------------------+---------------+-----------+----------------+------+---------=|
|NUMERO_VOIE|ADRESSE_LIGNE_1     |ADRESSE_LIGNE_2|CODE_POSTAL|COMMUNE         |PAYS  |CRC       |
|=----------+--------------------+---------------+-----------+----------------+------+---------=|
|null       |1010 rue de la pomme|null           |1010       |Commune1010     |null  |532135616 |
|7          |Rue PELISSIER       |null           |63000      |CLERMONT-FERRAND|null  |532135616 |
|null       |3498 rue de la pomme|null           |3498       |Commune3498     |null  |278186796 |
|924        |CHEMIN DE PEYROT    |null           |31620      |BOULOC          |FRANCE|278186796 |
|null       |3638 rue de la pomme|null           |3638       |Commune3638     |null  |1579150529|
|25         |rue st pierre       |null           |57935      |LUTTANGE        |FRANCE|1579150529|
'-----------+--------------------+---------------+-----------+----------------+------+----------'
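One way to make such accidental matches practically impossible is to use a wider digest as the comparison key. The sketch below is not Talend's implementation and does not use any Talend API; it is a standalone illustration using Python's standard `hashlib`. The separator, the null marker, and the function name `row_key` are assumptions chosen for the example.

```python
import hashlib

FIELD_SEPARATOR = "\x1f"  # ASCII unit separator, assumed not to occur in the data
NULL_MARKER = "\x00"      # keeps a null field distinct from an empty string


def row_key(fields) -> str:
    """Build a 256-bit comparison key from the row's fields.

    Fields are joined with an explicit separator so that ("ab", "c") and
    ("a", "bc") do not produce the same input string.
    """
    parts = [NULL_MARKER if value is None else str(value) for value in fields]
    joined = FIELD_SEPARATOR.join(parts)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


# Two rows from the result above that received the same CRC value.
new_row = (None, "3498 rue de la pomme", None, "3498", "Commune3498", None)
existing_row = ("924", "CHEMIN DE PEYROT", None, "31620", "BOULOC", "FRANCE")

print(row_key(new_row) != row_key(existing_row))
```

The trade-off is storage: the key is 64 hexadecimal characters instead of a 32-bit integer. An alternative that keeps the CRC is to treat a CRC match only as a candidate and compare the six fields before deciding on an update.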
Moderator

Re: Talend 4.2.3 taddCRCrow same CRC value for 2 different data set

Hi,
Our component developer has updated this Jira issue: https://jira.talendforge.org/browse/TDI-35136. Could you please take a look at it?
Feel free to add your comments to the Jira issue.
Best regards
Sabrina