I have an input_file encoded in ANSI that I want to encode to UTF-8.
So basically, I use the tChangeFileEncoding component and I do get an output_file encoded in UTF-8. While I open it with notepad++, everything is alright.
But when i open it with Excel, "€" and "é" caracters show me things like "â‚¬_" and "Ã©".
Is there any way to fix this ?
I started to get a grasp on your awnser and the solution to fix my problem is to use the BOM.
Unfortunately, while using tFileChangeEncoding and indicating "UTF-8-BOM", Talend can not recognize it and therefore deliver a proper output file. Anyone knows how to use the BOM in Talend ? Or use the custom encoding option ?
Ok, it's not how it works. I have found this topic which is related to my problem. Apparently, I need to use a custom component in order to use BOM. BOM is not native on Talend. But maybe the previous topic is too old. I can't find the tWriteHeaderLineToFileWithBOM component. Is there a way to download it or did the OP retrieve it ?
The key to my problem is the BOM. I'm sure of it. Once I can download, install and use that custom component, my problem will be solved.
Could you please refer to this link about:https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&p=marketplace%...?
And feel free to let us know if you can download this custom component from talend exchange portal.