We are using the Data Preparation Free tool to manage excel file that will then be imported into a Marketing Automation tool called Marketo.
Unfortunately, when we export the data from the data preparation tool usin CSV UTF-8 encoding, any Latin characters that have accents or double dots on top for example are all coming out incorrectly. When we do the same thing with MS Excel all come out fine.
What it is that we are doing wrong when exporting data from the data preparation free tool?
Thanks for the help
It seems surprising at first glance ... especially knowing that Data Prep only supports UTF-8 when exporting to CSV.
Therefore, few questions:
Thanks for replying to quickly.
We use Talend Data preparation Desktop Free tool. Then open the file in MS Excel 2016.
Here are two examples
|Company||First Name||Last Name||Function Title||Address||Postal Code||Post Office||Country|
|ABC||John||Smith||CIO||StephensonstraÃŸe 1||12345||Frankfurt a. Main||Germany|
Hope this helps
That is what I assumed ... it's actually an Excel issue: Excel doesn't automatically detect encoding of CSV files and always assumes it is Windows-1252 ... hence the issue you see when opening the CSV file generated by Data Prep. If you open the CSV file in Notepad++ or any other decent text editor, you will see that the file generated by Data Prep is well encoded in UTF-8.
Note that you can display correctly a file encoded in UTF-8 in Excel ... it's everything but user-friendly, though. See https://www.youtube.com/watch?v=GcYt1mJbwk4 for instance (it is not on Excel 2016 but the sequence is essentially the same). And it has been this way for ever ... so it is unlikely to be fixed by Microsoft.
Good news: in the next Data Prep release (planned for January 2018), you will be able to select the encoding when exporting to CSV. So you'll be able to export in Windows-1252 so that Excel can read it correctly natively.
Hope this helps,