Four Stars

Export to UTF-8 CSV using Data Preparation Tool Free

Hello

 

We are using the Data Preparation Free tool to manage excel file that will then be imported into a Marketing Automation tool called Marketo.

 

Unfortunately, when we export the data from the data preparation tool usin CSV UTF-8 encoding, any Latin characters that have accents or double dots on top for example are all coming out incorrectly. When we do the same thing with MS Excel all come out fine.

 

What it is that we are doing wrong when exporting data from the data preparation free tool?

 

Thanks for the help

Tags (2)
3 REPLIES
Employee

Re: Export to UTF-8 CSV using Data Preparation Tool Free

Hi,

 

It seems surprising at first glance ... especially knowing that Data Prep only supports UTF-8 when exporting to CSV.

 

Therefore, few questions:

  • Can you clarify in which tool you have faced the encoding issue when opening the CSV file generated by Data Prep?
  • Could you share a sample of the source file and of the resulting file exported from Data Prep? It can of course be dummy data instead of your real data, as long as we can witness the issue.

 

 

Thank you

Four Stars

Re: Export to UTF-8 CSV using Data Preparation Tool Free

Hi Gwendal

 

Thanks for replying to quickly. 

 

We use Talend Data preparation Desktop Free tool. Then open the file in MS Excel 2016.

 

Here are two examples

CompanyFirst NameLast NameFunction TitleAddressPostal CodePost OfficeCountry
ABCJohnSmithCIOStephensonstraße 112345Frankfurt a. MainGermany
XYZJörgMunchCDOArabellastraße 412345MünchenGermany

 

Hope this helps

Thanks

Axel

Employee

Re: Export to UTF-8 CSV using Data Preparation Tool Free

Hi Axel,

 

That is what I assumed ... it's actually an Excel issue: Excel doesn't automatically detect encoding of CSV files and always assumes it is Windows-1252 ... hence the issue you see when opening the CSV file generated by Data Prep. If you open the CSV file in Notepad++ or any other decent text editor, you will see that the file generated by Data Prep is well encoded in UTF-8.

 

Note that you can display correctly a file encoded in UTF-8 in Excel ... it's everything but user-friendly, though. See https://www.youtube.com/watch?v=GcYt1mJbwk4 for instance (it is not on Excel 2016 but the sequence is essentially the same). And it has been this way for ever ... so it is unlikely to be fixed by Microsoft.

 

Good news: in the next Data Prep release (planned for January 2018), you will be able to select the encoding when exporting to CSV. So you'll be able to export in Windows-1252 so that Excel can read it correctly natively.

 

Hope this helps,

 

Gwendal