Six Stars

Encoding UTF-8 and ISO-8859

Hello,

I would like to set the encoding (UTF-8 or ISO-8859) because every accented character in my data (é, à, è, etc.) comes out as "?".

I saw that the encoding can be changed in the advanced settings, but I don't have...

Can someone help me?

Thank you.

Talend version: Talend Open Studio for Data Integration, Version: 6.3.1

Capture.PNG

  • Data Integration
6 REPLIES
Nine Stars TRF

Re: Encoding UTF-8 and ISO-8859

The Encoding setting applies to file components, not to database components.

TRF
Six Stars

Re: Encoding UTF-8 and ISO-8859

Thanks for your answer, TRF.
Can I force the encoding in the database connectors?
I use two database connectors: Firebird and General JDBC (Microsoft Azure).

Nine Stars TRF

Re: Encoding UTF-8 and ISO-8859

No (I don't think so).

Depending on the engine, the charset is chosen when the database is created.

 


TRF
Eleven Stars

Re: Encoding UTF-8 and ISO-8859

If you are having trouble with charsets, you can try fixing this using a bit of Java.

 

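// Mis-decoded text: UTF-8 bytes that were read as ISO-8859-1
String broken_nordic = "HÃ¤r Ã¤r nÃ¥gra merkkejÃ¤";
// Recover the original bytes, then decode them as UTF-8
byte[] bytes = broken_nordic.getBytes("ISO-8859-1");
String fixed_nordic = new String(bytes, "UTF-8");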

This *should* produce the following text:

"Här är några merkkejä"

Test it, and if it works, you can create a reusable routine to make it easier to use.
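
As a rough sketch of what such a routine might look like: the class name CharsetFix and the method latin1ToUtf8 are made up for illustration and are not part of Talend. It assumes the text was originally UTF-8 and was wrongly decoded as ISO-8859-1. In Talend you would probably declare the class public inside the routines package; that is left out here so the snippet compiles on its own.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; the name and method are illustrative only.
final class CharsetFix {

    private CharsetFix() {
        // Utility class, not meant to be instantiated.
    }

    /**
     * Re-decodes text whose UTF-8 bytes were mistakenly read as ISO-8859-1.
     * Returns null when the input is null.
     */
    static String latin1ToUtf8(String broken) {
        if (broken == null) {
            return null;
        }
        // Turn each character back into the single byte it was decoded from.
        byte[] bytes = broken.getBytes(StandardCharsets.ISO_8859_1);
        // Decode those bytes with the charset they were really written in.
        return new String(bytes, StandardCharsets.UTF_8);
    }
}
```

In a tMap expression this could then be called as CharsetFix.latin1ToUtf8(row1.name), where row1.name is a placeholder column. Only apply it to values that are actually garbled: text that is already correct, or that has already been reduced to "?", will not be repaired and may be damaged.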

Rilhia Solutions
Six Stars

Re: Encoding UTF-8 and ISO-8859

The SQL Server database uses the SQL_Latin1_General_CP1_CI_AS collation, but I don't understand why special characters are handled correctly when I insert rows directly...

I think the only solution is to create a new database with a suitable charset.

Six Stars

Re: Encoding UTF-8 and ISO-8859

Thanks for your answer, rhall_2_0, but I'm a newbie lol.
It's a little bit complicated for me.