Four Stars

Handling special characters

Hi Guys,


I need to transform special characters like "á" . Whenever i read these characters and create a output file it shows "" . Please help how these characters can be handled and it should be loaded into the output. I am stuck badly with this .




  • Big Data
  • Data Integration
Seven Stars TRF
Seven Stars

Re: Handling special characters


Try using regex:

row1.theStringYouWishToTransform.replaceAll("[^\\w]", "")

Which means, replace any non-word characters (any character outside from [a-zA-Z_0-9]).


If it doesn't matches with your requirements, you can specifiy the characters to replaced bt yourself:


row1.theStringYouWishToTransform.replaceAll("[àâäéèêëîïôöùûü]", "")

Which means, replace these characters (àâäéèêëîïôöùûü) by nothing.

You just have to complete the list of characters you want to remove.



Four Stars

Re: Handling special characters

Thanks for the reply.

But my requirement is not to replace special character with empty string. It is to load the special characters to output file/table with same size as input file and data should not get trimmed.

In my case, it is populating special characters as empty string but I want to know how Talend handles special characters. 



Six Stars

Re: Handling special characters

Hi @srkalakonda,


I had encountered a similiar issue. 


make sure your source and target files are with the same encoding.


If you use UTF-8 character encoding this should not occur. 




Tags (2)