Four Stars

Handling special characters

Hi Guys,

 

I need to handle special characters like "á". Whenever I read these characters and write them to an output file, they show up as "" (an empty string). Please help me understand how these characters can be handled so that they are loaded into the output unchanged. I am badly stuck on this.

 

Thanks,

Srinath

  • Big Data
  • Data Integration
3 REPLIES
Nine Stars TRF

Re: Handling special characters

Hi,

Try using regex:

row1.theStringYouWishToTransform.replaceAll("[^\\w]", "")

This replaces every non-word character, that is, any character outside [a-zA-Z_0-9]. Note that this also removes spaces and punctuation.

 

If that doesn't match your requirements, you can specify the characters to be replaced yourself:

 

row1.theStringYouWishToTransform.replaceAll("[àâäéèêëîïôöùûü]", "")

This replaces these characters (àâäéèêëîïôöùûü) with nothing.

You just have to complete the list of characters you want to remove.
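To make the difference between the two expressions above concrete, here is a minimal standalone Java sketch. The class name, method names and sample string are made up for illustration; in a Talend job the same `replaceAll` call would sit on a column such as `row1.theStringYouWishToTransform`.

```java
public class StripCharsDemo {
    // Removes every character outside [a-zA-Z_0-9]; by default Java's \w
    // does not match accented letters, so they are dropped along with spaces.
    public static String stripNonWord(String s) {
        return s.replaceAll("[^\\w]", "");
    }

    // Removes only the characters listed explicitly in the character class.
    public static String stripListed(String s) {
        return s.replaceAll("[àâäéèêëîïôöùûü]", "");
    }

    public static void main(String[] args) {
        String input = "café à Paris"; // hypothetical sample value

        // Accented letters and both spaces are removed.
        System.out.println(stripNonWord(input)); // cafParis

        // Only the listed accented letters are removed; the two spaces remain.
        System.out.println(stripListed(input)); // caf  Paris
    }
}
```

Both variants delete characters, so they only fit cases where losing the accented letters is acceptable.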

 

 


TRF
Four Stars

Re: Handling special characters

Thanks for the reply.

But my requirement is not to replace special characters with an empty string. It is to load the special characters into the output file/table at the same size as in the input file, without the data getting trimmed.

In my case, the special characters are being written as empty strings, and I want to know how Talend handles special characters.

 

Thanks

Six Stars

Re: Handling special characters

Hi @srkalakonda,

 

I encountered a similar issue.

 

Make sure your source and target files use the same encoding.

 

If both are read and written as UTF-8, this should not occur.
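Since Talend jobs are generated as Java code, the encoding advice above can be illustrated with a small plain-Java sketch. It is not Talend's own implementation: the class and method names are hypothetical, and ISO-8859-1 is only an assumed source encoding chosen for the demo. It shows how "á" is lost when bytes are decoded with the wrong charset and preserved when the charset matches.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingDemo {
    // Decodes raw bytes into a String using the given charset.
    public static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // "á" (U+00E1) stored as ISO-8859-1 is the single byte 0xE1.
        byte[] latin1Bytes = "\u00e1".getBytes(StandardCharsets.ISO_8859_1);

        // Matching charset: the character survives.
        String ok = decode(latin1Bytes, StandardCharsets.ISO_8859_1);
        System.out.println(ok.equals("\u00e1")); // true

        // Mismatched charset: 0xE1 alone is not valid UTF-8,
        // so Java substitutes the replacement character U+FFFD.
        String broken = decode(latin1Bytes, StandardCharsets.UTF_8);
        System.out.println(broken.equals("\uFFFD")); // true

        // Writing the correctly decoded text as UTF-8 yields two bytes (0xC3 0xA1).
        byte[] utf8Bytes = ok.getBytes(StandardCharsets.UTF_8);
        System.out.println(utf8Bytes.length); // 2
    }
}
```

In a job, the equivalent step would be setting the encoding option on the input component to the file's real encoding, and the encoding on the output component to whatever the target expects.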

 

Cheers!

Gatha
