DOCX from Oracle db LONGRAW to tFileOutputRaw

Four Stars

DOCX from Oracle db LONGRAW to tFileOutputRaw


I'm trying to select files stored in Oracle db as LONGRAW and write them to file.

I'm using tFileOutputRaw and it is working fine, for example with DOC and PDF files.


But newer Microsoft Office files are not working, for example DOCX and XLSX.

When i try to open the file Word says "Word found content that could not be read. Do you want to restore the contents of the document?"

If I press "Yes". Document opens fine. This does not happens if file is opened from database not using Talend.


My job look like this:




What do I need to get this working?


Re: DOCX from Oracle db LONGRAW to tFileOutputRaw


The tFileOutputExcel component with 'Write excel2007 file format(xlsx)' is able to read XLSX and the tFileOutputDelimited component will create .doc files.

Best regards


Don't forget to give kudos when a reply is helpful and click Accept the solution when you think you're good with it.
Four Stars

Re: DOCX from Oracle db LONGRAW to tFileOutputRaw

I have now find out that the problem is in the data. There is two extra 0-characters  at the end om the LONGRAW stored in the database.

How do I remove this in Talend before my tFileOutputRaw-component?

Cloud Free Trial

Try Talend Cloud free for 30 days.


Introduction to Talend Open Studio for Data Integration.

Definitive Guide to Data Integration

Practical steps to developing your data integration strategy.

Definitive Guide to Data Quality

Create systems and workflow to manage clean data ingestion and data transformation.