I'm trying to select files stored in Oracle db as LONGRAW and write them to file.
I'm using tFileOutputRaw and it is working fine, for example with DOC and PDF files.
But newer Microsoft Office files are not working, for example DOCX and XLSX.
When i try to open the file Word says "Word found content that could not be read. Do you want to restore the contents of the document?"
If I press "Yes". Document opens fine. This does not happens if file is opened from database not using Talend.
My job look like this:
What do I need to get this working?
The tFileOutputExcel component with 'Write excel2007 file format(xlsx)' is able to read XLSX and the tFileOutputDelimited component will create .doc files.
I have now find out that the problem is in the data. There is two extra 0-characters at the end om the LONGRAW stored in the database.
How do I remove this in Talend before my tFileOutputRaw-component?
Try Talend Cloud free for 30 days.
Introduction to Talend Open Studio for Data Integration.
Practical steps to developing your data integration strategy.
Create systems and workflow to manage clean data ingestion and data transformation.