I'm trying to select files stored in Oracle db as LONGRAW and write them to file.
I'm using tFileOutputRaw and it is working fine, for example with DOC and PDF files.
But newer Microsoft Office files are not working, for example DOCX and XLSX.
When i try to open the file Word says "Word found content that could not be read. Do you want to restore the contents of the document?"
If I press "Yes". Document opens fine. This does not happens if file is opened from database not using Talend.
My job look like this:
What do I need to get this working?
The tFileOutputExcel component with 'Write excel2007 file format(xlsx)' is able to read XLSX and the tFileOutputDelimited component will create .doc files.
I have now find out that the problem is in the data. There is two extra 0-characters at the end om the LONGRAW stored in the database.
How do I remove this in Talend before my tFileOutputRaw-component?
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Pick up some tips and tricks with Context Variables
Learn how media organizations have achieved success with Data Integration
Create systems and workflow to manage clean data ingestion and data transformation.