tFileInputExcel error NotOfficeXmlFileException

Hi,

I'm getting an error when trying to iterate over and load multiple Excel 2007 files into an AWS Aurora database. I set up the Excel metadata with 'Read excel2007 file format' checked.

My job flow is as follows:

tFileList --> tFileInputExcel --> tMap --> tAmazonAuroraOutput.

tFileList points to a folder that contains only 6 Excel files. The job reads all the files and loads them into the database fine, but after loading the last file it fails as shown below:

 

Exception in component tFileInputExcel_1 (Load_M1_Excel_Files)
org.apache.poi.openxml4j.exceptions.NotOfficeXmlFileException: No valid entries or contents found, this is not a valid OOXML (Office Open XML) file
at org.apache.poi.openxml4j.opc.ZipPackage.getPartsImpl(ZipPackage.java:286)
at org.apache.poi.openxml4j.opc.OPCPackage.getParts(OPCPackage.java:758)
at org.apache.poi.openxml4j.opc.OPCPackage.open(OPCPackage.java:261)
at org.apache.poi.openxml4j.opc.OPCPackage.open(OPCPackage.java:187)
at org.apache.poi.POIXMLDocument.openPackage(POIXMLDocument.java:90)
at org.apache.poi.xssf.usermodel.XSSFWorkbook.<init>(XSSFWorkbook.java:336)

1 ACCEPTED SOLUTION

Re: tFileInputExcel error NotOfficeXmlFileException

TRF,

Your hint prompted me to put a tLogRow after tFileList, and it showed 2 extra temporary Excel files, ~file1.xlsx and ~file2.xlsx, even though these files do not appear in the folder. I added an exclude filemask of "~*.xlsx" in the Advanced settings of tFileList, and the job now runs without error. Thanks!
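
For anyone who wants to see what the include/exclude mask amounts to, here is a minimal sketch in plain Java. It is not Talend's generated code; the class name ExcelFileFilter and its methods are made up for illustration, and it uses the JDK's glob matcher as a stand-in for the tFileList filemask. The temp files are possibly the hidden lock files Excel creates while a workbook is open (normally named ~$name.xlsx), but that is an assumption the thread does not confirm.

```java
import java.nio.file.FileSystems;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Talend: mimics an include mask "*.xlsx"
// combined with an exclude mask "~*.xlsx".
final class ExcelFileFilter {
    private static final PathMatcher INCLUDE =
            FileSystems.getDefault().getPathMatcher("glob:*.xlsx");
    private static final PathMatcher EXCLUDE =
            FileSystems.getDefault().getPathMatcher("glob:~*.xlsx");

    // True when the file name matches the include mask and not the exclude mask.
    static boolean shouldLoad(String fileName) {
        Path name = Paths.get(fileName).getFileName();
        return INCLUDE.matches(name) && !EXCLUDE.matches(name);
    }

    // Keeps only the names that should be passed on to the Excel reader.
    static List<String> filter(List<String> fileNames) {
        List<String> kept = new ArrayList<>();
        for (String fileName : fileNames) {
            if (shouldLoad(fileName)) {
                kept.add(fileName);
            }
        }
        return kept;
    }
}
```

With the names from this thread, filter would keep file1.xlsx and drop ~file1.xlsx and ~file2.xlsx, which is the same effect as the exclude filemask in tFileList.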

3 REPLIES

Re: tFileInputExcel error NotOfficeXmlFileException

Carefully check your tFileList mask; it looks like you're trying to open a file that is not a regular xlsx file.

TRF
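
A quick way to test whether a given file could be a regular xlsx at all: an xlsx file is a ZIP container, so it normally starts with the ZIP local file header signature (bytes 50 4B 03 04). The sketch below checks only that; the class name OoxmlSniffer and its method are hypothetical, and passing the check does not guarantee that POI can open the file. It is a rough pre-check, not what tFileInputExcel does internally.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

// Hypothetical helper: heuristic check that a file starts like a ZIP archive.
final class OoxmlSniffer {
    // ZIP local file header signature: 'P' 'K' 0x03 0x04
    private static final byte[] ZIP_MAGIC = {0x50, 0x4B, 0x03, 0x04};

    static boolean looksLikeZip(Path file) throws IOException {
        byte[] head = new byte[ZIP_MAGIC.length];
        int total = 0;
        try (InputStream in = Files.newInputStream(file)) {
            // Read until the header buffer is full or the file ends.
            while (total < head.length) {
                int n = in.read(head, total, head.length - total);
                if (n < 0) {
                    break;
                }
                total += n;
            }
        }
        // Files shorter than 4 bytes (including empty ones) cannot match.
        return total == head.length && Arrays.equals(head, ZIP_MAGIC);
    }
}
```

Files that fail this check (empty files, lock files, renamed text files) are good candidates for the exclude mask or for a closer look.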

Re: tFileInputExcel error NotOfficeXmlFileException

Fine!
Don't forget to mark this case as solved.
Kudos also accepted!

TRF