AnsweredAssumed Answered

Excel Input Step Error Excel 2007 XLSX (Apache POI Streaming)

Question asked by Matthew Kennedy on Apr 14, 2019
Latest reply on Apr 15, 2019 by Matthew Kennedy

I have a transformation that has been tested locally (Windows Environment) and runs successfully. When I promote to the repository and try to run from the Pentaho Server I receive errors about not being able open the zip entry source stream.

 

Has anyone seen this before and have tips on fixing it?

 

The Excel file is on the local file system in both environments.

 

 

 

 

The Server environment is Ubuntu 16.04

Pentaho Server 8.2 CE

OpenJDK 8

Repository is MySQL 5.7

 

Below is the Error Log from the server

 

jb_load_customer - Starting entry [SQL_Create tmp_shamrock_customers table]

jb_load_customer - Starting entry [ct-tr_customer_file_to_table]

ct-tr_customer_file_to_table - Using run configuration [Pentaho local]

ct-tr_customer_file_to_table - Using legacy execution engine

tr_customer_file_to_table - Dispatching started for transformation [tr_customer_file_to_table]

to-write to tmp_shamrock_customers.0 - Connected to database [shamrock] (commit=1000)

(version 8.2.0.0-342, build 8.2.0.0-342 from 2018-11-14 10.30.55 by buildguy) : Error processing row from Excel file [/home/pentaho/projects/adi-etl/shamrock/file-mgmt/customer/Customer_dimension.xlsx] : org.pentaho.di.core.exception.KettleException:

org.apache.poi.openxml4j.exceptions.InvalidOperationException: Could not open the specified zip entry source stream

Could not open the specified zip entry source stream

[org.pentaho.di] 2019/04/15 01:03:01 - mec-get-customer-data.0 - ERROR (version 8.2.0.0-342, build 8.2.0.0-342 from 2018-11-14 10.30.55 by buildguy) : org.pentaho.di.core.exception.KettleException:

org.apache.poi.openxml4j.exceptions.InvalidOperationException: Could not open the specified zip entry source stream

Could not open the specified zip entry source stream

 

 

at org.pentaho.di.trans.steps.excelinput.poi.PoiWorkbook.<init>(PoiWorkbook.java:81)

at org.pentaho.di.trans.steps.excelinput.WorkbookFactory.getWorkbook(WorkbookFactory.java:41)

at org.pentaho.di.trans.steps.excelinput.ExcelInput.getRowFromWorkbooks(ExcelInput.java:552)

at org.pentaho.di.trans.steps.excelinput.ExcelInput.processRow(ExcelInput.java:432)

at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)

at java.lang.Thread.run(Thread.java:748)

Caused by: org.apache.poi.openxml4j.exceptions.InvalidOperationException: Could not open the specified zip entry source stream

at org.apache.poi.openxml4j.opc.ZipPackage.openZipEntrySourceStream(ZipPackage.java:205)

at org.apache.poi.openxml4j.opc.ZipPackage.openZipEntrySourceStream(ZipPackage.java:187)

at org.apache.poi.openxml4j.opc.ZipPackage.openZipEntrySourceStream(ZipPackage.java:161)

at org.apache.poi.openxml4j.opc.ZipPackage.<init>(ZipPackage.java:142)

at org.apache.poi.openxml4j.opc.OPCPackage.open(OPCPackage.java:295)

at org.apache.poi.ss.usermodel.WorkbookFactory.create(WorkbookFactory.java:264)

at org.apache.poi.ss.usermodel.WorkbookFactory.create(WorkbookFactory.java:226)

at org.apache.poi.ss.usermodel.WorkbookFactory.create(WorkbookFactory.java:205)

at org.pentaho.di.trans.steps.excelinput.poi.PoiWorkbook.<init>(PoiWorkbook.java:73)

... 5 more

Caused by: java.util.zip.ZipException: invalid block type

at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:164)

at java.util.zip.ZipInputStream.read(ZipInputStream.java:194)

at org.apache.poi.openxml4j.util.ZipSecureFile$ThresholdInputStream.read(ZipSecureFile.java:220)

at java.io.FilterInputStream.read(FilterInputStream.java:107)

at org.apache.poi.openxml4j.util.ZipInputStreamZipEntrySource$FakeZipEntry.<init>(ZipInputStreamZipEntrySource.java:132)

at org.apache.poi.openxml4j.util.ZipInputStreamZipEntrySource.<init>(ZipInputStreamZipEntrySource.java:56)

at org.apache.poi.openxml4j.opc.ZipPackage.openZipEntrySourceStream(ZipPackage.java:203)

... 13 more

[org.pentaho.di] 2019/04/15 01:03:01 - mec-get-customer-data.0 - Finished processing (I=0, O=0, R=0, W=0, U=0, E=1)

[org.pentaho.di] 2019/04/15 01:03:01 - tr_customer_file_to_table - ERROR (version 8.2.0.0-342, build 8.2.0.0-342 from 2018-11-14 10.30.55 by buildguy) : Errors detected!

 

Thanks,

Matt

Outcomes