Question asked by Peter Jansen on Aug 11, 2018
I have a large (nested) XML file, almost 2 GB.

I would like to convert the XML to CSV data.


In a transformation (PDI CE v8) I use the "Get data from XML" step, point it to the XML file, and on the Content tab I press "Get XPath nodes". PDI pops up a "Reading Document..." dialog, and after about 10 minutes an error occurs: "GC overhead limit exceeded".


I tried setting PENTAHO_DI_JAVA_OPTIONS="-Xms4096m -Xmx6g -XX:MaxPermSize=512m" to enlarge the memory, but it doesn't have any effect.
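
For reference, a minimal sketch of how such a setting is usually applied, assuming a Linux or macOS shell and the stock spoon.sh launcher (both are assumptions; the platform and launch method are not stated above):

```shell
# Assumption: the variable is exported in the same shell session that starts Spoon.
# The values are the ones quoted above.
export PENTAHO_DI_JAVA_OPTIONS="-Xms4096m -Xmx6g -XX:MaxPermSize=512m"

# Print the value to confirm it is visible to child processes.
echo "PENTAHO_DI_JAVA_OPTIONS=$PENTAHO_DI_JAVA_OPTIONS"

# Then launch Spoon from the PDI install directory (path is hypothetical):
# ./spoon.sh
```

If the variable is set in a different session, or Spoon is started some other way, the options may never reach the JVM. Also note that Java 8 ignores -XX:MaxPermSize, so only the -Xms and -Xmx values matter there.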


Any thoughts on this?