Pentaho

Kettle / PDI - can I read an Excel file with lazy conversion?

Jerry Lau posted 01-04-2022 03:21
Hi all,

I am trying to read an xlsx / xls file using Kettle and I would like to read the content exactly as it is in the original file (preserving all formats).
The problem is that Kettle / PDI auto-formats fields such as dates and alphanumeric fields (which are supposed to be text).
E.g. an alphanumeric field can contain the values 123 (numeric) and 'abc' (string), but the numeric value is now converted to 123.0 (decimal).
Date values are read and reformatted into a different format.
I already read all the input fields as String, but it does not help.

Can anyone suggest a solution for this? Thanks!

Jerry
Benedict Faria
Jerry,
What version of Pentaho DI are you using? Presumably you're using the MS Excel Input step. I've just created a single-column xlsx file with mixed data: numerics, strings and combinations of both. PDI returned them to a Dummy step exactly as they were in the original spreadsheet. Could you share your transformation and some sample data?
Jerry Lau
Hi Benedict,

I am using version 9.2.
Please find the attachment.

Jerry
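
A note on the 123 to 123.0 symptom from the original question: Excel stores numeric cells as floating-point numbers, and PDI is a Java application, so a plain number-to-text conversion with no format mask would plausibly produce the trailing ".0". The sketch below only illustrates that effect in plain Java; it is not PDI's actual code, and the class name `CellTextSketch`, its methods and the mask `0.##########` are hypothetical choices made for this example.

```java
import java.text.DecimalFormat;
import java.text.DecimalFormatSymbols;
import java.util.Locale;

// Hypothetical helper for illustration only; not part of PDI.
class CellTextSketch {

    // Default double-to-text conversion: keeps a fractional part,
    // so an integer-valued cell such as 123 comes out as "123.0".
    static String defaultText(double cellValue) {
        return String.valueOf(cellValue);
    }

    // Conversion with an explicit mask: decimals are optional,
    // so an integer-valued cell comes out without the trailing ".0".
    static String maskedText(double cellValue) {
        DecimalFormat mask = new DecimalFormat(
                "0.##########", DecimalFormatSymbols.getInstance(Locale.ROOT));
        return mask.format(cellValue);
    }

    public static void main(String[] args) {
        double cellValue = 123.0; // how a cell displaying 123 is stored
        System.out.println(defaultText(cellValue)); // prints 123.0
        System.out.println(maskedText(cellValue));  // prints 123
    }
}
```

If the same idea applies inside the step, the place to try it would presumably be a format mask on the affected field in the Excel input step's field settings; that is an assumption to verify against the transformation, not a confirmed fix. The sketch does not cover the date reformatting issue.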