Pentaho

 View Only
  • 1.  " Get file with SFTP " Job add filenames to result, how does it work?

    Posted 10-21-2022 12:06
    As below image shows i want to send the filenames it processed to next job by selecting the "Add filenames to result" , what would be the stream name, can get the stream in other job using "get rows from result "?   using pentaho 9.3 version



    ------------------------------
    Kartik Rao
    Systems Engineer
    Pepsico
    ------------------------------


  • 2.  RE: " Get file with SFTP " Job add filenames to result, how does it work?

    Posted 10-21-2022 13:24
    Hi Kartik.
    When this job entry downloads files from server, adds target files to job result. For manipulating with files in result, look at to File management section where are job entries which have result in name.

    I don't known job entry "Get rows from result". I known transformation step with this name. But rows in result are another thing. If you want to get files from result, use step: "Get files from result".

    ------------------------------
    Petr Prochazka
    Systems Engineer
    P.V.A. systems s.r.o.
    ------------------------------



  • 3.  RE: " Get file with SFTP " Job add filenames to result, how does it work?

    Posted 02-23-2023 09:42

    Hey Petr, could you provide a little more guidance in hooking up the results from "Get file with SFTP" to a transformation with "Get files from result"? I'm using those two steps but cannot get the transformation to use the filenames.

    More details here: pentaho - How do I pass the results of "Get File with SFTP" to "GZIP CSV input" in PDI? - Stack Overflow



    ------------------------------
    Nathan Van Dyken
    Systems Engineer
    Pipestone
    ------------------------------



  • 4.  RE: " Get file with SFTP " Job add filenames to result, how does it work?

    Posted 02-27-2023 06:10
      |   view attached

    Hi Nathan,

    I look at problem in configuration of trans entry in job. You have checked Execute every input row, but rows not stored in result of job. This configuration uncheck.

    Second is, input field of GZIP CSV Input is wrong, use field path (filename is name of file, path is full path of file).

    I attached to this post example which read one file from samples directory and process via CSV Input. Important information are from step Write to Log where is information about file from result set.

    2023/02/27 12:06:09 - Write to log.0 -
    2023/02/27 12:06:09 - Write to log.0 - ------------> Linenr 1------------------------------
    2023/02/27 12:06:09 - Write to log.0 - Metadata about file
    2023/02/27 12:06:09 - Write to log.0 -
    2023/02/27 12:06:09 - Write to log.0 - comment = null
    2023/02/27 12:06:09 - Write to log.0 - filename = customers-100.txt
    2023/02/27 12:06:09 - Write to log.0 - origin = Add filenames to result
    2023/02/27 12:06:09 - Write to log.0 - parentorigin = result_files
    2023/02/27 12:06:09 - Write to log.0 - path = file:///D:/usr/share/pentaho/data-integration/samples/transformations/files/customers-100.txt
    2023/02/27 12:06:09 - Write to log.0 - timestamp = 2023/02/27 12:06:09.580
    2023/02/27 12:06:09 - Write to log.0 - type = General
    2023/02/27 12:06:09 - Write to log.0 -
    2023/02/27 12:06:09 - Write to log.0 - ====================

    For you is interesting path and origin (step|job entry which add file to result).



    ------------------------------
    Petr Prochazka
    Systems Engineer
    P.V.A. systems s.r.o.
    ------------------------------

    Attachment(s)

    zip
    result_files.zip   3 KB 1 version