How to preserve tagged field after Read Lines?

Question

My goal is to:

1. Input a .tar file, containing a CSV file

2. Tag it with a unique ID, based on the filename (or something else TBD)

3. Extract the tar file

4. Read Lines from the extracted CSV

5. Output to indexes, including the unique ID that was added in step 2
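To make the intended end result concrete, here is a minimal sketch of the five steps in plain Python, outside HCI. It does not use any HCI API; the function name `tag_lines_from_tar`, the output field names (`unique_id`, `source_member`, `line_number`, `fields`), and the filename-based ID scheme are all assumptions made for illustration.

```python
import csv
import io
import tarfile


def tag_lines_from_tar(tar_bytes, archive_name):
    """Yield one dict per CSV line, each carrying the archive-level ID.

    Illustration only: the names and the ID scheme are hypothetical.
    """
    # Step 2: derive the ID from the archive file name (assumed scheme).
    unique_id = archive_name.rsplit("/", 1)[-1]
    if unique_id.endswith(".tar"):
        unique_id = unique_id[: -len(".tar")]

    # Step 3: open the (uncompressed) tar archive from memory.
    with tarfile.open(fileobj=io.BytesIO(tar_bytes), mode="r:") as archive:
        for member in archive.getmembers():
            if not member.isfile() or not member.name.endswith(".csv"):
                continue
            text = archive.extractfile(member).read().decode("utf-8")
            # Step 4: read the CSV line by line.
            rows = csv.reader(io.StringIO(text))
            for line_number, row in enumerate(rows, start=1):
                # Step 5: every output record keeps the ID from step 2.
                yield {
                    "unique_id": unique_id,
                    "source_member": member.name,
                    "line_number": line_number,
                    "fields": row,
                }
```

The point of the sketch is that the ID is computed once per archive and copied explicitly onto every line-level record, which is the behavior being asked for from the workflow.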

I've added a Tagging stage (step 2) to the Extraction pipeline (step 3), to tag the unique ID. But, when I test the workflow, the Read Lines stage does not seem to copy the tagged field into the new documents it generates. (If it matters, I have recursion on.)

Is it possible for a tag added to a document to be applied to the resulting documents, when the original passes through the Read Lines stage? Or should I find another way to accomplish this (perhaps using HCI_parentUri, which seems to contain the URL of the original .tar file)?
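As a rough sketch of the HCI_parentUri fallback, the logic for deriving the ID from the parent URL could look like the following. This is plain Python rather than HCI stage code, and it assumes (as observed above) that HCI_parentUri on each generated document holds the URL of the original .tar file. The helper names `id_from_parent_uri` and `inherit_parent_id`, the `unique_id` field, and the example URL are hypothetical.

```python
import posixpath
from urllib.parse import unquote, urlparse


def id_from_parent_uri(parent_uri):
    """Return the parent's file name without its extension.

    Assumption: the file name of the original .tar is unique enough
    to serve as the ID.
    """
    path = unquote(urlparse(parent_uri).path)
    file_name = posixpath.basename(path)
    stem, _extension = posixpath.splitext(file_name)
    return stem


def inherit_parent_id(child_doc):
    """Return a copy of a line-level document with the derived ID added.

    `child_doc` is modeled here as a plain dict of field name to value.
    """
    tagged = dict(child_doc)
    tagged["unique_id"] = id_from_parent_uri(child_doc["HCI_parentUri"])
    return tagged


# Hypothetical example URL, for illustration only.
example = inherit_parent_id(
    {"HCI_parentUri": "https://ns.tenant.hcp.example.com/rest/logs/batch-001.tar"}
)
```

With this approach, the ID does not need to survive the Read Lines stage at all, because each generated document can recompute it from a field it already carries.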

#HitachiContentIntelligenceHCI

Answer

You can log into the HALO lab (labs.hds.com) under HCI-training or HCI-salesdemo. We have an example pipeline for this called "ReadHDICIFSLOGS"; you can look at the pipeline there, or export it via AW and use it as your template. The CSV file is in the HCP NS, along with a custom index.

Troy

