We are trying to optimize our index schema and want to avoid indexing unnecessary fields. We are struggling to understand which date fields in HCI correspond to which fields in the original documents.
For a PDF document in particular, we have found many date fields. Some come from the file's system metadata and some come from the PDF document itself. We need to understand the best practice for selecting the valuable fields, which in our case are:
- Creation date of the document
- Last Modified date of the document and
- Last Access Date of the document.
Regarding the Last Access Date, it seems that the field HCI_accessDateString is a timestamp set at the moment HCI accessed the file. Is this correct?
Please see some of our questions in the screenshot below:
Could somebody shed some light on this, please?
Thanks and regards,