The generic problem with all the "semi-structured" documents is that
the meta-data has no consistent naming. Making up names here, but Word
might have "created_on", PDF "created" etc. Its really frustrating,
but each type has to be investigated to figure out which field you
want to map to "created"
Sorry if my question was too vague. In my mind it wasn't but you led me in the
right direction which gave me a new issue.
I added the following to my schema.xml to bring back the Created Date: but now I am getting
back the created date for PDF files but not for Word documents (specifically
Not really sure what you're asking here, I must be missing something.
The mapping is through the field name supplied, so as long as your input
XML has something like
your date here
it should be fine.
You can use "date math" here as well, as:
NOW
Best,
Erick
On Tue, Apr