Hi there! I am quite new to Lucene/Solr/Tika, etc., so I would appreciate you help concerning the following matter.
I have a RTF-document, that I want to index in Solr, using Tika. The RTF-indexing works in general, but since I changed the Solr-schema, the indexer complains about missing mandatory fields, like "module-id". The rtf-file is generated by me and I added the metadata-fields to the RTF-document in the "userprops"-section of the RTF-file (see below) -- so Tika should be able to read it and to provide it. The problem is: I don't know HOW or WHERE Tika provides this metadata, so I don't know how to access it. As a result, I don't know how I can map it to the respective Solr-fields, like "module-id", that are mandatory in my Solr-schema. Can someone give me a hint, please? I am running out of ideas here ... :-/ <RTF-file> {\rtf1\fbidis\ansi\ansicpg1252\deff0\deflang1031{\fonttbl{\f0\fnil\fcharset0 Arial;}} {\colortbl ;\red0\green0\blue0;} {\userprops {\propname module-id}\proptype30{\staticval 000ba8a6} } } Mit freundlichen Grüßen/ With kind regards Jan Schluchtmann Systems Engineering Cluster Instruments VW Group Continental Automotive GmbH Division Interior ID S3 RM VDO-Strasse 1, 64832 Babenhausen, Germany Telefon/Phone: +49 6073 12-4346 Telefax: +49 6073 12-79-4346