The logical place to do that (if you cannot do outside of Solr) would
be in an UpdateRequestProcessor.
Unfortunately, there is no TikaExtract URP though other similar ones
exist (e.g. for language guessing). The full list is here:
http://www.solr-start.com/info/update-request-processors/
But you
Thanks for the response.
I am afraid I can't use the DataImportHandler. I do the indexation using an
Indexation Service joining data from several places.
I have a final xml with plenty of data and one of them is the rtf field.
That's the xml I send to Solr using the /update. I am guessing if it w
I think DataImportHandler with nested entity (JDBC, then Tika with
FieldReaderDataSource) should do the trick.
Have you tried that?
Regards,
Alex.
Solr Example reading group is starting November 2016, join us at
http://j.mp/SolrERG
Newsletter and resources for Solr beginners and intermedi