Re: RTF Rich text format

2016-11-14 Thread Alexandre Rafalovitch
The logical place to do that (if you cannot do it outside of Solr) would be in an UpdateRequestProcessor. Unfortunately, there is no TikaExtract URP, though other similar ones exist (e.g. for language guessing). The full list is here: http://www.solr-start.com/info/update-request-processors/ But you

Re: RTF Rich text format

2016-11-14 Thread Sergio García Maroto
Thanks for the response. I am afraid I can't use the DataImportHandler. I do the indexing with an indexing service that joins data from several places. The final XML holds plenty of data, and one of its fields is the RTF field. That is the XML I send to Solr via /update. I am guessing if it w
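For illustration only, here is a rough sketch of the "outside of Solr" route: the indexing service converts the RTF to plain text itself and then builds the XML it posts to /update. The RTF-to-text conversion is not shown and is assumed to have happened already. The function name `build_update_xml` and the field name `content_txt` are made up for this example and would need to match the actual schema.

```python
import xml.etree.ElementTree as ET


def build_update_xml(doc_id, plain_text, text_field="content_txt"):
    """Build a Solr XML update message for one document.

    Assumption: plain_text was already extracted from the RTF source by
    the indexing service (extraction itself is not shown here).
    text_field is a hypothetical field name, not one from the thread.
    """
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    ET.SubElement(doc, "field", name="id").text = doc_id
    ET.SubElement(doc, "field", name=text_field).text = plain_text
    # ElementTree escapes &, < and > in the text, so the payload stays well-formed.
    return ET.tostring(add, encoding="unicode")


xml_payload = build_update_xml(
    "doc-1", "Plain text extracted from the RTF source & ready to index"
)
print(xml_payload)
```

The resulting string is what would go in the body of the POST to /update, with a Content-Type of text/xml.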

Re: RTF Rich text format

2016-11-14 Thread Alexandre Rafalovitch
I think DataImportHandler with a nested entity (JDBC, then Tika with FieldReaderDataSource) should do the trick. Have you tried that? Regards, Alex.