Thanks for the response. I am afraid I can't use the DataImportHandler. I do the indexation using an Indexation Service joining data from several places.
I have a final xml with plenty of data and one of them is the rtf field. That's the xml I send to Solr using the /update. I am guessing if it would be possible Solr to do it with a tokenizer filter or something like that. On 14 November 2016 at 16:24, Alexandre Rafalovitch <arafa...@gmail.com> wrote: > I think DataImportHandler with nested entity (JDBC, then Tika with > FieldReaderDataSource) should do the trick. > > Have you tried that? > > Regards, > Alex. > ---- > Solr Example reading group is starting November 2016, join us at > http://j.mp/SolrERG > Newsletter and resources for Solr beginners and intermediates: > http://www.solr-start.com/ > > > On 15 November 2016 at 03:19, marotosg <marot...@gmail.com> wrote: > > Hi, > > > > I have a use case where I need to index information coming from a > database > > where there is a field which contains rich text format. I would like to > > convert that text into simple plain text, same as tika does when indexing > > documents. > > > > Is there any way to achive that having a field only where i sent this > rich > > text and then Solr cleans that data? I can't find anyhting so far. > > > > Thanks > > Sergio > > > > > > > > -- > > View this message in context: http://lucene.472066.n3. > nabble.com/RTF-Rich-text-format-tp4305778.html > > Sent from the Solr - User mailing list archive at Nabble.com. >