Thanks for the response.

I am afraid I can't use the DataImportHandler. I do the indexation using an
Indexation Service joining data from several places.

I have a final xml with plenty of data and one of them is the rtf field.
That's the xml I send to Solr using the /update. I am guessing if it would
be possible Solr to do it with a tokenizer filter or something like that.

On 14 November 2016 at 16:24, Alexandre Rafalovitch <arafa...@gmail.com>
wrote:

> I think DataImportHandler with nested entity (JDBC, then Tika with
> FieldReaderDataSource) should do the trick.
>
> Have you tried that?
>
> Regards,
>    Alex.
> ----
> Solr Example reading group is starting November 2016, join us at
> http://j.mp/SolrERG
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
>
>
> On 15 November 2016 at 03:19, marotosg <marot...@gmail.com> wrote:
> > Hi,
> >
> > I have a use case where I need to index information coming from a
> database
> > where there is a field which contains rich text format. I would like to
> > convert that text into simple plain text, same as tika does when indexing
> > documents.
> >
> > Is there any way to achive that having a field only where i sent this
> rich
> > text and then Solr cleans that data? I can't find anyhting so far.
> >
> > Thanks
> > Sergio
> >
> >
> >
> > --
> > View this message in context: http://lucene.472066.n3.
> nabble.com/RTF-Rich-text-format-tp4305778.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
>

Reply via email to