Solr: extracting/indexing HTML via cURL

okayndc Mon, 30 Apr 2012 07:08:16 -0700

Hello,

Over the weekend I experimented with extracting HTML content via cURL and
just
wondering why the extraction/indexing process does not include the HTML
tags.
It seems as though the HTML tags either being ignored or stripped somewhere
in the pipeline.
If this is the case, is it possible to include the HTML tags, as I would
like to keep the
formatted HTML intact?


Any help is greatly appreciated.

Solr: extracting/indexing HTML via cURL

Reply via email to