On 05/10/2007, at 8:45 AM, Mike Klaas wrote:
> In general, I don't recommend indexing HTML content straight to
> Solr. None of the Solr contributors do this so the use case hasn't
> received a lot of love.
We're indexing XHTML straight to Solr and it's working great so far.
> I'm actually somewhat surprised that several people are interested
> in this but none have been sufficiently interested to
> implement a solution to contribute:
> http://issues.apache.org/jira/browse/SOLR-42
Didn't know there was a problem to solve. We're a fair way off
actually playing with highlighting but I'll keep an eye on this for
when we get to it.
> -Mike
Thanks,
Adrian Sutton
http://www.symphonious.net