On 05/10/2007, at 8:45 AM, Mike Klaas wrote:
In general, I don't recommend indexing HTML content straight to Solr. None of the Solr contributors do this so the use case hasn't received a lot of love.

We're indexing XHTML straight to Solr and it's working great so far.

I'm actually somewhat surprised that several people are interested in this but none have have been sufficiently interested to implement a solution to contribute:

http://issues.apache.org/jira/browse/SOLR-42

Didn't know there was a problem to solve. We're a fair way off actually playing with highlighting but I'll keep an eye on this for when we get to it.

-Mike

Thanks,

Adrian Sutton
http://www.symphonious.net

Reply via email to