On Fri, 2010-10-01 at 12:00 +1000, Scott Yeadon wrote:
> Hi,
>
> The problem is that the article text is HTML and Solr appears to strip
> the HTML by default.

I think what you need to look at is how the fields are defined in your schema. If data sent as HTML is added to a field of the standard html-text type and stored, then by default the HTML is stripped and the words are indexed. If you want to keep the raw HTML, then maybe you should store that instead of the stripped version, and only index the stripped text.

-- 
Regards,
Neil Lunn
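
As a rough sketch of the suggestion above, a schema.xml fragment might look something like the following. The field names (article_html, article_text) and the type name (html_text) are made up for illustration, and it assumes a "string" type is already defined as in the example schema shipped with Solr. The raw HTML goes into a stored-only field, and a copyField feeds the same value into an indexed-only field whose analyzer strips the markup.

```xml
<!-- Hypothetical field type: strips HTML markup at analysis time only -->
<fieldType name="html_text" class="solr.TextField">
  <analyzer>
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Hypothetical field: raw HTML, returned in results but not searched -->
<field name="article_html" type="string" indexed="false" stored="true"/>

<!-- Hypothetical field: searchable text, markup stripped, not stored -->
<field name="article_text" type="html_text" indexed="true" stored="false"/>

<!-- Copy the raw value into the indexed field when a document is added -->
<copyField source="article_html" dest="article_text"/>
```

With something like this in place, searches would run against article_text while the original markup would be read back from article_html. Adjust the analyzer chain to whatever your existing text type uses.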