Re: Stripping html from text before indexing to solr

2014-10-10 Thread Vishal Sharma
Oh gotcha. Thanks for that! *Vishal Sharma**TL, Grazitti Interactive*T: +1 650­ 641 1754 E: vish...@grazitti.com www.grazitti.com [image: Description: LinkedIn] [image: Description: Twitter] [image: fbook]

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
Yes, your plain string queries will automatically match in index. This is always true. If you don't strip html, html tags are considered part of the document and would cause false matches. For example q=bold,code,class, etc. On Friday, October 10, 2014 12:35 AM, Vishal Sharma wrote: I think

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Vishal Sharma
I think I dint get you completely. I am really sorry for asking this again. New to solr world :) Are you saying if I don't strip html my plain string queries will automatically match in index? *Vishal Sharma**TL, Grazitti Interactive*T: +1 650­ 641 1754 E: vish...@grazitti.com www.grazitti.com [i

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
It depends on you, if you strip html using a char filter, it won't match htm tags. But the original document, when requested using fl= parameter, will be html. If you do not strip html at all, q=html will return all documents. Ahmet On Friday, October 10, 2014 12:01 AM, Vishal Sharma wrote:

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Vishal Sharma
Ahmet, So if its not necessary to strip html. Are you saying that plain text query strings will automatically match the html content indexed to solr? *Vishal Sharma**TL, Grazitti Interactive*T: +1 650­ 641 1754 E: vish...@grazitti.com www.grazitti.com [image: Description: LinkedIn]

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
Hi Vishal, Stripping html is not mandatory. Solr indexes it just like other text. By the way, there are to places where you can strip html. i) at analysis : char filter ii) before analysis : Update processor, html strip transformer Ahmet On Thursday, October 9, 2014 11:50 PM, Vishal Sharma