Re: Filtering HTML content in Solr 4.0.0

2012-10-26 Thread Rafał Kuć
u can also use scripting languages like Python, Ruby and > JavaScript to write scripts for use as update processors too. > -Original Message- > From: Pratyul Kapoor > Sent: Friday, October 26, 2012 3:56 AM > To: solr-user@lucene.apache.org > Subject: Filtering HTML content in

Re: Filtering HTML content in Solr 4.0.0

2012-10-26 Thread Rogério Pereira Araújo
- From: Pratyul Kapoor Sent: Friday, October 26, 2012 3:56 AM To: solr-user@lucene.apache.org Subject: Filtering HTML content in Solr 4.0.0 Hi, I am using Solr 4.0.0. I have HTML content as the description of a product. If I index it without any filtering, it gives errors on search. How can I filter

Re: Filtering HTML content in Solr 4.0.0

2012-10-26 Thread Rafał Kuć
Hello! You are trying to put the HTML into the XML sent to Solr, right? You should use the proper UTF-8 encoding to do that. For example, look at the utf8-example.xml file from the exampledocs directory that comes with Solr and you'll see something like this: tag with escaped chars: As you ca
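
For anyone reading this thread later: the escaping mentioned in the reply above could be done on the client before the XML is posted to Solr. The following is only a minimal sketch in Python, not something taken from the thread. The field names `id` and `description`, the helper name `build_add_xml`, and the sample values are assumptions made up for illustration.

```python
from xml.sax.saxutils import escape


def build_add_xml(doc_id, description_html):
    """Build a Solr <add> XML message with the HTML escaped as plain field text.

    The field names "id" and "description" are hypothetical; use the ones
    defined in your own schema.
    """
    return (
        "<add><doc>"
        f'<field name="id">{escape(doc_id)}</field>'
        # escape() turns &, < and > into &amp;, &lt; and &gt; so the markup
        # no longer breaks the surrounding XML document.
        f'<field name="description">{escape(description_html)}</field>'
        "</doc></add>"
    )


if __name__ == "__main__":
    print(build_add_xml("prod-1", "<p>Fast & <b>light</b></p>"))
```

Note that this only makes the update message well-formed XML. The stored value still contains the HTML markup, so removing the tags themselves (for example at analysis time or in an update processor, as suggested elsewhere in the thread) is a separate step.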

Filtering HTML content in Solr 4.0.0

2012-10-26 Thread Pratyul Kapoor
Hi, I am using Solr 4.0.0. I have HTML content as the description of a product. If I index it without any filtering, it gives errors on search. How can I filter HTML content? Pratyul