Re: Mapping and Capture in ExtractingRequestHandler

2011-12-21 Thread Erick Erickson
it to construct a Solr document ? > > Thanks and Regards, > Swapna. > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Wednesday, December 21, 2011 2:28 AM > To: solr-user@lucene.apache.org > Subject: Re: Mapping and Capture in Extract

RE: Mapping and Capture in ExtractingRequestHandler

2011-12-20 Thread Swapna Vuppala
@lucene.apache.org Subject: Re: Mapping and Capture in ExtractingRequestHandler When you start getting into complex HTML extraction, you're probably better off using a SolrJ program with a forgiving HTML parser and extracting the relevant bits yourself and construction a SolrDocument. FWIW,

Re: Mapping and Capture in ExtractingRequestHandler

2011-12-20 Thread Erick Erickson
When you start getting into complex HTML extraction, you're probably better off using a SolrJ program with a forgiving HTML parser and extracting the relevant bits yourself and construction a SolrDocument. FWIW, Erick On Tue, Dec 20, 2011 at 12:54 AM, Swapna Vuppala wrote: > Hi, > > I understand

Mapping and Capture in ExtractingRequestHandler

2011-12-19 Thread Swapna Vuppala
Hi, I understand that we can specify parameters in ExtractingRequestHandler in solrconfig.xml to capture HTML tags of a particular type and map them to desired solr fields, like something below. div mysolrfield The above setting will capture content in "div" tags and copy to the solr field "m