Re: how to present html content in browse

2012-05-04 Thread Lance Norskog
> >>> >>> >>> Which has: >>> >>> >>>   >>>   >>>  >> mapping="mapping-** >>> ISOLatin1Accent.txt"/> >>>   >>>   >>>   >>>   >>> >>>   >&g

Re: how to present html content in browse

2012-05-04 Thread okayndc
/browse/SOLR-42> > > > -- Jack Krupansky > > -Original Message- From: okayndc > Sent: Friday, May 04, 2012 4:35 PM > > To: solr-user@lucene.apache.org > Subject: Re: how to present html content in browse > > Is it possible to return the HTML field highlighted?

Re: how to present html content in browse

2012-05-04 Thread Jack Krupansky
can add HTMLStripCharFilterFactory to some other field type that you might want to use, but this "charFilter" needs to be before the "tokenizer". The "text" field type above is just an example. -- Jack Krupansky -Original Message- From: okayndc Sent: Friday, May

Re: how to present html content in browse

2012-05-04 Thread okayndc
nt to call that field type "text_stripped" to avoid > confusion with a simple text field > > You can add HTMLStripCharFilterFactory to some other field type that you > might want to use, but this "charFilter" needs to be before the > "tokenizer". The "

Re: how to present html content in browse

2012-05-04 Thread Jack Krupansky
kenizer". The "text" field type above is just an example. -- Jack Krupansky -----Original Message- From: okayndc Sent: Friday, May 04, 2012 1:01 PM To: solr-user@lucene.apache.org Subject: Re: how to present html content in browse Hello, I'm having a hard time und

Re: how to present html content in browse

2012-05-04 Thread okayndc
Hello, I'm having a hard time understanding this, and I had this same question. When using DIH should the HTML field be stored in the raw HTML string field or the stripped field? Also what source field(s) need to be copied and to what destination? Thanks On Thu, May 3, 2012 at 10:15 PM, Lance

Re: how to present html content in browse

2012-05-03 Thread Lance Norskog
Make two fields, one with stores the stripped HTML and another that stores the parsed HTML. You can use so that you do not have to submit the html page twice. You would mark the stripped field 'indexed=true stored=false' and the full text field the other way around. The full text field should be