RE: Strip HTML Tags and Store

2016-07-14 Thread Kalpana
html-strip-features From: Markus Jelsma-2 [via Lucene] [mailto:ml-node+s472066n4287069...@n3.nabble.com] Sent: Wednesday, July 13, 2016 6:08 PM To: Kalpana Sivanandan Subject: RE: Strip HTML Tags

RE: Strip HTML Tags and Store

2016-07-13 Thread Kalpana
+s472066n4287069...@n3.nabble.com] Sent: Wednesday, July 13, 2016 6:08 PM To: Kalpana Sivanandan Subject: RE: Strip HTML Tags and Store Hello - just as mentioned in the thread: > Add to your solrconfig: > > > > features > > > > And point your update

RE: Strip HTML Tags and Store

2016-07-13 Thread Markus Jelsma
alpana > Sent: Wednesday 13th July 2016 23:34 > To: solr-user@lucene.apache.org > Subject: RE: Strip HTML Tags and Store > > Hello > > I am trying to do something similar, I want to strip the html tags before > being stored. Where do I add this code in Sol

RE: Strip HTML Tags and Store

2016-07-13 Thread Kalpana
Hello I am trying to do something similar, I want to strip the html tags before being stored. Where do I add this code in Solrconfig.xml? which section? Thanks much Kalpana -- View this message in context: http://lucene.472066.n3.nabble.com/Strip-HTML-Tags-and-Store-tp4067208p4287064.html

Re: Strip HTML Tags and Store

2013-05-31 Thread Jack Krupansky
Great. That was an example from the book. -- Jack Krupansky -Original Message- From: Kalyan Kuram Sent: Friday, May 31, 2013 4:04 PM To: solr-user@lucene.apache.org Subject: RE: Strip HTML Tags and Store Thanks it worked..!! From: j...@basetechnology.com To: solr-user

RE: Strip HTML Tags and Store

2013-05-31 Thread Kalyan Kuram
Thanks it worked..!! > From: j...@basetechnology.com > To: solr-user@lucene.apache.org > Subject: Re: Strip HTML Tags and Store > Date: Thu, 30 May 2013 22:53:37 -0400 > > Update Request Processors to the rescue again. Namely, the HTML Strip Field > Update processor: >

Re: Strip HTML Tags and Store

2013-05-30 Thread Jack Krupansky
or... some other options. See: http://lucene.apache.org/solr/4_3_0/solr-core/org/apache/solr/update/processor/FieldMutatingUpdateProcessorFactory.html -- Jack Krupansky -Original Message- From: Kalyan Kuram Sent: Thursday, May 30, 2013 8:18 PM To: solr-user@lucene.apache.org Subject: Strip HTML Ta

Strip HTML Tags and Store

2013-05-30 Thread Kalyan Kuram
Hi AllI am trying to understand what gets stored when i configure a field indexed and stored for example i have this in my schema.xmland I was expecting that solr will

Re: Strip html

2012-06-05 Thread Tigunn
I resolve my problem: I had to specify the field to return with my query. Thanks A LOT for your help ! -- View this message in context: http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p398.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Strip html

2012-06-05 Thread Tigunn
r" snipped by html or xml tags like exemple). Look: http://lucene.472066.n3.nabble.com/file/n3987731/recherche_solr_tei.jpg The php file: Thanks you for your help. -- View this message in context: http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p3987731.html Sent from the Solr

Re: Strip html

2012-06-01 Thread Tigunn
the xslt do that Jack Krupansky-2 wrote > > The bottom line is that you will need to have your own code that will > detect > the "choice" tag and map it to the desired choice, and you will have to do > that before you &qu

Re: Strip html

2012-06-01 Thread Jack Krupansky
The bottom line is that you will need to have your own code that will detect the "choice" tag and map it to the desired choice, and you will have to do that before you "strip" html. So, given:

Re: Strip html

2012-06-01 Thread Tigunn
ml, it's a xsl sheet (not mine, but i can change xsl sheets to generate a text whitout html: i want to try). For information xslt1.0 return for the exemple: You can notice : the word "castors" is break by html tag 2 - I want to strip html tags before indexing. i try in php:

Re: Strip html

2012-06-01 Thread Jack Krupansky
doesn't use the "strip HTML filter". And you probably want the French light stemmer to allow search on "castor" to match "castors". Show us the schema with field types and an actual input document that you post to Solr. Unfortunately, we may still be confus

Re: Strip html

2012-06-01 Thread Tigunn
7;t work. i want to put in index not :"castors" or "c astors" or again "astors" but "castors". -- View this message in context: http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p3987232.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Strip html

2012-05-31 Thread Chris Hostetter
: I make a transformation XSLT which return : : --- : si les ruches d’abeilles prouvent la : monarchie, les fourmillières, les troupes d’éléphants ou : de castors prouvent la république. : --- : i put this ht

Fwd: Strip html

2012-05-31 Thread Michael Della Bitta
a preprocessing step. Michael Della Bitta Appinions, Inc. -- Where Influence Isn’t a Game. http://www.appinions.com -Original Message- From: Tigunn Sent: Thursday, May 31, 2012 11:30 AM To: solr-user@lucene.apache.org Subject: Strip htm

Re: Strip html

2012-05-31 Thread Jack Krupansky
There is no option in the Strip HTML filter to discard whitespace between elements. And it certainly doesn't know the semantics of some XML schema for "choice". You'll have to pre-process that semantics before Solr ingestion, or do your own custom filter. -- Jack Krup

Strip html

2012-05-31 Thread Tigunn
ddField('body_strip_html', $body_norm); In schema.xml: AND But this don't work! I want to return this xml files (look exemple) if i search "castor". Can you help me, please? thanks. -- View this message in context: http://lucene.472066.n3.nabb

Re: strip html from data

2011-08-15 Thread Merlin Morgenstern
2011/8/11 Ahmet Arslan > > Is there a way to strip the html tags completly and not > > index them? If not, > > how to I retrieve the results without html tags? > > How do you push documents to solr? You need to strip html tags before the > analysis chain. For exa

Re: strip html from data

2011-08-13 Thread Erick Erickson
;> >>>> On 07/25/2011 12:01 PM, Markus Jelsma wrote: >> >>>> >> >>>> >> >>>>> charFilters are executed first regardless of their position in the >> >>>>> analyzer. >> >>>>> >&

Re: strip html from data

2011-08-11 Thread Alexei Martchenko
>>>>> charFilters are executed first regardless of their position in the > > >>>>> analyzer. > > >>>>> > > >>>>> On Monday 25 July 2011 17:53:59 Mike Sokolov wrote: > > >>>>> &g

Re: strip html from data

2011-08-11 Thread Ahmet Arslan
> Is there a way to strip the html tags completly and not > index them? If not, > how to I retrieve the results without html tags? How do you push documents to solr? You need to strip html tags before the analysis chain. For example, if you are using Data Import Handler, yo

Re: strip html from data

2011-08-11 Thread Merlin Morgenstern
;>>>> > >>>>>> -Mike > >>>>>> > >>>>>> On 07/25/2011 09:03 AM, Merlin Morgenstern wrote: > >>>>>> > >>>>>> > >>>>>>> sounds logical. I just changed

Re: strip html from data

2011-08-09 Thread Erick Erickson
gt;            >>>>>> >>>>>>> positionIncrementGap="100" autoGeneratePhraseQueries="**true"> >>>>>>> >>>>>>>                   >>>>>>> >>>>>>>                    

Re: strip html from data

2011-08-08 Thread Merlin Morgenstern
uot;0" splitOnCaseChange="1"/> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>

Re: strip html from data

2011-07-25 Thread Mike Sokolov
. You need to add the char filter to the index-time analyzer. On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote: Hi there, I am trying to strip html tags from the data before adding the documents to the index.

Re: strip html from data

2011-07-25 Thread Markus Jelsma
>>> > >>>>>> class="solr.HTMLStripCharFilterFactory"/> > >>> > >>> > >>> >

Re: strip html from data

2011-07-25 Thread Mike Sokolov
ld be? 2011/7/25 Markus Jelsma You've three analyzer elements, i wonder what that would do. You need to add the char filter to the index-time analyzer. On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote: Hi there, I am trying to strip html tags from the data before

Re: strip html from data

2011-07-25 Thread Markus Jelsma
> > > > > > > > > > Unfortunatelly that did not fix the error. There are still tags > > inside the data. Although I believe there are viewer then before but I > > can not prove that. Fact is, there a

Re: strip html from data

2011-07-25 Thread Mike Sokolov
? 2011/7/25 Markus Jelsma You've three analyzer elements, i wonder what that would do. You need to add the char filter to the index-time analyzer. On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote: Hi there, I am trying to strip html tags from the data before addin

Re: strip html from data

2011-07-25 Thread Markus Jelsma
t; > > > > > 2011/7/25 Markus Jelsma > > > You've three analyzer elements, i wonder what that would do. You need to > > add > > the char filter to the index-time analyzer. > > > > On Monday 25 July 2011 13:09:1

Re: strip html from data

2011-07-25 Thread Merlin Morgenstern
4 Merlin Morgenstern wrote: > > Hi there, > > > > I am trying to strip html tags from the data before adding the documents > to > > the index. To do that I altered schem.xml like this: > > > > &

Re: strip html from data

2011-07-25 Thread Markus Jelsma
You've three analyzer elements, i wonder what that would do. You need to add the char filter to the index-time analyzer. On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote: > Hi there, > > I am trying to strip html tags from the data before adding the documents to > the