html-strip-features
From: Markus Jelsma-2 [via Lucene]
[mailto:ml-node+s472066n4287069...@n3.nabble.com]
Sent: Wednesday, July 13, 2016 6:08 PM
To: Kalpana Sivanandan
Subject: RE: Strip HTML Tags
+s472066n4287069...@n3.nabble.com]
Sent: Wednesday, July 13, 2016 6:08 PM
To: Kalpana Sivanandan
Subject: RE: Strip HTML Tags and Store
Hello - just as mentioned in the thread:
> Add to your solrconfig:
>
>
>
> features
>
>
>
>
And point your update
alpana
> Sent: Wednesday 13th July 2016 23:34
> To: solr-user@lucene.apache.org
> Subject: RE: Strip HTML Tags and Store
>
> Hello
>
> I am trying to do something similar, I want to strip the html tags before
> being stored. Where do I add this code in Sol
Hello
I am trying to do something similar, I want to strip the html tags before
being stored. Where do I add this code in Solrconfig.xml? which section?
Thanks much
Kalpana
--
View this message in context:
http://lucene.472066.n3.nabble.com/Strip-HTML-Tags-and-Store-tp4067208p4287064.html
Great. That was an example from the book.
-- Jack Krupansky
-Original Message-
From: Kalyan Kuram
Sent: Friday, May 31, 2013 4:04 PM
To: solr-user@lucene.apache.org
Subject: RE: Strip HTML Tags and Store
Thanks it worked..!!
From: j...@basetechnology.com
To: solr-user
Thanks it worked..!!
> From: j...@basetechnology.com
> To: solr-user@lucene.apache.org
> Subject: Re: Strip HTML Tags and Store
> Date: Thu, 30 May 2013 22:53:37 -0400
>
> Update Request Processors to the rescue again. Namely, the HTML Strip Field
> Update processor:
>
or... some
other options. See:
http://lucene.apache.org/solr/4_3_0/solr-core/org/apache/solr/update/processor/FieldMutatingUpdateProcessorFactory.html
-- Jack Krupansky
-Original Message-
From: Kalyan Kuram
Sent: Thursday, May 30, 2013 8:18 PM
To: solr-user@lucene.apache.org
Subject: Strip HTML Ta
Hi AllI am trying to understand what gets stored when i configure a field
indexed and stored for example i have this in my schema.xmland
I was expecting that solr will
I resolve my problem:
I had to specify the field to return with my query.
Thanks A LOT for your help !
--
View this message in context:
http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p398.html
Sent from the Solr - User mailing list archive at Nabble.com.
r"
snipped by html or xml tags like exemple). Look:
http://lucene.472066.n3.nabble.com/file/n3987731/recherche_solr_tei.jpg
The php file:
Thanks you for your help.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p3987731.html
Sent from the Solr
the xslt do that
Jack Krupansky-2 wrote
>
> The bottom line is that you will need to have your own code that will
> detect
> the "choice" tag and map it to the desired choice, and you will have to do
> that before you &qu
The bottom line is that you will need to have your own code that will detect
the "choice" tag and map it to the desired choice, and you will have to do
that before you "strip" html.
So, given:
ml, it's a xsl sheet (not mine, but i can change
xsl sheets to generate a text whitout html: i want to try).
For information xslt1.0 return for the exemple:
You can notice : the word "castors" is break by html tag
2 - I want to strip html tags before indexing.
i try in php:
doesn't use the "strip HTML filter". And you
probably want the French light stemmer to allow search on "castor" to match
"castors".
Show us the schema with field types and an actual input document that you
post to Solr.
Unfortunately, we may still be confus
7;t work.
i want to put in index not :"castors" or "c astors" or again "astors" but
"castors".
--
View this message in context:
http://lucene.472066.n3.nabble.com/Strip-html-tp3987051p3987232.html
Sent from the Solr - User mailing list archive at Nabble.com.
: I make a transformation XSLT which return :
: ---
: si les ruches d’abeilles prouvent la
: monarchie, les fourmillières, les troupes d’éléphants ou
: de castors prouvent la république.
: ---
: i put this ht
a
preprocessing step.
Michael Della Bitta
Appinions, Inc. -- Where Influence Isn’t a Game.
http://www.appinions.com
-Original Message- From: Tigunn
Sent: Thursday, May 31, 2012 11:30 AM
To: solr-user@lucene.apache.org
Subject: Strip htm
There is no option in the Strip HTML filter to discard whitespace between
elements. And it certainly doesn't know the semantics of some XML schema for
"choice". You'll have to pre-process that semantics before Solr ingestion,
or do your own custom filter.
-- Jack Krup
ddField('body_strip_html', $body_norm);
In schema.xml:
AND
But this don't work!
I want to return this xml files (look exemple) if i search "castor".
Can you help me, please?
thanks.
--
View this message in context:
http://lucene.472066.n3.nabb
2011/8/11 Ahmet Arslan
> > Is there a way to strip the html tags completly and not
> > index them? If not,
> > how to I retrieve the results without html tags?
>
> How do you push documents to solr? You need to strip html tags before the
> analysis chain. For exa
;> >>>> On 07/25/2011 12:01 PM, Markus Jelsma wrote:
>> >>>>
>> >>>>
>> >>>>> charFilters are executed first regardless of their position in the
>> >>>>> analyzer.
>> >>>>>
>&
>>>>> charFilters are executed first regardless of their position in the
> > >>>>> analyzer.
> > >>>>>
> > >>>>> On Monday 25 July 2011 17:53:59 Mike Sokolov wrote:
> > >>>>>
&g
> Is there a way to strip the html tags completly and not
> index them? If not,
> how to I retrieve the results without html tags?
How do you push documents to solr? You need to strip html tags before the
analysis chain. For example, if you are using Data Import Handler, yo
;>>>>
> >>>>>> -Mike
> >>>>>>
> >>>>>> On 07/25/2011 09:03 AM, Merlin Morgenstern wrote:
> >>>>>>
> >>>>>>
> >>>>>>> sounds logical. I just changed
gt; >>>>>>
>>>>>>> positionIncrementGap="100" autoGeneratePhraseQueries="**true">
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
uot;0" splitOnCaseChange="1"/>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> >>
. You need
to add
the char filter to the index-time analyzer.
On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote:
Hi there,
I am trying to strip html tags from the data before adding the
documents
to
the index.
>>>
> >>>>>> class="solr.HTMLStripCharFilterFactory"/>
> >>>
> >>>
> >>>
>
ld be?
2011/7/25 Markus Jelsma
You've three analyzer elements, i wonder what that would do. You need to
add
the char filter to the index-time analyzer.
On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote:
Hi there,
I am trying to strip html tags from the data before
> >
> >
> >
> >
> > Unfortunatelly that did not fix the error. There are still tags
> > inside the data. Although I believe there are viewer then before but I
> > can not prove that. Fact is, there a
?
2011/7/25 Markus Jelsma
You've three analyzer elements, i wonder what that would do. You need to
add
the char filter to the index-time analyzer.
On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote:
Hi there,
I am trying to strip html tags from the data before addin
t;
>
>
>
>
> 2011/7/25 Markus Jelsma
>
> > You've three analyzer elements, i wonder what that would do. You need to
> > add
> > the char filter to the index-time analyzer.
> >
> > On Monday 25 July 2011 13:09:1
4 Merlin Morgenstern wrote:
> > Hi there,
> >
> > I am trying to strip html tags from the data before adding the documents
> to
> > the index. To do that I altered schem.xml like this:
> >
> > &
You've three analyzer elements, i wonder what that would do. You need to add
the char filter to the index-time analyzer.
On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote:
> Hi there,
>
> I am trying to strip html tags from the data before adding the documents to
> the
34 matches
Mail list logo