Thanks again for your input.
In fact I already preprocess the data (concatenation of only the content
values) and index it into another field.
But my general problem is the following: My data has such a cryptic format
and I have to search only within the content values. Therefore I preprocess
it
ope better with straight Java code than cryptic regular
expression sequences.
-- Jack Krupansky
-----Original Message-----
From: jasimop
Sent: Thursday, May 30, 2013 12:46 AM
To: solr-user@lucene.apache.org
Subject: Re: Problem with PatternReplaceCharFilter
Honestly, I have no idea how to do that.
PatternReplaceCharFilter doesn't seem to have a parameter like
preservePositions="true", optionally with fillCharacter=" ".
And I don't think I can express this simply as a regex. How would I count, in a
pure regex, the length difference before and after the match?
Just replace the stripped markup with the equivalent number of spaces to
maintain positions.
Was there some specific problem you were encountering?
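
For illustration, here is a minimal sketch of that idea in plain Java, outside of Solr. It assumes the markup can be matched by a single pattern; the pattern `<[^>]*>`, the class name and the helper `blankOut` are hypothetical and only stand in for whatever the real format needs. Each match is replaced by a run of spaces of the same length, so the remaining text keeps its original character offsets. How to hook this into an analysis chain (for example as a custom char filter) is not shown here.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SpacePaddingReplace {

    // Hypothetical helper (not part of Solr or Lucene): replaces every match
    // of `pattern` with the same number of spaces, so the output has the same
    // length as the input and unmatched characters keep their offsets.
    public static String blankOut(String input, Pattern pattern) {
        Matcher m = pattern.matcher(input);
        StringBuilder out = new StringBuilder(input.length());
        int last = 0;
        while (m.find()) {
            // Copy the text between the previous match and this one unchanged.
            out.append(input, last, m.start());
            // Emit one space per matched character.
            for (int i = m.start(); i < m.end(); i++) {
                out.append(' ');
            }
            last = m.end();
        }
        // Copy whatever follows the last match.
        out.append(input, last, input.length());
        return out.toString();
    }

    public static void main(String[] args) {
        // Assumed markup pattern, for demonstration only.
        Pattern markup = Pattern.compile("<[^>]*>");
        String in = "<b>foo</b> bar";
        String out = blankOut(in, markup);
        System.out.println("[" + in + "]");
        System.out.println("[" + out + "]");
    }
}
```

Because the length never changes, no offset bookkeeping is needed afterwards, which is the part that is awkward to express as a single regex replacement string.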
-- Jack Krupansky
-----Original Message-----
From: jasimop
Sent: Wednesday, May 29, 2013 4:12 PM
To: solr-user@lucene.apache.org
Subject: Proble