Re: Problem with PatternReplaceCharFilter

2013-05-31 Thread jasimop
Thanks again for your input. In fact, I already preprocess the data (concatenating only the content values) and index it into another field. But my general problem is the following: my data has a cryptic format, and I have to search only within the content values. Therefore I preprocess it

Re: Problem with PatternReplaceCharFilter

2013-05-30 Thread Jack Krupansky
ope better with straight Java code than cryptic regular expression sequences.
-- Jack Krupansky
-----Original Message-----
From: jasimop
Sent: Thursday, May 30, 2013 12:46 AM
To: solr-user@lucene.apache.org
Subject: Re: Problem with PatternReplaceCharFilter
Honestly, I have no idea how

Re: Problem with PatternReplaceCharFilter

2013-05-29 Thread jasimop
Honestly, I have no idea how to do that. PatternReplaceCharFilter doesn't seem to have a parameter like preservePositions="true" and optionally fillCharacter=" ". And I don't think I can express this simply as a regex. How would I count, in a pure regex, the length difference before and after the match

Re: Problem with PatternReplaceCharFilter

2013-05-29 Thread Jack Krupansky
Just replace the stripped markup with the equivalent number of spaces to maintain positions. Was there some specific problem you were encountering?
-- Jack Krupansky
-----Original Message-----
From: jasimop
Sent: Wednesday, May 29, 2013 4:12 PM
To: solr-user@lucene.apache.org
Subject: Proble
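
For illustration, the "replace the markup with the same number of spaces" idea could be sketched in plain Java roughly as follows. This is only a sketch under assumptions: the class name PositionPreservingStrip, the method blankOut, and the angle-bracket markup pattern are all hypothetical (the actual data format is not shown in the thread), and this is not a Solr API. It only uses java.util.regex, and wiring it into an indexing pipeline is left out.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PositionPreservingStrip {
    // Hypothetical markup pattern (assumption): anything between '<' and '>'.
    // Replace it with a pattern that matches the real format.
    private static final Pattern MARKUP = Pattern.compile("<[^>]*>");

    /**
     * Replaces every match of the pattern with as many spaces as the match
     * is long, so the remaining text keeps its original character offsets.
     */
    public static String blankOut(String input, Pattern pattern) {
        Matcher m = pattern.matcher(input);
        StringBuilder out = new StringBuilder(input.length());
        int last = 0;
        while (m.find()) {
            // Copy the untouched text between the previous match and this one.
            out.append(input, last, m.start());
            // Pad with one space per matched character.
            for (int i = m.start(); i < m.end(); i++) {
                out.append(' ');
            }
            last = m.end();
        }
        // Copy whatever follows the final match.
        out.append(input, last, input.length());
        return out.toString();
    }

    public static void main(String[] args) {
        String raw = "<a>foo</a> <b>bar</b>";
        String cleaned = blankOut(raw, MARKUP);
        System.out.println("[" + cleaned + "]");
        // The length, and therefore every character offset, is unchanged.
        System.out.println(cleaned.length() == raw.length());
    }
}
```

The loop is what a single replacement string cannot express directly: the number of fill characters depends on the length of each individual match, so it is computed per match from m.start() and m.end().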