Chris Hostetter wrote:
>
>
> patches for issues can't be applied until someone who cares about them
> write them and contribute them for committers to consider/apply :)
>
>
it seems I'm one of the very few people who care about this feature :)
Unfortunately my daily languages are c++ and c#
: The suggested fix from Mirko seems very simple. Hopefull a patch will be
: applied
: very soon. In the meantime, I'll use my backup solution:
patches for issues can't be applied until someone who cares about them
write them and contribute them for committers to consider/apply :)
-Hoss
Chris Hostetter wrote:
>
>
> It is tracked in http://issues.apache.org/jira/browse/SOLR-42
>
> ...there are currently no patches.
>
>
The suggested fix from Mirko seems very simple. Hopefull a patch will be
applied
very soon. In the meantime, I'll use my backup solution:
http://fucoder.co
oblems with HTML tagged fields
:
:
:
: Yonik Seeley wrote:
: >
: > HTMLStripWhitespaceTokenizerFactory works in two phases...
: > HTMLStripReader removes the HTML and passes the result to
: > WhitespaceTokenizer... at that point, Tokens are generated, but the
: > offsets will corresp
Yonik Seeley wrote:
>
> HTMLStripWhitespaceTokenizerFactory works in two phases...
> HTMLStripReader removes the HTML and passes the result to
> WhitespaceTokenizer... at that point, Tokens are generated, but the
> offsets will correspond to the text after HTML removal, not before.
>
> I did it