The escaping does appear excessive. Please file a bug to the Lucene
project in Apache JIRA.
On Fri, May 26, 2017 at 11:26 AM Michael Joyner wrote:
> Isn't the unified html escaper a rather bit extreme in it's escaping?
>
> It makes it hard to deal with for simple post-processing.
>
> The origin
Hi,
I'm not so sure about the escaping, but to control how much text is
returned as context around the highlighted frag, you can set the following
in solrconfig.xml.
200
This will limit the fragments to consider for highlight to around 200
characters, and it will not return the whole chunk of da
Isn't the unified html escaper a rather bit extreme in it's escaping?
It makes it hard to deal with for simple post-processing.
The original html escaper seems to do minimial escaping, not every
non-alphabetical character it can find.
Also, is there a way to control how much text is returned