Re: The unified highlighter html escaping. Seems rather extreme...

2017-07-20 Thread David Smiley
The escaping does appear excessive. Please file a bug to the Lucene project in Apache JIRA. On Fri, May 26, 2017 at 11:26 AM Michael Joyner wrote: > Isn't the unified html escaper a rather bit extreme in it's escaping? > > It makes it hard to deal with for simple post-processing. > > The origin

Re: The unified highlighter html escaping. Seems rather extreme...

2017-05-28 Thread Zheng Lin Edwin Yeo
Hi, I'm not so sure about the escaping, but to control how much text is returned as context around the highlighted frag, you can set the following in solrconfig.xml. 200 This will limit the fragments to consider for highlight to around 200 characters, and it will not return the whole chunk of da

The unified highlighter html escaping. Seems rather extreme...

2017-05-26 Thread Michael Joyner
Isn't the unified html escaper a rather bit extreme in it's escaping? It makes it hard to deal with for simple post-processing. The original html escaper seems to do minimial escaping, not every non-alphabetical character it can find. Also, is there a way to control how much text is returned