Try setting hl.fragsizeIsMinimum=true. I did some benchmarking and found that this helps quite a bit.
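
As a rough sketch, assuming the same handler defaults as in your original message quoted further down, the setting could go next to the other hl.* entries. It can equally be passed per request as hl.fragsizeIsMinimum=true. The placement is my assumption and I have not tested it against your config:

```xml
<!-- Sketch only: assumes the handler defaults from the original message -->
<str name="hl">true</str>
<str name="hl.method">unified</str>
<!-- Suggested setting: hl.fragsize is treated as a minimum rather than a target length -->
<bool name="hl.fragsizeIsMinimum">true</bool>
```
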
BTW I used the highlights.alg benchmark file, with some changes to make it
more reflective of your scenario: offsets in postings, and the "enwiki"
(English Wikipedia) docs, which are larger than the Reuters ones (so it
appears, anyway). I had to do a bit of hacking to use the
LengthGoalBreakIterator, which wasn't previously used by this framework.

~ David

On Tue, May 26, 2020 at 4:42 PM Michal Hlavac <m...@hlavki.eu> wrote:

> Fine, I'll try to write a simple test, thanks.
>
> On Tuesday, 26 May 2020 17:44:52 CEST David Smiley wrote:
> > > Please create an issue. I haven't reproduced it yet but it seems
> > > unlikely to be user-error.
> > >
> > > ~ David
> > >
> > > On Mon, May 25, 2020 at 9:28 AM Michal Hlavac <m...@hlavki.eu> wrote:
> > >
> > > > Hi,
> > > >
> > > > I have this field:
> > > > <field name="content_txt_sk_highlight" type="txt_sk_lemma_no_accent"
> > > > stored="true" indexed="false" storeOffsetsWithPositions="true"/>
> > > >
> > > > and this configuration:
> > > > <str name="hl">true</str>
> > > > <str name="hl.method">unified</str>
> > > > <bool name="hl.defaultSummary">true</bool>
> > > > <str name="hl.fl">content_txt_sk_highlight</str>
> > > > <int name="hl.snippets">2</int>
> > > > <bool name="spellcheck">true</bool>
> > > >
> > > > A query with hl.bs.type=SENTENCE takes around 1000 - 1300 ms, which
> > > > is really slow.
> > > > The same query with hl.bs.type=WORD takes 8 - 45 ms.
> > > >
> > > > Is this normal behaviour, or should I create an issue?
> > > >
> > > > thanks, m.