facet processing module in Version 6.x needs significantly more time compared to version 4.10

guenterh.li...@bluewin.ch Mon, 21 Aug 2017 06:35:32 -0700

Hi,
I can't figure out the reason why the facet processing in version 6 needs 
significantly more time compared to version 4.
The debugging response (for 30 million documents)
solr 4
<lst name="process"><double name="time">280.0</double>
<lst name="query"><double name="time">0.0</double></lst>
<lst name="facet"><double name="time">280.0</double></lst>
(once the query is cached)
before caching: between 1.5 and 2 sec
solr 6.x (my last try was with 6.6)
without docValues for the faceting fields (same schema as version 4)
<lst name="process"><double name="time">5874.0</double>
<lst name="query"><double name="time">0.0</double></lst>
<lst name="facet"><double name="time">5873.0</double></lst>
<lst name="facet_module"><double name="time">0.0</double></lst>
The time does not improve even after repeating the query several times.
Solr 6.6 with docValues for the faceting fields
<lst name="process"><double name="time">9837.0</double>
<lst name="query"><double name="time">0.0</double></lst>
<lst name="facet"><double name="time">9837.0</double></lst>
<lst name="facet_module"><double name="time">0.0</double></lst>
Query used (against our production system running version 4):
http://search.swissbib.ch/solr/sb-biblio/select?debugQuery=true&q=*:*&facet=true&facet.field=union&facet.field=navAuthor_full&facet.field=format&facet.field=language&facet.field=navSub_green&facet.field=navSubform&facet.field=publishDate&qt=edismax&ps=2&json.nl=arrarr&bf=recip(abs(ms(NOW/DAY,freshness)),3.16e-10,100,100)&fl=*,score&hl.fragsize=250&start=0&q.op=AND&sort=score+desc&rows=0&hl.simple.pre={{{{START_HILITE}}}}&facet.limit=100&hl.simple.post={{{{END_HILITE}}}}&spellcheck=false&qf=title_short^1000+title_alt^200+title_sub^200+title_old^200+title_new^200+author^750+author_additional^100+author_additional_dsv11_txt_mv^100+title_additional_dsv11_txt_mv^100+series^200+topic^500+addfields_txt_mv^50+publplace_txt_mv^25+publplace_dsv11_txt_mv^25+fulltext+callnumber^1000+ctrlnum^1000+publishDate+isbn+variant_isbn_isn_mv+issn+localcode+id&pf=title_short^1000&facet.mincount=1&hl.fl=fulltext&&wt=xml&facet.sort=count
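For readability, the facet-related part of the query above can be rebuilt in isolation with a short script. This is only a sketch: the function name build_facet_query is made up for illustration, and it assumes that dropping the edismax, boost, and highlighting parameters leaves the facet timing unchanged, which has not been verified here.

```python
from urllib.parse import urlencode

# Facet fields copied from the query above.
FACET_FIELDS = [
    "union", "navAuthor_full", "format", "language",
    "navSub_green", "navSubform", "publishDate",
]


def build_facet_query(base_url):
    """Return a URL holding only the facet-related parameters of the query.

    Hypothetical helper for illustration; it is not part of Solr.
    """
    params = [
        ("q", "*:*"),
        ("rows", "0"),
        ("debugQuery", "true"),
        ("facet", "true"),
        ("facet.limit", "100"),
        ("facet.mincount", "1"),
        ("facet.sort", "count"),
        ("wt", "xml"),
    ]
    # A repeated key yields one facet.field parameter per field.
    params += [("facet.field", field) for field in FACET_FIELDS]
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_facet_query("http://search.swissbib.ch/solr/sb-biblio/select"))
```

Sending the same reduced request to the version 4 and version 6 instances would show whether the facet parameters alone reproduce the difference.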
When I run the queries on smaller indices (8 million docs), the difference is 
similar, although the absolute figures for processing time are smaller.
Any hints as to why there is such a huge difference?
Günter
