On Fri, 2016-12-02 at 12:17 +0000, Markus Jelsma wrote:
> I have not considered streaming as i am still completely unfamiliar
> with it and i don't yet know what problems it can solve.

Standard faceting requires every node to produce its version of the
full result and send it as one chunk; the chunks are then merged at the
calling node (plus some other processing). For large results that comes
with a significant memory overhead.

Solr streaming is ... well, streaming: the memory overhead is
practically the same whether you request 10K or 10 billion entries.
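
As a rough illustration of the difference, here is a toy sketch in
Python. It is not how Solr implements either path; the shard data and
function names are made up for the example. The first function gathers
every shard's full result before merging, while the second lazily merges
shard streams that are already sorted by term, so it only holds one
pending entry per shard at a time.

```python
import heapq
from collections import Counter
from itertools import groupby
from operator import itemgetter


def merge_all_at_once(shard_results):
    """Chunked style: every shard's full (term, count) list is in memory."""
    totals = Counter()
    for result in shard_results:
        for term, count in result:
            totals[term] += count
    return sorted(totals.items())


def merge_streaming(shard_streams):
    """Streaming style: each shard yields (term, count) sorted by term.

    heapq.merge pulls one entry at a time from each stream, so memory use
    depends on the number of shards, not on the number of entries.
    """
    merged = heapq.merge(*shard_streams, key=itemgetter(0))
    for term, group in groupby(merged, key=itemgetter(0)):
        yield term, sum(count for _, count in group)


# Hypothetical per-shard facet counts, each already sorted by term.
shard_a = [("apple", 3), ("banana", 1), ("cherry", 4)]
shard_b = [("banana", 2), ("cherry", 1), ("date", 5)]

chunked = merge_all_at_once([shard_a, shard_b])
streamed = list(merge_streaming([iter(shard_a), iter(shard_b)]))
```

Both calls give the same totals; only the streaming variant could hand
entries to the caller before the last shard entry has been read.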

> One simple solution, in my case would be, now just thinking of it,
> run the query with no facets and no rows, get the numFound, and set
> that as facet.limit for the actual query.

That would work with your solution. Still, try issuing a "*:*"-search
and see if it breaks your very large facet request.
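
A minimal sketch of that two-step approach in Python, only building the
request URLs and reading numFound from an already parsed JSON response.
The host, collection name ("mycollection") and field name ("category")
are placeholders I made up; q, rows, facet, facet.field, facet.limit and
wt are the standard select parameters.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical endpoint; adjust host and collection to your setup.
SOLR_SELECT = "http://localhost:8983/solr/mycollection/select"


def count_query_url(query):
    """Step 1: no rows and no facets; only numFound is of interest."""
    params = {"q": query, "rows": 0, "facet": "false", "wt": "json"}
    return SOLR_SELECT + "?" + urlencode(params)


def extract_num_found(response_json):
    """Read numFound from a parsed JSON select response."""
    return response_json["response"]["numFound"]


def facet_query_url(query, field, limit):
    """Step 2: the actual facet request, with facet.limit set to numFound."""
    params = {
        "q": query,
        "rows": 0,
        "facet": "true",
        "facet.field": field,
        "facet.limit": limit,
        "wt": "json",
    }
    return SOLR_SELECT + "?" + urlencode(params)


# Stand-in for the parsed response of the step 1 request (made-up numbers).
fake_response = {"response": {"numFound": 1234, "start": 0, "docs": []}}

step1_url = count_query_url("*:*")
hits = extract_num_found(fake_response)
step2_url = facet_query_url("*:*", "category", hits)
```

Note that for a "*:*" query numFound is the full document count, so the
limit is then as large as it can get.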

> Are there any examples / articles about consuming streaming facets
> with SolrJ? 

Sorry, I have little experience with SolrJ.

- Toke Eskildsen, State and University Library, Denmark
