Sandy,
Export uses a very different approach then the normal select approach.
Export uses an incremental stream sorting approach that won't run out of
memory when sorting very large result sets. And Export does not use stored
fields to return results, it uses docValues caches to return results.
T
I believe export is streaming and it avoids building various caches,
so it will not blow up Solr's memory on large datasets.
You can read a lot more details in the JIRA that introduced it:
https://issues.apache.org/jira/browse/SOLR-5244
I am not sure how it compares with deep-paging though.
Rega
Thanks Alexandre.
I actually need the whole result set. But it is large(perhaps 10m-100m) and
I find select is slow.
How does export differ from select except that select will make distributed
requests and do the merge?
Will select with ‘distrib=false’ have comparable performance with export?
201
Export was specifically designed to get everything which is very
expensive otherwise.
If you just want the subset, you might be better off with normal
queries and/or with deep paging (cursor).
Regards,
Alex.
Sign up for my Solr resources newsletter at http://www.solr-start.com/
On 6 Jan
Using rows=xxx doesn't seem to work.
Is there a way to do this?