Re: How to limit the number of result sets of the 'export' handler

2015-01-07 Thread Joel Bernstein
Sandy, Export uses a very different approach then the normal select approach. Export uses an incremental stream sorting approach that won't run out of memory when sorting very large result sets. And Export does not use stored fields to return results, it uses docValues caches to return results. T

Re: How to limit the number of result sets of the 'export' handler

2015-01-07 Thread Alexandre Rafalovitch
I believe export is streaming and it avoids building various caches, so it will not blow up Solr's memory on large datasets. You can read a lot more details in the JIRA that introduced it: https://issues.apache.org/jira/browse/SOLR-5244 I am not sure how it compares with deep-paging though. Rega

Re: How to limit the number of result sets of the 'export' handler

2015-01-06 Thread Sandy Ding
Thanks Alexandre. I actually need the whole result set. But it is large(perhaps 10m-100m) and I find select is slow. How does export differ from select except that select will make distributed requests and do the merge? Will select with ‘distrib=false’ have comparable performance with export? 201

Re: How to limit the number of result sets of the 'export' handler

2015-01-06 Thread Alexandre Rafalovitch
Export was specifically designed to get everything which is very expensive otherwise. If you just want the subset, you might be better off with normal queries and/or with deep paging (cursor). Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 6 Jan

How to limit the number of result sets of the 'export' handler

2015-01-05 Thread Sandy Ding
Using rows=xxx doesn't seem to work. Is there a way to do this?