On 7/30/2013 12:16 PM, Dotan Cohen wrote:
To search for duplicate IDs, I am running the following query:
select?q=*:*&facet=true&facet.field=id&rows=0

However, since upgrading from Solr 4.1 to Solr 4.3 I am receiving
OutOfMemoryError errors instead of the desired facet:

<snip>

Might there be a less resource-intensive way to get this information.

Add &facet.method=enum to the query URL. This will cause Solr to enumerate the facet information on every query rather than load it into the field cache, which takes a lot of memory. Solr 4.1 was probably very close to running out of memory as well.

If you have enough OS disk cache for your index, the enum method should not cause an enormous slowdown. If you don't have enough OS disk cache, then it can make the facets run very slowly.

Thanks,
Shawn

Reply via email to