Re: iterate through each document in Solr

Mingfeng Yang Mon, 06 May 2013 09:37:35 -0700

Hi Dmitry,

My index is not sharded, and since its size is so big, sharding won't help
much on the paging issue.  Do you know any API which can help read from
lucene binary index directly?     I will be nice if we can just scan
through the docs directly.


Thanks!
Ming-


On Mon, May 6, 2013 at 3:33 AM, Dmitry Kan <solrexp...@gmail.com> wrote:

> Are you doing it once? Is your index sharded? If so, can you ask each shard
> individually?
> Another way would be to do it on Lucene level, i.e. read from the binary
> indices (API exists).
>
> Dmitry
>
>
> On Mon, May 6, 2013 at 5:48 AM, Mingfeng Yang <mfy...@wisewindow.com>
> wrote:
>
> > Dear Solr Users,
> >
> > Does anyone know what is the best way to iterate through each document
> in a
> > Solr index with billion entries?
> >
> > I tried to use  select?q=*:*&start=xx&rows=500  to get 500 docs each time
> > and then change start value, but it got very slow after getting through
> > about 10 million docs.
> >
> > Thanks,
> > Ming-
> >
>

Re: iterate through each document in Solr

Reply via email to