Yes, that is our approach. We did try deleting a day's worth of data at a
time, and that resulted in OOM as well.

Thanks
Vinay


On 14 April 2014 00:27, Furkan KAMACI <furkankam...@gmail.com> wrote:

> Hi;
>
> I mean you can divide the range (i.e. one week at each delete instead of
> one month) and try to check whether you still get an OOM or not.
>
> Thanks;
> Furkan KAMACI
>
>
> 2014-04-14 7:09 GMT+03:00 Vinay Pothnis <poth...@gmail.com>:
>
> > Aman,
> > Yes - Will do!
> >
> > Furkan,
> > How do you mean by 'bulk delete'?
> >
> > -Thanks
> > Vinay
> >
> >
> > On 12 April 2014 14:49, Furkan KAMACI <furkankam...@gmail.com> wrote:
> >
> > > Hi;
> > >
> > > Do you get any problems when you index your data? On the other hand
> > > deleting as bulks and reducing the size of documents may help you not
> to
> > > hit OOM.
> > >
> > > Thanks;
> > > Furkan KAMACI
> > >
> > >
> > > 2014-04-12 8:22 GMT+03:00 Aman Tandon <amantandon...@gmail.com>:
> > >
> > > > Vinay please share your experience after trying this solution.
> > > >
> > > >
> > > > On Sat, Apr 12, 2014 at 4:12 AM, Vinay Pothnis <poth...@gmail.com>
> > > wrote:
> > > >
> > > > > The query is something like this:
> > > > >
> > > > >
> > > > > *curl -H 'Content-Type: text/xml' --data
> '<delete><query>param1:(val1
> > > OR
> > > > > val2) AND -param2:(val3 OR val4) AND date_param:[1383955200000 TO
> > > > > 1385164800000]</query></delete>'
> > > > > 'http://host:port/solr/coll-name1/update?commit=true'*
> > > > >
> > > > > Trying to restrict the number of documents deleted via the date
> > > > parameter.
> > > > >
> > > > > Had not tried the "distrib=false" option. I could give that a try.
> > > Thanks
> > > > > for the link! I will check on the cache sizes and autowarm values.
> > Will
> > > > try
> > > > > and disable the caches when I am deleting and give that a try.
> > > > >
> > > > > Thanks Erick and Shawn for your inputs!
> > > > >
> > > > > -Vinay
> > > > >
> > > > >
> > > > >
> > > > > On 11 April 2014 15:28, Shawn Heisey <s...@elyograg.org> wrote:
> > > > >
> > > > > > On 4/10/2014 7:25 PM, Vinay Pothnis wrote:
> > > > > >
> > > > > >> When we tried to delete the data through a query - say 1
> > day/month's
> > > > > worth
> > > > > >> of data. But after deleting just 1 month's worth of data, the
> > master
> > > > > node
> > > > > >> is going out of memory - heap space.
> > > > > >>
> > > > > >> Wondering is there any way to incrementally delete the data
> > without
> > > > > >> affecting the cluster adversely.
> > > > > >>
> > > > > >
> > > > > > I'm curious about the actual query being used here.  Can you
> share
> > > it,
> > > > or
> > > > > > a redacted version of it?  Perhaps there might be a clue there?
> > > > > >
> > > > > > Is this a fully distributed delete request?  One thing you might
> > try,
> > > > > > assuming Solr even supports it, is sending the same delete
> request
> > > > > directly
> > > > > > to each shard core with distrib=false.
> > > > > >
> > > > > > Here's a very incomplete list about how you can reduce Solr heap
> > > > > > requirements:
> > > > > >
> > > > > > http://wiki.apache.org/solr/SolrPerformanceProblems#
> > > > > > Reducing_heap_requirements
> > > > > >
> > > > > > Thanks,
> > > > > > Shawn
> > > > > >
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > With Regards
> > > > Aman Tandon
> > > >
> > >
> >
>

Reply via email to