maybe this will help you:
http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport


On Tue, Jun 4, 2013 at 8:38 PM, Arun Rangarajan <arunrangara...@gmail.com>wrote:

> Shawn,
>
> Thanks for your reply. My data-config.xml actually has two entities. I sent
> only the first entity in my previous email. Since I had not run any imports
> on the 2nd entity, dataimport.properties did not have an entry for it yet.
> This worked fine in 3.6.2, so looks like a bug in 4.2.1.
>
> For now, I am thinking that I can skip using the dih properties entirely.
> For the first entity, I can look for documents that changed in the last 10
> min in the DB and run the delta import cron job every 10 min. For the 2nd
> entity, the interval is 1 hour. Of course, if one of the delta imports fail
> this approach may skip some documents, but we do full import once a day so
> those docs should eventually catch up. Guess that's the best I can get with
> DIH for now!
>
>
> On Tue, Jun 4, 2013 at 7:05 AM, Shawn Heisey <s...@elyograg.org> wrote:
>
> > On 6/4/2013 7:52 AM, Arun Rangarajan wrote:
> > > I upgraded from Solr 3.6.2 to 4.2.1 and I am noticing that my data
> import
> > > handler's delta import is actually doing a full import.
> >
> > <snip>
> >
> > > What changed and how do I get delta import to only index the documents
> > that
> > > got modified after ${dih.Lists.last_index_time}'?
> >
> > It's a bug.  I've built a test that shows the problem, but I haven't
> > figured out yet how to actually fix it.
> >
> > https://issues.apache.org/jira/browse/SOLR-4788
> >
> > I now have one more data point to add to the mix that I didn't know
> > before - it works in 3.6.2.
> >
> > It looks like you only have the one entity showing a last_indexed_time,
> > so you should be able to use ${dih.last_index_time} instead of
> > ${dih.Lists.last_index_time}.
> >
> > Thanks,
> > Shawn
> >
> >
>



-- 
Regards,
Raheel Hasan

Reply via email to