Please file a JIRA asking for the ability to direct what is saved in the DIH properties file. It should be possible to use your own timestamp or generated id.
On 4/20/10, Shawn Heisey <s...@elyograg.org> wrote: > Michael, > > The SolrEntityProcessor looks very intriguing, but it won't work with > the released 1.4 version. If that's OK with you and it looks like it'll > do what you want, feel free to ignore the rest of this. > > I'm also using MySQL as an import source for Solr. I was unable to use > the last_index_time because my database doesn't have a field I can match > against it. I believe you can use something similar to the method that > I came up with. The point of this post is to show you how to inject > values from outside Solr into a DIH request rather than have Solr > provide the milestone that indicates new content. > > Here's a simplified version of my URL template and entity configuration > in data-config.xml. The did field in my database is an autoincrement > BIGINT serving as my private key, but something similar could likely be > cooked up with timestamps too: > > http://HOST:PORT/solr/CORE/dataimport?command=COMMAND&dataTable=DATATABLE&minDid=MINDID&maxDid=MAXDID > > ---- > > <entity name="dataTable" pk="did" > query="SELECT * FROM ${dataimporter.request.dataTable} WHERE did > > ${dataimporter.request.minDid} AND did <= > ${dataimporter.request.maxDid}" > deltaQuery="SELECT MAX(did) FROM ${dataimporter.request.dataTable}" > deltaImportQuery="SELECT * FROM ${dataimporter.request.dataTable} WHERE > did > ${dataimporter.request.minDid} AND did <= > ${dataimporter.request.maxDid}"> > </entity> > > ---- > > If I am doing a full-import, I set minDid to zero and maxDid to the > highest value in the database. For a delta-import, minDid comes from > the maxDid value stored after the last successful import. > > The deltaQuery is required, but in my case, is a throw-away query that > just tells Solr the delta-import needs to be run. My query and > deltaImportQuery are identical, though yours may not be. > > Good luck, no matter how you choose to approach this. > > Shawn > > > On 4/18/2010 9:02 PM, Michael Tibben wrote: >> I don't really understand how this will help. Can you elaborate ? >> >> Do you mean that the last_index_time can be imported from somewhere >> outside solr? But I need to be able to *set* what last_index_time is >> stored in dataimport.properties, not get properties from somewhere else >> >> >> >> On 18/04/10 10:02, Lance Norskog wrote: >>> The SolrEntityProcessor allows you to query a Solr instance and use >>> the results as DIH properties. You would have to create your own >>> regular query to do the delta-import instead of using the delta-import >>> feature. > > -- Lance Norskog goks...@gmail.com