This sounds a little like my original problem of deltaQuery imports
per entity ...
https://issues.apache.org/jira/browse/SOLR-783
I wonder if those 2 hacks could be combined to fix the issue.
- Jon
On Dec 6, 2008, at 12:29 PM, Marc Sturlese wrote:
Hey there,
I am doing some hacks to some parts of the solr source. I am doing a
feature
for everytime I use delta import hanlder I want it to start geting
info from
the db starting from the last indexed document id (from the latest
execution).
The point of doing that is that if I start a full import and the
process is
aborted for any reason, I want to be able to start a delta import
and start
indexing from the last indexed id of the full import.
To do that basically I have created functions in solrwriter.java and
dataimporter.java. The funcions I have created are the same as the
ones to
write and retrieve the timestamp to the dataimport.properties but
mines do
it with an id (long instead of date).
I call this functions in docbuilder.java (in the places were
functions for
timestamp were created)
I do one more thing... i write in the dataimport.properties every
time I
call the function upload in docbuilder to upload a document.
The problem is that not every time the upload function (in
docbuilder) is
called a commit is called aswell. So, if I kill -9 the process in
the middle
of the execution i will have in the dataimport.properties the last
uploaded
id but in the index (opening it with luke) I will have the last
commited.
I have done some tests calling writer.commit(false) just after the
upload or
setting in solrconfig.xml <maxBufferedDocs>2</maxBufferedDocs>.
With both
it works fine but opiously the indexer works extremely slow.
Is there any way to write in the dataimport.properties
(writer.persistIndexLastID(arow.get("id").toString())) just after
every
commit but not calling myself the commit function? If not, I would
apreciate
any advice about other ways to reach this goals.
If I get it done I will open an issue and upload there the patch
cause I
thing that this can be a common use case.
Thanks in advanced
--
View this message in context:
http://www.nabble.com/Delta-import-hack-to-use-last-indexed-id-document-tp20872450p20872450.html
Sent from the Solr - User mailing list archive at Nabble.com.