On 21 February 2013 19:30, cveres <csabave...@me.com> wrote: > Thanks Gora, > > Sorry I might not have been sufficiently clear. > > I start with an empty index, then add documents. > 9000 are added and 6000 immediately deleted again, leaving 3000. > I assume this can only happen with duplicate IDs, but that should not be > possible! So I wanted to get a list of deleted documents so that I could try > and figure out why they were deleted immediately. [...]
What do you mean by "9000 are added and 6000 immediately deleted again"? How are you getting the number added, and the number deleted? How many documents does DIH report on the final screen after the full-import completes? >From what you describe, it is most likely duplicate IDs. Could you do a SELECT from the database outside of Solr, create the IDs as you do with DIH, and see what is going wrong there? Regards, Gora