I am doing a DIH full import on a very recent checkout from branch_4x. One thing I've recently changed is enabling autocommit. I am seeing deleted documents in some of the indexes. See "Development Build Indexes" at the bottom of the following screenshot. When the import is complete, the numbered shards will contain 13 million documents.

http://dl.dropbox.com/u/97770508/statuspage-deletes-import.png

The MySQL database that this imports from has a unique index on the field that Solr is using for its UniqueKey, so it's not possible to have duplicates. Each import uses one SELECT statement for the entire 13 million document import. What might be leading to these deleted docs?

Thanks,
Shawn
