SOLR 4.0 Beta documents being duplicated

David Quarterman Fri, 05 Oct 2012 07:26:14 -0700

Hi,

We've been using V4.x of SOLR since last November without too much
trouble. Our MySQL database is refreshed daily and a full import is run
automatically after the refresh and generally produces around 86,000
products, obviously on unique doc_id's.


 

So, we upgraded to 4.0 Beta a few days ago, with only mild difficulty,
reindexed and all was fine. Except after the next data refresh and
full-import, we had duplicate products appearing on different unique
doc_ids. Not all products are being duplicated, just random ones. We've
just deleted the data directory and reindexed and the product count has
dropped from 116,711 to 86,543. There'll be another refresh/import early
tomorrow morning and I fear we'll have more duplicates.

 

The call to the import now contains clean=true, commit=true and
optimize=true but it seems to make no difference.

 

Anyone have any ideas?

 

Regards,

 

David Q

SOLR 4.0 Beta documents being duplicated

Reply via email to