Hi, We've been using V4.x of SOLR since last November without too much trouble. Our MySQL database is refreshed daily and a full import is run automatically after the refresh and generally produces around 86,000 products, obviously on unique doc_id's.
So, we upgraded to 4.0 Beta a few days ago, with only mild difficulty, reindexed and all was fine. Except after the next data refresh and full-import, we had duplicate products appearing on different unique doc_ids. Not all products are being duplicated, just random ones. We've just deleted the data directory and reindexed and the product count has dropped from 116,711 to 86,543. There'll be another refresh/import early tomorrow morning and I fear we'll have more duplicates. The call to the import now contains clean=true, commit=true and optimize=true but it seems to make no difference. Anyone have any ideas? Regards, David Q