Thanks Erik. I used implicit shards. So the right maintenance could be add other shards after a period of time, change the roule that fill partition field in collection and drop old shards when they are empty. Is it right ? How can I see that 2 billion records limit is reached ? Is there an API ? -- Inviato da Libero Mail per Android Giovedì, 06 Luglio 2017, 11:17PM +02:00 da Erick Erickson erickerick...@gmail.com :
>Right, every individual shard is limited to 2B records. That does >include deleted docs. But I've never seen a shard (a Lucene index >actually) perform satisfactorily at that scale so while this is a >limit people usually add shards long before that. > >There is no technical reason to optimize every time, normal segment >merging will eventually remove the data associated with deleted >documents. You'll carry forward a number of deleted docs, but I >usually see it stabilize around 10%-15%. > >You don't necessarily have to re-index, you can split existing shards. > >But from your e-mail, it looks like you think you have to do something >explicit to reclaim the resources associated with deleted documents. >You do not have to do this. Optimize is really a special heavyweight >merge. Normal merging happens when you do a commit and that process >also reclaims the deleted document resources. > >Best, >Erick > >On Thu, Jul 6, 2017 at 11:59 AM, < calamita.agost...@libero.it > wrote: >> Hi, >> >> I'm working on an application that index CDR ( Call Detail Record ) in >> SolrCloud with 1 collection and 3 shards. >> >> Every day the application index 30 millions of CDR. >> >> I have a purge application that delete records older than 10 days, and call >> OPTIMIZE, so the collection will keep only 300 millions of CDR. >> >> Do you know if there is a limit on max number of documents per shard , >> included deleted documents ? >> >> I read in some blogs that there is a limit of 2 Billions per shard included >> deleted documents, that is I can have an empty collection, but if I already >> indexed 6 Billions of CDR ( 2 per 3 shards ) in that collection, I'll get an >> error. Is it true ? Have I recreate the collection ? >> >> I see that when I call delete records, apache solr free space on disk. >> >> Thanks. >> >> Agostino >> >> >>