We have a cronjob that runs every week at a quiet time to run the optimizecommand on our Solr collections. Even when it's quiet it's still an extremely heavy operation.
One of the things I keep seeing on stackoverflow is that optimizing is now essentially deprecated and lucene (We're on Solr 5.5.2) will now keep the amount of segments at a reasonable level and that the performance impact of having deletedDocs is now much less. One of our cores doesn't get optimized and it's currently sitting at 5.5 million documents with 1.9 million deleted docs. Which seems pretty high to me. How true is this claim? Is optimizing still a good idea for the general case? -- Mintel Group Ltd | 11 Pilgrim Street | London | EC4V 6RN Registered in England: Number 1475918. | VAT Number: GB 232 9342 72 Contact details for our other offices can be found at http://www.mintel.com/office-locations. This email and any attachments may include content that is confidential, privileged or otherwise protected under applicable law. Unauthorised disclosure, copying, distribution or use of the contents is prohibited and may be unlawful. If you have received this email in error, including without appropriate authorisation, then please reply to the sender about the error and delete this email and any attachments.