Hi,

We use Solr like a search engine / document store / database. We are
currently optimizing a test environment and would welcome any relevant
suggestions.

I've taken a lot of time researching this mailing list and found a lot of
relevant information.


Here's our current setup :


SolrCloud 6.5 Cluster : 5-6 nodes (8 cpu, 16GB ram : 8GB Solr and 8GB OS)
with 3-4 Solr cores each. Shards: 9-10 with replicationFactor = 2.

Current sharding: we have 4-5 cores per 8-cpu server.

Documents : 10M +, based on Wikipedia.

We use dynamic fields, with hundred of different fields (400?) but
individual documents have around 50 fields. We store most field.

We do not commit immediately: once every few seconds

We use fast hybrid storage (1 GB/s write)

Cache : generally have 0.4-0.66 hit rate. Default options.

Tried changing max indexing threads and merge factor. No significant gain.

We use an application cache for simple queries (get by id)

Current performance: on a 1100 concurrent users load, we average 3-4
seconds by query / update.

We use a couple of negative filter queries. Example *:* AND -type_id:A

Couple of questions:

-          Suggestions for new optimisations?

-          Anyone seen performance gains from 6.5 to Solr 7.1?

-          Do negative filter queries severely impact performance?

-          Heap is not an issue, but could docValues enhance search
performance? Or just memory usage?

Here’s an excellent guide we used for some of our work :
http://events.linuxfoundation.org/sites/events/files/slides/HighPerformanceSolr.pdf


Nicolas

Reply via email to