Hello
We are implementing SolrCloud; we expect around ~200millions documents per node and 160-200 nodes. I looked on other references, seems like we are not the first to work with such volume. The indexing itself will be done locally (no distribution, each node-server indexes its own) The search is distributed. The search includes simple search, facet and pivot. The end-user may search specific field or free-text-search. We are indexing kind of event log (user, client, serverIP, time, object, etc.around 14 fields); We would like to enable specific field search (e.g., user=John Smith) and also free text search (e.g., John Smith with no restriction to specific field). We've tried to index each field separately and the whole string together (all fields together) in another field to allow free-text. With 1 million documents where a document represents one event (pretty short), the performance are poor (seconds , we expect ms). - The field search is fast but when searching the full string field (free-text-search) it is pretty slow (seconds). - We've implement the SolrCloud, when we try two machines with 1 millions documents, the Pivot search is very, very slow. In the past we did it with pure Lucene (local only) and it was pretty cool, 160millions document were pretty fast for free text search. Thanks Oded