Hello 

We are implementing SolrCloud; we expect around ~200millions documents per 
node and 160-200 nodes. I looked on other references, seems like we are 
not the first to work with such volume. 

The indexing itself will be done locally (no distribution, each 
node-server indexes its own) 
The search is distributed. The search includes simple search, facet and 
pivot. 

The end-user may search specific field or free-text-search. 


We are indexing kind of event log (user, client, serverIP, time, object, 
etc.around 14 fields); 
We would like to enable specific field search (e.g., user=John Smith) and 
also free text search (e.g., John Smith with no restriction to specific 
field). 

We've tried to index each field separately and the whole string together 
(all fields together) in another field to allow free-text. 

With 1 million documents where a document represents one event (pretty 
short), the performance are poor (seconds , we expect ms). 

- The field search is fast but when searching the full string field 
(free-text-search) it is pretty slow (seconds). 
- We've implement the SolrCloud, when we try two machines with 1 millions 
documents, the Pivot search is very, very slow. 

In the past we did it with pure Lucene (local only) and it was pretty 
cool, 160millions document were pretty fast for free text search. 


Thanks
Oded


Reply via email to