Hi All,

I have a problem with scalability on my project. we are running almost
close of 100 cores which are having documents of ~25000 each and the total
size of the index files being 7.5 GB.


Also, we have the staging server where we build index files using data
importer and using replication we are pushing data to data LIVE servers
which are used for serving the live application. because we are initiating
index pulling from live servers currently 5 servers against one stage
server. Stage server is overburden and will not be able to respond properly
either to indexing job or to other services (very minimal).


So now the problem statement is I understand my current SOLR architecture
is not able to handle all my needs so we are thinking to upgrade to next
levels but I am confused with below questions,


   1. Can I use index Sharding for my problem? it is recommended to use if
   my have more than ~1million docs in a core but I have in only 25000 in one
   core and having 100 cores.
   2. Can I consider SOLR cloud? if yes please let me know why?
   3. How about using apache zookeeper and maintain only LIVE servers by
   dividing every 20 cores to each server? so that I can make only 25 snap
   pull request to stage server instead of 100 snap-puller per server as
   current.

Any new suggestions or reply to this email is greatly appreciated. Thanks
in advance.



-- 
Regards,
Santosh Sidnal

Reply via email to