Re: Support for huge data set?

2011-05-12 Thread atreyu
Thanks for the detailed response, Jonathon. I will look into the links and check out SolrCloud and Distributed Search. Load-sharing b/t 2 or 3 servers should not pose a problem, so long as it is robust (or at least not slower), fault-tolerant, and reliable. -- View this message in context: http

Re: Support for huge data set?

2011-05-12 Thread atreyu
Oh, my fault. No, I am not using Solr yet - just evaluating it. The current implementation is a combination of Sphinx and Oracle Text, but I have not been involved with any of the integration - I'm more of an outside analyst looking in, but will probably be involved in the integration of any new

Support for huge data set?

2011-05-12 Thread atreyu
Hi, I have about 300 million docs (or 10TB data) which is doubling every 3 years, give or take. The data mostly consists of Oracle records, webpage files (HTML/XML, etc.) and office doc files. There are b/t two and four dozen concurrent users, typically. The indexing server has > 27 GB of RAM,