Hello,

>________________________________
> From: mustafozbek <mustafoz...@gmail.com>
>
>All documents that we use are rich text documents, and we parse them with
>Tika. We need to search in real time.

Because of the real-time requirement, you'll need to use an unreleased/dev version of Solr.

>Robert Stewart wrote
>> Any idea how many documents your 5TB data contains?
>There are about 3 million documents. You see, the problem is that we have
>documents that are large in size and small in number. Is that fine?

That's fine. But you may want to think about breaking up large docs into smaller Solr docs. Finding a match in a very large doc can make it hard for users to jump to the match or matches, unless you highlight matches in the document and let the user jump from match to match.

Otis
----
Performance Monitoring SaaS for Solr - http://sematext.com/spm/solr-performance-monitoring/index.html
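
For illustration only, the idea of breaking up large docs into smaller Solr docs could look roughly like the sketch below. It is not from the thread: the function name, the field names (`id`, `parent_id`, `chunk_index`, `text`) and the chunk sizes are all assumptions, and it only prepares the smaller docs as plain dictionaries without sending anything to Solr.

```python
def split_into_solr_docs(parent_id, text, max_chars=2000, overlap=200):
    """Split one large document's text into smaller dicts, one per chunk.

    Field names and sizes are hypothetical; adapt them to your own schema.
    Consecutive chunks share `overlap` characters so that a phrase sitting
    on a chunk boundary still appears whole in at least one chunk.
    """
    if max_chars <= overlap:
        raise ValueError("max_chars must be larger than overlap")

    docs = []
    step = max_chars - overlap
    start = 0
    index = 0
    while start < len(text):
        docs.append({
            "id": f"{parent_id}_{index}",  # unique key for the small doc
            "parent_id": parent_id,        # lets results be grouped per original doc
            "chunk_index": index,          # position of the chunk in the original
            "text": text[start:start + max_chars],
        })
        if start + max_chars >= len(text):
            break  # this chunk already reached the end of the text
        start += step
        index += 1
    return docs


if __name__ == "__main__":
    small_docs = split_into_solr_docs("doc1", "a" * 5000)
    print(len(small_docs), [d["id"] for d in small_docs])
```

Keeping `parent_id` and `chunk_index` on each small doc is one way to let a search front end show which part of the original document matched and move between neighbouring parts.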