Performance issues when querying on large documents

ahammad Fri, 23 Jul 2010 11:53:18 -0700

Hello,

I have an index with lots of different types of documents. One of those
types basically contains extracts of PDF docs. Some of those PDFs can have
1000+ pages, so there would be a lot of stuff to search through.


I am experiencing really terrible performance when querying. My whole index
has about 270k documents, but less than 1000 of those are the PDF extracts.
The slow querying occurs when I search only on those PDF extracts (by
specifying filters), and return 100 results. The 100 results definitely adds
to the issue, but even cutting that down can be slow.

Is there a way to improve querying with such large results? To give an idea,
querying for a single word can take a little over a minute, which isn't
really viable for an application that revolves around searching. For now, I
have limited the results to 20, which makes the query execute in roughly
10-15 seconds. However, I would like to have the option of returning 100
results.

Thanks a lot.

 
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Performance-issues-when-querying-on-large-documents-tp990590p990590.html
Sent from the Solr - User mailing list archive at Nabble.com.

Performance issues when querying on large documents

Reply via email to