On Thu, 2014-07-03 at 13:10 +0200, Matt Stunfield wrote:

[Sections of text < 200 words, dictionary of terms & phrases]

> Task: find whether terms occur in a single section. Found terms must be
> distinguished. The query result should contain the (highlighted?)
> positions of the found terms.

I agree with Alexandre that this sounds like a Luwak-job.

> 2. I suppose it isn’t possible to perform search with 1600 search items
> like: “term_1” “term_2” … “term_N” at once, is it?

This should be possible with stock Solr, although it would not be a very
fast query. You might have to use POST instead of GET as you will have a
very big request.
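
A minimal sketch of what such a request could look like, using only the
Python standard library. The URL, the collection name and the field name
"text" are assumptions for illustration, not something from your setup.
It only builds the request; sending it is left out.

```python
import urllib.parse
import urllib.request

# Assumed values: adjust to your own Solr host, collection and field.
SOLR_SELECT_URL = "http://localhost:8983/solr/collection1/select"
FIELD = "text"


def quote_phrase(term):
    """Wrap a term in double quotes so Solr treats it as a phrase."""
    escaped = term.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


def build_query(terms):
    """OR all terms together into one large boolean query."""
    return " OR ".join(quote_phrase(t) for t in terms)


def build_post_request(terms):
    """Build (but do not send) a form-encoded POST for the select handler."""
    params = {
        "q": build_query(terms),
        "df": FIELD,      # default field to search in
        "hl": "true",     # turn highlighting on
        "hl.fl": FIELD,   # field to highlight
        "wt": "json",
    }
    body = urllib.parse.urlencode(params).encode("utf-8")
    headers = {
        "Content-Type": "application/x-www-form-urlencoded; charset=UTF-8"
    }
    # Passing data= makes urllib issue a POST instead of a GET.
    return urllib.request.Request(SOLR_SELECT_URL, data=body, headers=headers)
```

Passing the result to urllib.request.urlopen() would send it. With 1600
terms the body runs to tens of kilobytes, which is why GET is a poor fit.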

At one point you had to make sure that the Solr config allowed that many
boolean clauses (maxBooleanClauses, which defaulted to 1024)
http://wiki.apache.org/solr/SolrConfigXml#The_Query_Section
but that no longer seems to apply
https://issues.apache.org/jira/browse/SOLR-4586

> 3. Is Lucene/Solr capable of performing simultaneous 1600 queries in
> separate threads?

Yes (the hard limit is 10,000 with the supplied Jetty), although my guess is
that you would get better performance with fewer concurrent requests.
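
One way to keep the number of concurrent requests down is a bounded
thread pool on the client. This is only a sketch: query_fn is a
hypothetical function that sends one term to Solr and returns the parsed
response, and the pool size of 8 is an arbitrary starting point that you
would have to tune by measuring.

```python
from concurrent.futures import ThreadPoolExecutor


def run_queries(terms, query_fn, max_workers=8):
    """Run query_fn once per term with at most max_workers requests in flight.

    Returns a list of (term, response) pairs in the same order as terms.
    """
    terms = list(terms)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() hands out the terms to the workers and keeps input order.
        responses = list(pool.map(query_fn, terms))
    return list(zip(terms, responses))
```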

Anyway, that solution would mean that the results would have to be
merged on your end, which would take some work to combine the
highlight information for the matched terms.
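
A rough sketch of that merge step, assuming one JSON response per term,
each with Solr's usual "highlighting" block (document id, then field,
then a list of snippets). The field name "text" and the function name
are made up for the example.

```python
from collections import defaultdict


def merge_highlights(per_term_responses, field="text"):
    """Combine per-term highlighting into {doc_id: {term: [snippets]}}.

    per_term_responses is an iterable of (term, response) pairs, where
    response is the parsed JSON of one Solr query for that term.
    """
    merged = defaultdict(dict)
    for term, response in per_term_responses:
        highlighting = response.get("highlighting", {})
        for doc_id, fields in highlighting.items():
            snippets = fields.get(field, [])
            if snippets:  # skip documents without a snippet for this field
                merged[doc_id][term] = snippets
    return dict(merged)
```

The result tells you, for each section, which terms matched and where,
but you still have to parse the snippets if you need exact positions.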

- Toke Eskildsen, State and University Library, Denmark

