On Thu, 2014-07-03 at 13:10 +0200, Matt Stunfield wrote:
[Sections of text < 200 words, dictionary of terms & phrases]
> Task: find if terms occur in single section. Found terms must be
> distinguished. In query result there is (highlighted?) found terms
> positions.

I agree with Alexandre that this sounds like a Luwak-job.

> 2. I suppose it isn’t possible to perform search with 1600 search items
> like: “term_1” “term_2” … “term_N” at once, is it?

This should be possible with stock Solr, although it would not be a very
fast query. You might have to use POST instead of GET, as the request
will be very big. At one point you had to ensure that the Solr config
was set to handle many boolean clauses
http://wiki.apache.org/solr/SolrConfigXml#The_Query_Section
but that does not seem to apply anymore
https://issues.apache.org/jira/browse/SOLR-4586

> 3. Is Lucene/Solr capable of performing simultaneous 1600 queries in
> separate threads?

Yes (the hard limit is 10,000 with the supplied Jetty), although my guess
is that you would get better performance with fewer concurrent requests.
Anyway, that solution would mean that the results have to be merged on
your end, which would require some work to merge the highlight
information for the matched terms.

- Toke Eskildsen, State and University Library, Denmark
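
A rough sketch of the single-request approach from point 2: it joins all terms into one OR query of quoted phrases, turns highlighting on, and sends the whole thing as a form-encoded POST so the size of the request is not a problem. The URL, the collection name and the field name "text" are assumptions made for illustration, not taken from the original setup, and the escaping only covers quotes and backslashes inside a phrase.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical endpoint and field name; adjust to the actual Solr setup.
SOLR_SELECT_URL = "http://localhost:8983/solr/collection1/select"
TEXT_FIELD = "text"


def quote_phrase(term):
    """Wrap a term in double quotes, escaping backslashes and embedded quotes."""
    escaped = term.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


def build_params(terms, field=TEXT_FIELD):
    """Build request parameters for one OR query over all terms, with highlighting."""
    clauses = " OR ".join(quote_phrase(t) for t in terms)
    return {
        "q": field + ":(" + clauses + ")",
        "hl": "true",         # enable highlighting
        "hl.fl": field,       # field to highlight
        "hl.snippets": "10",  # allow several highlighted snippets per section
        "wt": "json",
    }


def search(terms, url=SOLR_SELECT_URL):
    """POST the query to Solr and return the parsed JSON response."""
    body = urllib.parse.urlencode(build_params(terms)).encode("utf-8")
    # Passing data makes urllib issue a POST with a form-encoded body.
    request = urllib.request.Request(url, data=body)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling search() with the full list of 1600 terms issues a single request, so the highlight information comes back in one response and does not have to be merged on the client side. On older Solr versions the boolean clause limit mentioned above may still need to be raised for a query of this size.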