Hi guys, there's not much of a search operation here. Why not store the documents in a key/value store and simply fetch them by their ids?
Another approach: since there is no real query, you could easily partition the set of ids and fetch the results in multiple batches. The default maximum number of clauses should be 1024. You can set it to a higher value using the respective method in org.apache.lucene.search.BooleanQuery (I've never done that myself, though), but your mileage may vary.

What is the idea behind this retrieval? Do you really want to fetch objects by id? If so, check out MemcacheDB, Apache Cassandra, or Apache CouchDB, depending on your application and the type of information you want to store.

Best regards,
--Jürgen

On 14.11.2014 17:51, henry cleland wrote:
> Hi guys,
> How do I search only a subset of my corpus based on a large list of non
> consecutive unique key ids (cannot do a range query).
> Is there a way around doing this q=id:(id1 OR id2 OR id3 OR id4 ... OR
> id40000 ) AND name:*
>
> Also what is the limit of "OR"s i can apply on the query if that is the
> only way out, i don't suppose it is infinity.
> Thanks
>

-- 
Jürgen Wagner
Head of Competence Center "Intelligence" & Senior Cloud Consultant
Devoteam GmbH
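
P.S. To make the batching suggestion above a bit more concrete, here is a minimal sketch in Python. It only builds the query strings and does not talk to Solr. The helper names (batches, build_query, batched_queries) are made up for illustration, the 1024 limit is the default mentioned above, and the ids are assumed to need no escaping.

```python
from typing import Iterator, List

# Assumed default clause limit, as mentioned above; change it if you raised the limit.
MAX_CLAUSES = 1024


def batches(ids: List[str], size: int = MAX_CLAUSES) -> Iterator[List[str]]:
    """Yield consecutive slices of ids, each holding at most `size` entries."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def build_query(batch: List[str], field: str = "id") -> str:
    """Build one query string of the form id:(a OR b OR c).

    Assumes the ids contain no characters that need escaping.
    """
    return f"{field}:({' OR '.join(batch)})"


def batched_queries(ids: List[str], size: int = MAX_CLAUSES,
                    field: str = "id") -> List[str]:
    """Return one query string per batch (hypothetical helper)."""
    return [build_query(batch, field) for batch in batches(ids, size)]
```

Each resulting string can then be sent as a separate request (with "AND name:*" appended if you still need it), and the results merged on the client side. For 40000 ids and a batch size of 1024 that means 40 requests instead of one oversized query.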