De-basing / re-basing docIDs, or how to effectively pass calculated values from a Scorer or Filter up to (Solr's) QueryComponent.process

Aaron McKee Tue, 06 Oct 2009 09:41:48 -0700

(Posted here, per Yonik's suggestion)

In the code I'm working with, I generate a cache of calculated values asa by-product within a Filter.getDocidSet implementation (and within aQuery-ized version of the filter and its Scorer method) . These valuesare keyed off the IndexReader's docID values, since that's all that'saccessible at that level. Ultimately, however, I need to be able toaccess these values much higher up in the stack (Solr'sQueryComponent.process method), so that I can inject the dynamic valuesinto the response as a fake field. The IDs available here, however, arefor the entire index and not just relative to the current IndexReader.I'm still fairly new to Lucene and I've been scratching my head a bittrying to find a reliable way to map these values into the same space,without having to hack up too many base classes. I noticed that therewas a related discussion at:

http://issues.apache.org/jira/browse/LUCENE-1821?focusedCommentId=12745041#action_12745041

... but also a bit of disagreement on the suggested strategies. Ideally,I'm also hoping there's a strategy that won't require me to hack up toomuch of the core product; subclassing IndexSearcher in the way suggestedwould basically require me to change all of the various SearchComponentsI use in Solr, and that sounds like it'd end up a real maintenancenightmare. I was looking at the Collector class as possible solution,since it has knowledge of the docbase, but it looks like I'd then needto change every derived collector that the code ultimately uses and,including the various anonymous Collectors in Solr, that also looks likeit'd be a fairly ghoulish solution. I suppose I'm being wishful, orlazy, but is there a reasonable and reliable way to do this, withouthaving to fork the core code? If not, any suggestion on the beststrategy to accomplish this, without adding too much overhead every timeI wanted to up-rev the core Lucene and/or Solr code to the latest version?


Thanks a ton,
Aaron

De-basing / re-basing docIDs, or how to effectively pass calculated values from a Scorer or Filter up to (Solr's) QueryComponent.process

Reply via email to