I ended up pursuing the ParallelWriter http://issues.apache.org/jira/browse/LUCENE-600 , so we can map different fields to different indexes. This appears to keep the indexes in sync, although I still need to do more testing.
However, some ugly hackery was needed to get it to extend SolrIndexWriter, so it could be dropped in as a replacement for the existing writers. The writer gets created by a custom UpdateHandler, which overrides createMainIndexWriter. Most of this can be done with extensions/plugins to Solr, but there are a few parts that need to patch Solr directly (eg. SolrCore directly creating Searchers & Writers, need more than one index dirs, etc) thanks for the comments -Mark Grant Ingersoll-6 wrote: > > ParallelReader is definitely out there on the Lucene landscape. See > http://www.lucidimagination.com/search/page:2?q=ParallelReader > for some background discussion, including Doug's original post on it > and some others view of the use case. The key is that the small index > has to be rebuilt in exactly the same order as the large index, which > seems particularly onerous in high-update environments. I will add > that it is definitely one of those areas most people do not use, so > getting help on it may be difficult. > > I've often thought about an AsynchronousParallelReader that maintained > a mapping between the two indexes such that you could let the indexes > get out of sync, but have never implemented it or gone far enough down > the path to know whether it would even work or not. The devil is > likely in the details what with Lucene's merging, etc. > > Thinking out loud, you might also try a custom component (or some > changes to the QueryComponent) that uses the MultiSearcher or maybe > some lower level Solr changes. The MultiSearcher is also designed to > search across multiple indexes. > > HTH, > Grant > > > On Feb 9, 2009, at 8:37 PM, Mark Kranz wrote: > >> >> Just an update on my own research: >> >> I have discovered the 'ParallelReader' class (subclass of >> IndexReader) in >> lucene, which is designed for searching across multiple indexes. >> >> This appears to suit our needs - and I do not expect will be too >> difficult >> to integrate into Solr. >> -- >> View this message in context: >> http://www.nabble.com/Vertical-Partitioning-advice-tp21906668p21926031.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> > > -------------------------- > Grant Ingersoll > http://www.lucidimagination.com/ > > Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) > using Solr/Lucene: > http://www.lucidimagination.com/search > > > -- View this message in context: http://www.nabble.com/Vertical-Partitioning-advice-tp21906668p21946496.html Sent from the Solr - User mailing list archive at Nabble.com.