You'd want to do your Solr spatial query, get IDs from the index, and then *after* that do a multi get against your HBase table with top N IDs from Solr's response and get thus get the data back to the caller. I don't know how fast multi gets are, what the limitations are, etc. Maybe somebody else can address that.
Alternatively, I suppose you could implement a custom collector that does gets as matching documents are being collected by Solr. I don't recall the class/interface you'd need to implement off the top of my head. Otis -- Solr & ElasticSearch Support http://sematext.com/ On Thu, Jan 17, 2013 at 8:01 PM, oakstream <mike.oa...@oakstreamsystems.com>wrote: > Thanks for your response! I appreciate it. > > There will be cases where I want to "AND or OR" the query between HBASE and > Lucene. Would it make sense to custom code querying both repositories at > the same time or sequentially....Or are there any tools out there to do > this? > > Basically I'm thinking that HBASE will keep the majority of my data columns > and lucene will keep the index and a unique pointer to the HBASE record. > > Like > HBASE > > UID = 12345, COL1, COL2, COL3, COL4, COL5, COL6 > > LUCENE > ID = 999, UID = 12345 , INDEX Columns (LAT/LON) > > My query would be something like where lat/lon in (Polygon) AND COL3 = > 'ABC' > > Would this kind of setup make sense? Is there a better way? > > I'll be working with Terabytes of data > > Thanks > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Using-Solr-Spatial-in-conjunction-with-HBASE-Hadoop-tp4034307p4034400.html > Sent from the Solr - User mailing list archive at Nabble.com. >