You'd want to run your Solr spatial query, get the IDs from the index, and
*after* that do a multi get against your HBase table with the top N IDs from
Solr's response, and thus get the data back to the caller.  I don't know
how fast multi gets are, what the limitations are, etc.  Maybe somebody
else can address that.
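
To make the flow concrete, here is a rough sketch of the two-step lookup.
It runs against in-memory stand-ins rather than real clusters:
solr_spatial_uids, multi_get, HBASE_TABLE and spatial_then_fetch are all
hypothetical names invented for illustration, not Solr or HBase APIs.  In a
real setup the first would be the Solr spatial query and the second a
single batched HBase multi get.

```python
from typing import Dict, List, Optional

# Hypothetical stand-in for the HBase table: row key (UID) -> column values.
HBASE_TABLE: Dict[str, Dict[str, str]] = {
    "12345": {"COL1": "a", "COL3": "ABC"},
    "12346": {"COL1": "b", "COL3": "XYZ"},
    "12347": {"COL1": "c", "COL3": "ABC"},
}


def solr_spatial_uids(polygon: str) -> List[str]:
    """Hypothetical stand-in for the Solr spatial query.

    Returns the UIDs of matching documents in rank order. The last UID is
    indexed but has no row in the table, to show how that case is handled.
    """
    return ["12347", "12346", "12345", "99999"]


def multi_get(uids: List[str]) -> Dict[str, Dict[str, str]]:
    """Hypothetical stand-in for one batched multi get; missing rows are skipped."""
    return {uid: HBASE_TABLE[uid] for uid in uids if uid in HBASE_TABLE}


def spatial_then_fetch(polygon: str, top_n: int,
                       col3: Optional[str] = None) -> List[Dict[str, str]]:
    """Query the index first, then fetch the top N rows in one round trip."""
    uids = solr_spatial_uids(polygon)[:top_n]
    rows = multi_get(uids)
    results = []
    for uid in uids:  # iterate over uids to keep Solr's ranking
        row = rows.get(uid)
        if row is None:
            continue  # indexed, but no matching row in the table
        if col3 is not None and row.get("COL3") != col3:
            continue  # the non-indexed condition is applied after the fetch
        results.append({"UID": uid, **row})
    return results
```

Note that filtering on COL3 only after the fetch can return fewer than N
rows.  If that matters, indexing COL3 in Solr as well would let the whole
AND be evaluated before the multi get.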

Alternatively, I suppose you could implement a custom collector that does
gets as matching documents are being collected by Solr.  I don't recall the
class/interface you'd need to implement off the top of my head.
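
Just to illustrate the idea, and not the actual Solr/Lucene interface, a
collector along these lines could buffer matching UIDs and issue gets in
small batches rather than one get per document.  BatchingGetCollector,
multi_get and STORE below are hypothetical names backed by an in-memory
dict; the batch size of 2 is an arbitrary assumption.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for the HBase table: row key (UID) -> column values.
STORE: Dict[str, Dict[str, str]] = {
    "12345": {"COL3": "ABC"},
    "12346": {"COL3": "XYZ"},
    "12347": {"COL3": "ABC"},
}


def multi_get(uids: List[str]) -> Dict[str, Dict[str, str]]:
    """Hypothetical stand-in for one batched multi get; missing rows are skipped."""
    return {uid: STORE[uid] for uid in uids if uid in STORE}


class BatchingGetCollector:
    """Hypothetical collector that fetches rows while matches are collected."""

    def __init__(self,
                 fetch: Callable[[List[str]], Dict[str, Dict[str, str]]],
                 batch_size: int = 2) -> None:
        self.fetch = fetch
        self.batch_size = batch_size
        self.pending: List[str] = []          # UIDs collected but not yet fetched
        self.rows: List[Dict[str, str]] = []  # fetched rows, in collection order
        self.batches = 0                      # number of round trips issued

    def collect(self, uid: str) -> None:
        """Called once per matching document."""
        self.pending.append(uid)
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        """Fetch whatever is buffered; call once more after the last document."""
        if not self.pending:
            return
        fetched = self.fetch(self.pending)
        self.rows.extend({"UID": uid, **fetched[uid]}
                         for uid in self.pending if uid in fetched)
        self.pending = []
        self.batches += 1
```

The trade-off compared with the top N approach is that this fetches rows
for every matching document, not just the ones that end up being returned.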

Otis
--
Solr & ElasticSearch Support
http://sematext.com/





On Thu, Jan 17, 2013 at 8:01 PM, oakstream
<mike.oa...@oakstreamsystems.com> wrote:

> Thanks for your response!  I appreciate it.
>
> There will be cases where I want to AND or OR the query between HBase and
> Lucene.  Would it make sense to write custom code that queries both
> repositories at the same time or sequentially?  Or are there any tools out
> there that do this?
>
> Basically I'm thinking that HBase will keep the majority of my data columns
> and Lucene will keep the index and a unique pointer to the HBase record.
>
> Like
> HBASE
>
> UID = 12345, COL1, COL2, COL3, COL4, COL5, COL6
>
> LUCENE
> ID = 999, UID = 12345 , INDEX Columns (LAT/LON)
>
> My query would be something like where lat/lon in (Polygon) AND COL3 =
> 'ABC'
>
> Would this kind of setup make sense?  Is there a better way?
>
> I'll be working with terabytes of data.
>
> Thanks
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Using-Solr-Spatial-in-conjunction-with-HBASE-Hadoop-tp4034307p4034400.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
