A common approach (for web search engines) is to use HBase [1] as a
"Document Repository". Each document indexed inside Solr will have an
entry (row, identified by the document URL) in the HBase table. This
works great when you deal with a large data collection (it scales better
than a SQL data
Sure, we are doing essentially that with our Drupal integration module
- each search result contains a link to the "real" content, which is
stored in MySQL, etc, and presented via the Drupal CMS.
http://drupal.org/project/apachesolr
-Peter
On Tue, Feb 17, 2009 at 11:57 AM, roberto wrote:
> Hell