Re: integrating Accumulo with solr

Erik Hatcher Thu, 24 Jul 2014 08:16:14 -0700

Just FYI, the blog Joe mentioned below (authored by me) has been adjusted to 
Solr 4.x in the original blog location here:


   <http://searchhub.org/2012/02/22/custom-security-filtering-in-solr/>

        Erik

On Jul 24, 2014, at 8:03 AM, Joe Gresock <jgres...@gmail.com> wrote:

> Ali,
> 
> Sounds like a good choice.  It's pretty standard to store the primary
> storage id as a field in Solr so that you can search the full text in Solr
> and then retrieve the full document elsewhere.
> 
> I would recommend creating a document structure in Solr with whatever
> fields you want indexed (most likely as text_en, etc.), and then store a
> "string" field named "content_id", which would be the Accumulo row id that
> you look up with a scan.
> 
> One caveat -- Accumulo will be protected at the cell level, but if you need
> your Solr search results to be protected by complex authorization strings
> similar to Accumulo, you will need to write your own QParserPlugin and use
> post filtering:
> http://java.dzone.com/articles/custom-security-filtering-solr
> 
> The code you see in that article is written for an earlier version of Solr,
> but it's not too difficult to adjust it for the latest (we've done so in
> our project).  Once you've implemented this, you would store an
> "authorizations" string field in each Solr document, and pass in the
> authorizations that the user has access to in the fq parameter of every
> query.  It's also not too bad to write something that parses the Accumulo
> authorizations string (like A&B&(C|D|E|F)) and interpret it accordingly in
> the QParserPlugin.
> 
> This will give you true row level security in Solr and Accumulo, and it
> performs quite well in Solr.
> 
> Let me know if you have any other questions.
> 
> Joe
> 
> 
> On Thu, Jul 24, 2014 at 4:07 AM, Ali Nazemian <alinazem...@gmail.com> wrote:
> 
>> Dear Joe,
>> Hi,
>> I am going to store the crawl web pages in accumulo as the main storage
>> part of my project and I need to give these data to solr for indexing and
>> user searches. I need to do some social and web analysis on my data as well
>> as having some security features. Therefore accumulo is my choice for the
>> database part and for index and search I am going to use Solr. Would you
>> please guide me through that?
>> 
>> 
>> 
>> On Thu, Jul 24, 2014 at 1:28 AM, Joe Gresock <jgres...@gmail.com> wrote:
>> 
>>> We store data in both Solr and Accumulo -- do you have more details about
>>> what kind of data and indexing you want?  Is there a reason you're
>> thinking
>>> of using both databases in particular?
>>> 
>>> 
>>> On Wed, Jul 23, 2014 at 5:17 AM, Ali Nazemian <alinazem...@gmail.com>
>>> wrote:
>>> 
>>>> Dear All,
>>>> Hi,
>>>> I was wondering is there anybody out there that tried to integrate Solr
>>>> with Accumulo? I was thinking about using Accumulo on top of HDFS and
>>> using
>>>> Solr to index data inside Accumulo? Do you have any idea how can I do
>>> such
>>>> integration?
>>>> 
>>>> Best regards.
>>>> 
>>>> --
>>>> A.Nazemian
>>>> 
>>> 
>>> 
>>> 
>>> --
>>> I know what it is to be in need, and I know what it is to have plenty.  I
>>> have learned the secret of being content in any and every situation,
>>> whether well fed or hungry, whether living in plenty or in want.  I can
>> do
>>> all this through him who gives me strength.    *-Philippians 4:12-13*
>>> 
>> 
>> 
>> 
>> --
>> A.Nazemian
>> 
> 
> 
> 
> -- 
> I know what it is to be in need, and I know what it is to have plenty.  I
> have learned the secret of being content in any and every situation,
> whether well fed or hungry, whether living in plenty or in want.  I can do
> all this through him who gives me strength.    *-Philippians 4:12-13*

Re: integrating Accumulo with solr

Reply via email to