Re: integrating Accumulo with solr

Ali Nazemian Fri, 25 Jul 2014 01:30:17 -0700

Dear Jack,
Actually I am going to do benefit-cost analysis for in-house developement
or going for sqrrl support.
Best regards.



On Thu, Jul 24, 2014 at 11:48 PM, Jack Krupansky <j...@basetechnology.com>
wrote:

> Like I said, you're going to have to be a real, hard-core gunslinger to do
> that well. Sqrrl uses Lucene directly, BTW:
>
> "Full-Text Search: Utilizing open-source Lucene and custom indexing
> methods, Sqrrl Enterprise users can conduct real-time, full-text search
> across data in Sqrrl Enterprise."
>
> See:
> http://sqrrl.com/product/search/
>
> Out of curiosity, why are you not using that integrated Lucene support of
> Sqrrl Enterprise?
>
>
> -- Jack Krupansky
>
> -----Original Message----- From: Ali Nazemian
> Sent: Thursday, July 24, 2014 3:07 PM
>
> To: solr-user@lucene.apache.org
> Subject: Re: integrating Accumulo with solr
>
> Dear Jack,
> Thank you. I am aware of datastax but I am looking for integrating accumulo
> with solr. This is something like what sqrrl guys offer.
> Regards.
>
>
> On Thu, Jul 24, 2014 at 7:27 PM, Jack Krupansky <j...@basetechnology.com>
> wrote:
>
>  If you are not a "true hard-core gunslinger" who is willing to dive in and
>> integrate the code yourself, instead you should give serious consideration
>> to a product such as DataStax Enterprise that fully integrates and
>> packages
>> a NoSQL database (Cassandra) and Solr for search. The security aspects are
>> still a work in progress, but certainly headed in the right direction. And
>> it has Hadoop and Spark integration as well.
>>
>> See:
>> http://www.datastax.com/what-we-offer/products-services/
>> datastax-enterprise
>>
>> -- Jack Krupansky
>>
>> -----Original Message----- From: Ali Nazemian
>> Sent: Thursday, July 24, 2014 10:30 AM
>> To: solr-user@lucene.apache.org
>> Subject: Re: integrating Accumulo with solr
>>
>>
>> Thank you very much. Nice Idea but how can Solr and Accumulo can be
>> synchronized in this way?
>> I know that Solr can be integrated with HDFS and also Accumulo works on
>> the
>> top of HDFS. So can I use HDFS as integration point? I mean set Solr to
>> use
>> HDFS as a source of documents as well as the destination of documents.
>> Regards.
>>
>>
>> On Thu, Jul 24, 2014 at 4:33 PM, Joe Gresock <jgres...@gmail.com> wrote:
>>
>>  Ali,
>>
>>>
>>> Sounds like a good choice.  It's pretty standard to store the primary
>>> storage id as a field in Solr so that you can search the full text in
>>> Solr
>>> and then retrieve the full document elsewhere.
>>>
>>> I would recommend creating a document structure in Solr with whatever
>>> fields you want indexed (most likely as text_en, etc.), and then store a
>>> "string" field named "content_id", which would be the Accumulo row id
>>> that
>>> you look up with a scan.
>>>
>>> One caveat -- Accumulo will be protected at the cell level, but if you
>>> need
>>> your Solr search results to be protected by complex authorization strings
>>> similar to Accumulo, you will need to write your own QParserPlugin and
>>> use
>>> post filtering:
>>> http://java.dzone.com/articles/custom-security-filtering-solr
>>>
>>> The code you see in that article is written for an earlier version of
>>> Solr,
>>> but it's not too difficult to adjust it for the latest (we've done so in
>>> our project).  Once you've implemented this, you would store an
>>> "authorizations" string field in each Solr document, and pass in the
>>> authorizations that the user has access to in the fq parameter of every
>>> query.  It's also not too bad to write something that parses the Accumulo
>>> authorizations string (like A&B&(C|D|E|F)) and interpret it accordingly
>>> in
>>> the QParserPlugin.
>>>
>>> This will give you true row level security in Solr and Accumulo, and it
>>> performs quite well in Solr.
>>>
>>> Let me know if you have any other questions.
>>>
>>> Joe
>>>
>>>
>>> On Thu, Jul 24, 2014 at 4:07 AM, Ali Nazemian <alinazem...@gmail.com>
>>> wrote:
>>>
>>> > Dear Joe,
>>> > Hi,
>>> > I am going to store the crawl web pages in accumulo as the main storage
>>> > part of my project and I need to give these data to solr for indexing >
>>> and
>>> > user searches. I need to do some social and web analysis on my data as
>>> well
>>> > as having some security features. Therefore accumulo is my choice for >
>>> the
>>> > database part and for index and search I am going to use Solr. Would >
>>> you
>>> > please guide me through that?
>>> >
>>> >
>>> >
>>> > On Thu, Jul 24, 2014 at 1:28 AM, Joe Gresock <jgres...@gmail.com>
>>> wrote:
>>> >
>>> > > We store data in both Solr and Accumulo -- do you have more details
>>> about
>>> > > what kind of data and indexing you want?  Is there a reason you're
>>> > thinking
>>> > > of using both databases in particular?
>>> > >
>>> > >
>>> > > On Wed, Jul 23, 2014 at 5:17 AM, Ali Nazemian <alinazem...@gmail.com
>>> >
>>> > > wrote:
>>> > >
>>> > > > Dear All,
>>> > > > Hi,
>>> > > > I was wondering is there anybody out there that tried to integrate
>>> Solr
>>> > > > with Accumulo? I was thinking about using Accumulo on top of HDFS >
>>> > > and
>>> > > using
>>> > > > Solr to index data inside Accumulo? Do you have any idea how can I
>>> > > > do
>>> > > such
>>> > > > integration?
>>> > > >
>>> > > > Best regards.
>>> > > >
>>> > > > --
>>> > > > A.Nazemian
>>> > > >
>>> > >
>>> > >
>>> > >
>>> > > --
>>> > > I know what it is to be in need, and I know what it is to have > >
>>> plenty.
>>>  I
>>> > > have learned the secret of being content in any and every situation,
>>> > > whether well fed or hungry, whether living in plenty or in want.  I >
>>> > can
>>> > do
>>> > > all this through him who gives me strength.    *-Philippians 4:12-13*
>>> > >
>>> >
>>> >
>>> >
>>> > --
>>> > A.Nazemian
>>> >
>>>
>>>
>>>
>>> --
>>> I know what it is to be in need, and I know what it is to have plenty.  I
>>> have learned the secret of being content in any and every situation,
>>> whether well fed or hungry, whether living in plenty or in want.  I can
>>> do
>>> all this through him who gives me strength.    *-Philippians 4:12-13*
>>>
>>>
>>>
>>
>> --
>> A.Nazemian
>>
>>
>
>
> --
> A.Nazemian
>



-- 
A.Nazemian

Re: integrating Accumulo with solr

Reply via email to