Just FYI, the blog post Joe mentioned below (authored by me) has been updated for Solr 4.x at its original location:
<http://searchhub.org/2012/02/22/custom-security-filtering-in-solr/>

        Erik

On Jul 24, 2014, at 8:03 AM, Joe Gresock <jgres...@gmail.com> wrote:

> Ali,
>
> Sounds like a good choice.  It's pretty standard to store the primary
> storage id as a field in Solr so that you can search the full text in Solr
> and then retrieve the full document elsewhere.
>
> I would recommend creating a document structure in Solr with whatever
> fields you want indexed (most likely as text_en, etc.), and then store a
> "string" field named "content_id", which would be the Accumulo row id that
> you look up with a scan.
>
> One caveat -- Accumulo will be protected at the cell level, but if you need
> your Solr search results to be protected by complex authorization strings
> similar to Accumulo, you will need to write your own QParserPlugin and use
> post filtering:
> http://java.dzone.com/articles/custom-security-filtering-solr
>
> The code you see in that article is written for an earlier version of Solr,
> but it's not too difficult to adjust it for the latest (we've done so in
> our project).  Once you've implemented this, you would store an
> "authorizations" string field in each Solr document, and pass in the
> authorizations that the user has access to in the fq parameter of every
> query.  It's also not too bad to write something that parses the Accumulo
> authorizations string (like A&B&(C|D|E|F)) and interprets it accordingly in
> the QParserPlugin.
>
> This will give you true row level security in Solr and Accumulo, and it
> performs quite well in Solr.
>
> Let me know if you have any other questions.
>
> Joe
>
>
> On Thu, Jul 24, 2014 at 4:07 AM, Ali Nazemian <alinazem...@gmail.com> wrote:
>
>> Dear Joe,
>> Hi,
>> I am going to store the crawled web pages in Accumulo as the main storage
>> part of my project, and I need to give this data to Solr for indexing and
>> user searches. I need to do some social and web analysis on my data as well
>> as have some security features. Therefore Accumulo is my choice for the
>> database part, and for indexing and search I am going to use Solr. Would
>> you please guide me through that?
>>
>>
>> On Thu, Jul 24, 2014 at 1:28 AM, Joe Gresock <jgres...@gmail.com> wrote:
>>
>>> We store data in both Solr and Accumulo -- do you have more details about
>>> what kind of data and indexing you want?  Is there a reason you're
>>> thinking of using both databases in particular?
>>>
>>>
>>> On Wed, Jul 23, 2014 at 5:17 AM, Ali Nazemian <alinazem...@gmail.com>
>>> wrote:
>>>
>>>> Dear All,
>>>> Hi,
>>>> I was wondering whether anybody out there has tried to integrate Solr
>>>> with Accumulo. I was thinking about using Accumulo on top of HDFS and
>>>> using Solr to index the data inside Accumulo. Do you have any idea how
>>>> I can do such an integration?
>>>>
>>>> Best regards.
>>>>
>>>> --
>>>> A.Nazemian
>>>>
>>>
>>>
>>> --
>>> I know what it is to be in need, and I know what it is to have plenty.  I
>>> have learned the secret of being content in any and every situation,
>>> whether well fed or hungry, whether living in plenty or in want.  I can
>>> do all this through him who gives me strength.    *-Philippians 4:12-13*
>>>
>>
>>
>> --
>> A.Nazemian
>>
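
For anyone following the thread, here is a rough sketch of the authorization-string handling Joe describes: deciding whether a document labelled with an expression such as A&B&(C|D|E|F) is visible to a user holding a given set of authorizations. It is written in Python only to keep it short and self-contained; in Solr this logic would sit inside the Java QParserPlugin / post filter. The names tokenize and is_visible are hypothetical, and the grammar is an assumption: "&" binds tighter than "|", and an empty expression is treated as unrestricted. Accumulo itself requires parentheses when the two operators are mixed, so treat this as an illustration rather than a faithful port.

```python
import re

# One token is either a label (letters, digits and a few punctuation marks)
# or a single operator/parenthesis. The label character set is an assumption.
_TOKEN = re.compile(r"\s*([A-Za-z0-9_\-.:/]+|[&|()])")


def tokenize(expr):
    """Split an authorization expression into labels, operators and parens."""
    expr = expr.strip()
    tokens = []
    pos = 0
    while pos < len(expr):
        match = _TOKEN.match(expr, pos)
        if not match:
            raise ValueError(f"unexpected character {expr[pos]!r} at {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def is_visible(expr, user_auths):
    """Return True if user_auths satisfies the expression stored on a document."""
    tokens = tokenize(expr)
    if not tokens:
        return True  # assumption: an empty expression means "no restriction"
    auths = frozenset(user_auths)
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def take():
        nonlocal pos
        tok = peek()
        pos += 1
        return tok

    def parse_or():
        result = parse_and()
        while peek() == "|":
            take()
            rhs = parse_and()  # always parse the right side, then combine
            result = result or rhs
        return result

    def parse_and():
        result = parse_term()
        while peek() == "&":
            take()
            rhs = parse_term()
            result = result and rhs
        return result

    def parse_term():
        tok = take()
        if tok == "(":
            result = parse_or()
            if take() != ")":
                raise ValueError("missing closing parenthesis")
            return result
        if tok is None or tok in ("&", "|", ")"):
            raise ValueError(f"expected a label or '(' but found {tok!r}")
        return tok in auths  # a bare label is satisfied if the user holds it

    result = parse_or()
    if pos != len(tokens):
        raise ValueError(f"unexpected token {tokens[pos]!r}")
    return result


if __name__ == "__main__":
    print(is_visible("A&B&(C|D|E|F)", {"A", "B", "D"}))
    print(is_visible("A&B&(C|D|E|F)", {"A", "D"}))
```

In a post filter, a check along these lines would run once per candidate document against its stored "authorizations" field, with the user's authorizations taken from the fq parameter, so it is worth caching the parsed form of frequently repeated expressions.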