Modelling Access Control

2010-10-23 Thread Paul Carey
Hi, my domain model is made up of users that have access to projects, which are composed of items. I'm hoping to use Solr and would like to make sure that searches only return results for items that users have access to. I've looked over some of the older posts on this mailing list about access contro

Re: A bug in ComplexPhraseQuery ?

2010-10-23 Thread jmr
iorixxx wrote: > >> > class="org.apache.solr.search.ComplexPhraseQParserPlugin"> >>     > name="inOrder">false >>   >> > > I added this change to SOLR-1604, can you test it and give us feedback? > > Many thanks. I'll test this quite soon and let you know. J-Michel -- View this message in conte

Re: xpath processing

2010-10-23 Thread Ben Boggess
> processor="FileListEntityProcessor" fileName=".*xml" recursive="true" Shouldn't this be fileName="*.xml"? Ben On Oct 22, 2010, at 10:52 PM, pghorp...@ucla.edu wrote: > > > > > > processor="FileListEntityProcessor" fileName=".*xml" recursive="true" > baseDir="C:\data\sample_records\mod

Re: Spatial

2010-10-23 Thread Grant Ingersoll
On Oct 20, 2010, at 12:14 PM, Pradeep Singh wrote: > Thanks for your response Grant. > > I already have the bounding box based implementation in place. And on a > document base of around 350K it is super fast. > > What about a document base of millions of documents? While a tier based > approac

Re: Import From MYSQL database

2010-10-23 Thread do3do3
What I know is that you define your fields in the schema.xml file and build a database_conf.xml file which contains the identification for your database. Finally, you should define the dataimporthandler in the solrconfig.xml file. I put a sample of what you should do in the first post in this topic; you can check it. if i know a

Re: Import From MYSQL database

2010-10-23 Thread do3do3
I found these files but I can't find any useful info inside them; what I found is a GET command in an HTTP request. -- View this message in context: http://lucene.472066.n3.nabble.com/Import-From-MYSQL-database-tp1738753p1756778.html

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
Here are all the files: http://rghost.net/3016862 1) StandardAnalyzer.java, StandardTokenizer.java - patched files from lucene-2.9.3 2) I patch these files and build lucene by typing "ant" 3) I replace lucene-core-2.9.3.jar in solr/lib/ by my lucene-core-2.9.3-dev.jar that I'd just compiled 4) tha

Re: Solr Javascript+JSON not optimized for SEO

2010-10-23 Thread PeterKerk
Unfortunately it's not online yet, but is there anything I can clarify in more detail? Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Javascript-JSON-not-optimized-for-SEO-tp1751641p1758054.html

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Ahmet Arslan
Did you delete the folder Jetty_0_0_0_0_8983_solr.war_** under apache-solr-1.4.1\example\work? --- On Sat, 10/23/10, Sergey Bartunov wrote: > From: Sergey Bartunov > Subject: Re: How to index long words with StandardTokenizerFactory? > To: solr-user@lucene.apache.org > Date: Saturday, October

Re: Modelling Access Control

2010-10-23 Thread Israel Ekpo
Hi Paul, Regardless of how you implement it, I would recommend you use filter queries for the permissions check rather than making it part of the main query. On Sat, Oct 23, 2010 at 4:03 AM, Paul Carey wrote: > Hi > > My domain model is made of users that have access to projects which > are com

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
Yes. I did. Won't help. On 23 October 2010 17:45, Ahmet Arslan wrote: > Did you delete the folder Jetty_0_0_0_0_8983_solr.war_** under > apache-solr-1.4.1\example\work? > > --- On Sat, 10/23/10, Sergey Bartunov wrote: > >> From: Sergey Bartunov >> Subject: Re: How to index long words with Stan

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Ahmet Arslan
I think you should replace your new lucene-core-2.9.3-dev.jar in \apache-solr-1.4.1\lib and then create a new solr.war under \apache-solr-1.4.1\dist. And copy this new solr.war to solr/example/webapps/solr.war --- On Sat, 10/23/10, Sergey Bartunov wrote: > From: Sergey Bartunov > Subject: Re

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Yonik Seeley
On Fri, Oct 22, 2010 at 12:07 PM, Sergey Bartunov wrote: > I'm trying to force solr to index words which length is more than 255 If the field is not a text field, the Solr's default analyzer is used, which currently limits the token to 256 bytes. Out of curiosity, what's your usecase that you rea

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
Look at the schema.xml that I provided. I use my own "text_block" type, which is derived from "TextField", and I force the use of StandardTokenizerFactory with the tokenizer tag. If I use the StrField type there are no problems with indexing big data. The problem is in the tokenizer. On 23 October 2010 18:55,

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
This is exactly what I did. Look: >> >> 3) I replace lucene-core-2.9.3.jar in solr/lib/ by >> my >> >> lucene-core-2.9.3-dev.jar that I'd just compiled >> >> 4) than I do "ant compile" and "ant dist" in solr >> folder >> >> 5) after that I recompile >> solr/example/webapps/solr.war On 23 October

Re: xpath processing

2010-10-23 Thread Ken Stanley
On Fri, Oct 22, 2010 at 11:52 PM, wrote: > > > > > > processor="FileListEntityProcessor" fileName=".*xml" recursive="true" > baseDir="C:\data\sample_records\mods\starr"> > url="${f.fileAbsolutePath}" stream="false" forEach="/mods" > transformer="DateFormatTransformer,RegexTransformer,Templat

Re: Modelling Access Control

2010-10-23 Thread Dennis Gearon
Two things will lessen the Solr administrative load: 1/ Follow the examples of databases and *nix OSs. Give each user their own group, or set up groups that don't have regular users as OWNERS, but can have users assigned to the group to give them particular permissions, i.e. roles, like publisher

Re: Modelling Access Control

2010-10-23 Thread Dennis Gearon
Why use filter queries? Wouldn't reducing the set headed into the filters by putting it in the main query be faster? (A question to learn, since I do NOT know :-) Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better i

Re: Modelling Access Control

2010-10-23 Thread Dennis Gearon
Forgot to add, 3/ The external application code selects the GROUPS that the user has permission to read (Solr will only serve up what is to be read?), then searches on those groups. Dennis Gearon

Re: Multiple indexes inside a single core

2010-10-23 Thread Erick Erickson
Ah, I should have read more carefully... I remember this being discussed on the dev list, and I thought there might be a Jira attached but I sure can't find it. If you're willing to work on it, you might hop over to the solr dev list and start a discussion, maybe ask for a place to start. I'm sur

Re: FieldCache

2010-10-23 Thread Erick Erickson
Why do you want to? Basically, the caches are there to improve #searching#. To search something, you must index it. Retrieving it is usually a rare enough operation that caching is irrelevant. This smells like an XY problem, see: http://people.apache.org/~hossman/#xyproblem If this seems like gib

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Ahmet Arslan
Oops, I am sorry, I thought that solr/lib referred to solrhome/lib. I just tested this and it seems that you have successfully increased the max token length. You can verify this with the analysis.jsp page. Despite analysis.jsp's output, it seems that some other mechanism is preventing this huge token t

Re: Solr sorting problem

2010-10-23 Thread Erick Erickson
In general, the behavior when sorting is not predictable when sorting on a tokenized field, which "text" is. What would it mean to sort on a field with "erick" "Moazzam" as tokens in a single document? Should it be in the "e"s or the "m"s? That said, you probably want to watch out for case Be

Re: MoreLikeThis explanation?

2010-10-23 Thread Koji Sekiguchi
Hi Darren, Usually patches are written for the latest trunk branch at the time. I've just updated the patch. Try it for the current trunk if you prefer. Koji -- http://www.rondhuit.com/en/ (10/10/22 19:10), Darren Govoni wrote: Hi Koji, I tried to apply your patch to the 1.4.0 tagged bran

Re: Modelling Access Control

2010-10-23 Thread Savvas-Andreas Moysidis
Pushing ACL logic outside Solr sounds like a prudent choice indeed as, in my opinion, all of the business rules/conceptual logic should reside only within the code boundaries. This way your domain will be easier to model and your code easier to read, understand and maintain. More information on Filter Qu

Re: pf parameter in edismax (SOLR-1553)

2010-10-23 Thread Jan Høydahl / Cominvent
Answering my own question: The "pf" feature only kicks in with multi term "q" param. In my case I used a field tokenized by KeywordTokenizer, hence pf never kicked in. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 14. okt. 2010, at 13.29, Jan Høydahl / Cominvent w

Re: Modelling Access Control

2010-10-23 Thread Israel Ekpo
Hi All, I think using filter queries will be a good option to consider for the following reasons:
* The filter query does not affect the score of the items in the result set. If the ACL logic is part of the main query, it could influence the scores of the items in the result set.
* Using a
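The filter-query approach recommended in this thread could look roughly like the sketch below. It is only an illustration under stated assumptions: the field name "project_id", the integer project ids, and the helper build_select_params are hypothetical and do not come from any of the posts. Only the standard Solr request parameters q, fq and wt are used. The application looks up which projects the user may read and passes them as fq, so the permission check never touches scoring.

```python
from urllib.parse import urlencode


def build_select_params(user_query, project_ids):
    """Return Solr /select parameters with the permission check as a filter query."""
    if not project_ids:
        # No readable projects: the caller should skip the search entirely.
        raise ValueError("user has no accessible projects")
    # "project_id" is a hypothetical field holding the project each item belongs to.
    # int() assumes numeric ids and keeps stray query syntax out of the filter.
    allowed = " OR ".join(str(int(pid)) for pid in project_ids)
    return [
        ("q", user_query),                  # main query, the only part that is scored
        ("fq", f"project_id:({allowed})"),  # ACL restriction, does not affect scores
        ("wt", "json"),
    ]


# Example: a user who may read projects 12 and 34 searches for "title:solr".
params = build_select_params("title:solr", [12, 34])
print(urlencode(params))
```

The resulting query string would be appended to the select handler URL of whatever Solr instance is in use. Keeping the ACL clause in fq also lets Solr cache the filter separately from the user's query.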

Re: How to delete a SOLR document if that particular data doesnt exist in DB?

2010-10-23 Thread bbarani
Thanks a lot for all your replies. I finally wrote a program which will fetch and store all the UID from source (DB) in one list and fetch and store all the UID from SOLR document in another list. Next using the binarySearch method of collection I was able to filter out the list of UID's that ar
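A minimal sketch of the comparison described above, assuming (per the thread subject) that the goal is to delete Solr documents whose UID no longer exists in the DB. It swaps the binary search mentioned in the post for a set lookup, which needs no sorted input. The function names are hypothetical, and fetching the two UID lists is left out.

```python
from xml.sax.saxutils import escape


def find_stale_uids(db_uids, solr_uids):
    """Return the UIDs that are indexed in Solr but no longer present in the DB."""
    db_set = set(db_uids)
    return sorted(uid for uid in set(solr_uids) if uid not in db_set)


def build_delete_message(uids):
    """Build a Solr XML delete-by-id message for the given UIDs."""
    ids = "".join(f"<id>{escape(str(uid))}</id>" for uid in uids)
    return f"<delete>{ids}</delete>"


# Example with made-up UIDs: c3 and d4 exist only in the index.
stale = find_stale_uids(["a1", "b2"], ["a1", "b2", "c3", "d4"])
message = build_delete_message(stale)
print(message)
```

The message would be posted to the update handler and followed by a commit. If no stale UIDs are found, the caller should skip the request rather than send an empty delete.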