Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread deviantcode
Hi, I am not sure I understand what you mean. Could you kindly elaborate further? Thanks

Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread deviantcode
Hi Jurgen, Thanks for the reply. There actually is a complex query, which I oversimplified in the post as: "q=id:(id1 OR id2 OR id3 OR id4 ... OR id4 ) AND name:*". I am not searching for docs with those ids; I wish to restrict my actual search query to those ids, similar to how an 'fq' works.

unloading a solr core doesn't free any memory

2014-11-17 Thread Ofer Fort
Getting a lot of those today. Is it all from the same site we saw last week?

Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread Alexandre Rafalovitch
On 17 November 2014 04:47, deviantcode wrote: > Find all parent-ids from the list of 40,000 ids, (subset of all the parent > ids) And how do you calculate that subset? Is that absolutely not something that translates into the rules that can be codified in Solr? Just passing 40,000 ids into Solr

Solr HTTP client authentication

2014-11-17 Thread Bai Shen
I am using solrj to connect to my solr server. However I need to authenticate against the server and can not find out how to do so using solrj. Is this possible or do I need to drop solrj? I can manually create an httpclient and set up authentication but then I can't use solrj. Thanks.

Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread deviantcode
Unfortunately no, the ids come back from an external service (Spark) that performs complex aspects of the user query which, as far as I can tell, cannot be easily done in Solr. For example, assuming, from the "family" schema described previously, the children docs record "weight", "height", "year" an

Boosting mixed field types

2014-11-17 Thread eakarsu
I have several field types and would like to assign the correct boosts so that I get results in the correct order. Here is a summary of what I have: 1- Product Title - text field, Boost = 160 2- Product Description - text field, Boost = 80 3- Number of clicks - Integer field, having value [1 TO 1000] ,

Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread Erik Hatcher
&fq={!terms}... in theory ought to do the trick pretty performantly. How's that work for you? Erik > On Nov 17, 2014, at 08:33, deviantcode wrote: > > Unfortunately no, the ids come back from an external service (spark) that > performs complex aspects of the user query which as far is i
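A minimal sketch of how the {!terms} filter suggested above might be built on the client side. The field name "id", the main query "name:*", and the helper names are assumptions for illustration, not taken from the thread. With tens of thousands of ids, the encoded parameters would probably need to go in a POST body rather than in the URL.

```python
from urllib.parse import urlencode


def build_terms_filter(field, ids):
    """Return an fq value that uses Solr's terms query parser."""
    # {!terms f=<field>} expects a comma-separated list of raw terms by default.
    return "{!terms f=%s}%s" % (field, ",".join(ids))


def build_search_body(main_query, field, ids):
    """URL-encode the parameters for a request to a /select handler."""
    params = [
        ("q", main_query),                        # the real search query
        ("fq", build_terms_filter(field, ids)),   # restricts results to the id subset
        ("wt", "json"),
    ]
    return urlencode(params)


# Hypothetical ids standing in for the list returned by the external service.
ids = ["id1", "id2", "id3"]
fq = build_terms_filter("id", ids)
body = build_search_body("name:*", "id", ids)
print(fq)
```

Keeping the id list in fq rather than in q means it only filters the result set and does not contribute to scoring, which is the behaviour asked for earlier in the thread.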

Re: Boosting mixed field types

2014-11-17 Thread Ahmet Arslan
Hi, Edismax has the boost parameter, for example. It is a multiplicative boost: boost=log(NumberOfClicks) https://cwiki.apache.org/confluence/display/solr/The+Extended+DisMax+Query+Parser Ahmet On Monday, November 17, 2014 3:38 PM, eakarsu wrote: I have several field types and like to assign
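A rough sketch of how per-field boosts and the multiplicative boost could be combined in one edismax request. The field names "title" and "description" and the query term are assumptions; NumberOfClicks is the name used in the reply above. Solr's log() function is base 10, so under this formula a document with exactly one click would get a multiplier of 0, which is worth checking before relying on it.

```python
import math
from urllib.parse import urlencode

# Hypothetical request parameters: qf carries the per-field boosts,
# boost carries the multiplicative function applied to the whole score.
params = {
    "defType": "edismax",
    "q": "laptop",
    "qf": "title^160 description^80",
    "boost": "log(NumberOfClicks)",
}
query_string = urlencode(params)


def boosted_score(text_score, clicks):
    """Illustrate the multiplicative effect: text score times log10(clicks)."""
    return text_score * math.log10(clicks)


print(query_string)
print(boosted_score(2.0, 100))
```

The function only illustrates the arithmetic of a multiplicative boost; the real text score is computed by Solr from the qf fields.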

Re: Restrict search to subset (a list of approx. 40,000 ids from an external service) of corpus

2014-11-17 Thread deviantcode
Thanks Erik, I will give that a go and try to work out the number of ids I can safely pass to {!terms}. Also, does this confirm that Solr cannot execute queries such as the three I listed earlier? Thanks

Re: Can I be added to the Wiki contributors group?

2014-11-17 Thread Erick Erickson
done, thanks! On Sun, Nov 16, 2014 at 8:28 PM, Xavier Morera wrote: > I mean for: https://wiki.apache.org/solr/FrontPage > > My username is XavierMorera > > Regards, > Xavier > > -- > > *Xavier Morera* > > Entrepreneur | Author & Trainer | Consultant | Developer & Scrum Master > > *www.xaviermore

Re: Does ReRankQuery support reranking the result of a FuzzyQuery?

2014-11-17 Thread Brian Sawyer
To answer myself, looks like this was fixed as part of https://issues.apache.org/jira/browse/SOLR-6323. On Mon, Nov 10, 2014 at 1:50 PM, Brian Sawyer wrote: > Hello, > > We are trying to make use of the new ReRankQuery to rescore results > according to a custom function but run into problems whe

Re: Boosting mixed field types

2014-11-17 Thread eakarsu
Ahmet, Thanks. The boost parameter determines the boost value for the whole query, but I am also assigning boosts to individual fields. I worry whether the boost parameter and the individual boosts with bf and pdf will rank results properly. Erol Akarsu

Re: Solr HTTP client authentication

2014-11-17 Thread Anurag Sharma
I think Solr encourages SSL rather than authentication. On Mon, Nov 17, 2014 at 6:08 PM, Bai Shen wrote: > I am using solrj to connect to my solr server. However I need to > authenticate against the server and can not find out how to do so using > solrj. Is this possible or do I need to drop solrj? I

RE: Solr HTTP client authentication

2014-11-17 Thread Fuad Efendi
> I can > manually create an httpclient and set up authentication but then I can't use > solrj. Yes, correct, except that you _can_ use solrj with this custom HttpClient instance (which will intercept authentication and will support cookies, SSL or plain HTTP, Keep-Alive, etc.) You can

Re: Solr HTTP client authentication

2014-11-17 Thread Bai Shen
I had seen where I could pass in an HttpClient to the SolrServer. The problem is that the HttpClient only receives the authentication information through the execute method using the context. See the example located here. https://hc.apache.org/httpcomponents-client-4.3.x/tutorial/html/authenticat

Re: Solr HTTP client authentication

2014-11-17 Thread Jürgen Wagner (DVT)
Why rely on the default http client? Why not create one with HttpClients.custom() .setDefaultSocketConfig(socketConfig) .setDefaultRequestConfig(requestConfig) .setSSLSocketFactory(sslsf) .build(); that has the SSLConnectionSocketFactory property set up with an SSL

Shard splitting and HDFS

2014-11-17 Thread Joseph Obernberger
I tried to split a shard using HDFS storage, and at first I received this error: org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: Error CREATEing SolrCore 'COLLECT1_shard1_0_replica1': Unable to create core [COLLECT1_shard1_0_replica1] Caused by: Direct buffer memory at

Shard splitting and HDFS

2014-11-17 Thread Joseph Obernberger
If I create the directory manually on the server that I'm splitting: COLLECT_shard1_0_replica1 Then do the shard split command, it works OK. -Joe

Re: Hierarchical faceting

2014-11-17 Thread rashmy1
Hi Alexandre, Yes, I've read this post and that's the 'Option1' listed in my initial post. I'm looking to see if Solr has any built-in tokenizer that splits the tokens and prepends the depth information. I'd like to avoid building depth information into the field values if Solr already has so
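For reference, a small sketch of what "splitting the tokens and prepending the depth" could look like if done on the client before indexing. This is the approach the poster would like to avoid, not a built-in Solr tokenizer; the function name, the "/" separator, and the sample path are assumptions.

```python
def depth_prefixed_tokens(path, sep="/"):
    """Split a category path and prefix every ancestor path with its depth."""
    parts = [p for p in path.split(sep) if p]
    # Depth 0 is the root category; each token carries the full path down to that level.
    return ["%d%s%s" % (depth, sep, sep.join(parts[: depth + 1]))
            for depth in range(len(parts))]


tokens = depth_prefixed_tokens("Books/Fiction/Sci-Fi")
print(tokens)
```

Faceting on such a field with a prefix like "1/Books/" would then return only the children one level below "Books".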

More HDFS and Shard Splitting

2014-11-17 Thread Joseph Obernberger
Originally I had two shards on two machines - shard1 and shard2. I did a SHARDSPLIT on shard1. Now I have shard1, shard2, and shard1_0. If I select the core (COLLECT_shard1_0_replica1) and execute a query, I get all the docs OK, but if I specify &distrib=false, I get 0 documents. Under HDFS - when/h

Re: Hierarchical faceting

2014-11-17 Thread Alexandre Rafalovitch
You might be able to stick in a couple of PatternReplaceFilterFactory in a row with regular expressions to catch different levels. Something like: ... I did not test this, you may need to escape some things or put explicit groups in there. Regards, Alex. P.s. http://www.solr-start.com/java

Internals of Analysis and Token Matching

2014-11-17 Thread Pritesh Patel
Hi Community. Hoping someone can help explain this ... Once all the analysis is done on a field all the tokens to identify that field are stored. What else is affecting a match to the document beyond a simple token match and frequency of terms that match? All the searches I did produce the same

Re: Internals of Analysis and Token Matching

2014-11-17 Thread Alexandre Rafalovitch
Are you trying to match phone numbers despite the spaces/dashes/brackets? By prefix? Suffix? If so, you may look at something more like: And remember, if you are using ngrams, you probably want them in the index-chain of the analyzer, but not in the query-chain. Otherwise, you will be matching
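The point about keeping n-grams out of the query chain can be illustrated with a toy model. This is only a sketch in plain Python, not Solr's analysis code: the normalisation rule, the gram sizes, and the sample numbers are assumptions.

```python
import re


def normalize(phone):
    """Drop spaces, dashes, and brackets, keeping only the digits."""
    return re.sub(r"\D", "", phone)


def edge_ngrams(token, min_gram=3, max_gram=15):
    """Return the prefixes of token, from min_gram up to max_gram characters."""
    upper = min(len(token), max_gram)
    return [token[:n] for n in range(min_gram, upper + 1)]


# Index chain: normalise, then emit edge n-grams.
indexed = set(edge_ngrams(normalize("(212) 555-0134")))

# Query chain without n-grams: the whole normalised input must be an indexed prefix.
prefix_hit = normalize("212-555") in indexed

# Query chain with n-grams (the setup warned against): a different number
# now matches merely because it shares the three-digit prefix.
accidental = set(edge_ngrams(normalize("212 999 9999"))) & indexed

print(prefix_hit, accidental)
```

With n-grams only on the index side, a query matches when the full normalised input is a prefix of the stored number; adding them on the query side lets any shared short gram produce a match.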

Re: More HDFS and Shard Splitting

2014-11-17 Thread Erick Erickson
Tell us more about your HDFS stuff. Specifically, how do you have your HDFSDirectoryFactory specified in solrconfig.xml? Cause you shouldn't have to do things like create the directory ahead of time I don't think. Best, Erick On Mon, Nov 17, 2014 at 12:17 PM, Joseph Obernberger wrote: > Origina

chaos monkey

2014-11-17 Thread Arpit Agarwal
Hi All, I have setup Solr 4.9.0 in a 3 shard configuration running on tomcat servers. I want to run a test with chaos-monkey to ensure the availability of the entire system. Can someone tell me, how I can integrate chaos-monkey with Solr or make use of Solr test-framework to do the same? Thanks &

Re: More HDFS and Shard Splitting

2014-11-17 Thread Joseph Obernberger
Looks like the shard split failed, and only created one additional shard. I didn't allocate enough memory for 3x - since two additional shards needed to be created. I was allocating 20G for each shard, so in order to do the split, I needed to give 60G for the direct memory access. I've now switched

Solr 5 release date ?

2014-11-17 Thread roy123
Hi, Does anyone know when Solr 5.0 is scheduled for release? I'm planning to upgrade to 4.10.2, but will wait if there's a plan to roll out 5.0 pretty soon. -Thanks

Re: chaos monkey

2014-11-17 Thread Manoj Bharadwaj
Chaos monkey is designed to work in an AWS context. It uses your AWS account credentials and via API terminates instances in EC2. So if you have your shards in EC2, the documentation can be followed to configure it to terminate the instances as needed. If you are looking specifically to terminate j

Re: Solr 5 release date ?

2014-11-17 Thread Erick Erickson
There are rumblings about sometime in December IIRC, nothing's been committed to though. Best, Erick On Mon, Nov 17, 2014 at 4:24 PM, roy123 wrote: > Hi, > > Does anyone when Solr 5.0 is scheduled to release ? I'm planning to > upgrade to 4.10.2, but will wait if there's a plan to roll-out 5.

Re: Hierarchical faceting

2014-11-17 Thread Jason Hellman
I realize you want to avoid putting depth details into the field values, but something has to imply the depth. So with that in mind, here is another approach (with the assumption that you are chasing down a single branch of a tree (and all its subbranch offshoots)), Use dynamic fields Step fro

Re: Hierarchical faceting

2014-11-17 Thread Evan Pease
>I'm looking to see if Solr has any in-built tokenizer that splits the tokens >and prepends with the depth information. I'd like to avoid building depth >information into the filed values if Solr already has something that can be >used. So the goal is to find out the level of the tree for each cat