Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Hi All, Solr 5.4, Ubuntu I thought it was simple to request across two collections with the same schema but not. I have one solr instance launch. 300 000 records in each collection. I try to use this request without having both results: http://my_adress:my_port/solr/C1/select?collection=C1,C2&

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Are Id values for docs in both the collections exactly same? To get proper results, the ids should be unique across both the cores. On Wed, 6 Jan 2016, 15:11 Bruno Mannina wrote: > Hi All, > > Solr 5.4, Ubuntu > > I thought it was simple to request across two collections with the same > schema b

Re: solr 5.2.0 need to build high query response

2016-01-06 Thread Novin Novin
Thanks Erick, this listener doing quite a good job. But not what I needed. Do the solr has any other things that I can look into to make it faster. FYI speed goes to 1 sec to 1.2 sec. I actually needed around 500 ms. On Tue, 5 Jan 2016 at 18:24 Erick Erickson wrote: > Yep. Do note what's happe

Re: Many patterns against many sentences, storing all results

2016-01-06 Thread Charlie Hull
On 05/01/2016 16:05, Allison, Timothy B. wrote: Might want to look into: https://github.com/flaxsearch/luwak Yes, this sounds like a very good fit for Luwak. We built it originally for media monitoring applications where one also needs just a hit/no-hit result. It's running in production at

Re: Query behavior difference.

2016-01-06 Thread Modassar Ather
Please help me understand why queries like wildcard, prefix and few others are re-written into constant score query? Why the scoring factors are not taken into consideration in such queries? Please correct me if I am wrong that this behavior is per the query type irrespective of the parser used.

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
yes id value is unique in C1 and unique in C2. id in C1 is never present in C2 id in C2 is never present in C1 Le 06/01/2016 11:12, Binoy Dalal a écrit : Are Id values for docs in both the collections exactly same? To get proper results, the ids should be unique across both the cores. On Wed, 6

Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Gian Maria Ricci - aka Alkampfer
I've issued a command to create some collections, but there were an error in solrconfig.xml (I've specified wrong path to dataimporthandler.jar files). The creation of the collection failed but now I don't know how to cleanup everything. This is a test solrcloud where I'm experimenting in windo

core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi I am new to solr. i have a doubt in understanding difference between core and collection. >As far as i understand, cores can be created when solr is run in local mode and collections in solrCloud. Can you please help me if i am wrong. >And why do we shard a collection? i read like - When your d

Re: Solr 6 Distributed Join

2016-01-06 Thread Akiel Ahmed
Hi Dennis/Joel, Thank you for your help to date - I must say this user group is very responsive :-) Cheers Akiel From: Dennis Gove To: solr-user@lucene.apache.org Date: 05/01/2016 13:22 Subject:Re: Solr 6 Distributed Join Akiel, https://issues.apache.org/jira/browse/SOLR

Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
1) A collection is simply a logical group and can consist of multiple cores. The core is a representation of a single physical index or part of an index. Both cores and collections can be created in local as well as cloud modes. 2) Sharding is performed to distribute your index over multiple machin

Re: Query behavior difference.

2016-01-06 Thread Emir Arnautovic
Hi Modassar, It usually helps if you analyze extreme case: e.g. fl:a* What terms should be better match? Those who are shorter or all should be equally good? What should be top document? Assuming standard TF/IDF scoring is used, that would be one with the most terms that start with 'a' especiall

Re: core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi You described that sharding is to distribute data over multiple machines.Do I have to make sure that all those machines have solr server or gateway should be deplyed ? And what multiple JVM processes run behind a solr server running? I wanted to know what is a node. -> I understood like a mchin

Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
The machines part may have been a bit misleading. I am sorry for that. What I actually meant was shards. Now, you can have multiple shards hosted on a single machine or multiple machines as in the example I gave. "I have to make sure that all those machines have solr server or gateway should be de

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
Hi Bruno, I just tested this scenario in my local solr 5.3.1 and it returned results from two identical collections. I doubt if it is broken in 5.4 just double check if you are not missing anything else. Thanks, Susheel http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&c

Count multivalued field issue

2016-01-06 Thread marotosg
Hi, I am trying to add a new field to my schema to add the number of items of a multivalued field. I am using solr 4.11 These are my fields on *schema.xml* Here is the update done to my *solrconfig.xml*. I created an updateRequestProcessorChain and add it to the update handler

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Emir Arnautovic
Hi Bruno, Can you check counts? Is it possible that first page is only with results from collection that you sent request to so you assumed it returns only results from single collection? Thanks, Emir On 06.01.2016 14:33, Susheel Kumar wrote: Hi Bruno, I just tested this scenario in my loca

Re: Count multivalued field issue

2016-01-06 Thread GW
When dealing with Solr data you need to decide whether or not to go programming. when I want to count a multi-value I go programming. $count = array_count($array); On 6 January 2016 at 08:43, marotosg wrote: > Hi, > > I am trying to add a new field to my schema to add the number of items of

Re: Count multivalued field issue

2016-01-06 Thread Binoy Dalal
You haven't given any name to your update handler. How are you planning to call it? On Wed, 6 Jan 2016, 19:41 GW wrote: > When dealing with Solr data you need to decide whether or not to go > programming. > > when I want to count a multi-value I go programming. > > > $count = array_count($array)

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Another diagnostic you can perform is to query for a doc you know is on the other collection and see if it is returned. If it is then you might simply not be seeing the results as emir says. On Wed, 6 Jan 2016, 19:27 Emir Arnautovic wrote: > Hi Bruno, > Can you check counts? Is it possible that

Re: Count multivalued field issue

2016-01-06 Thread Jack Krupansky
Out of curiosity, where did you get your example code from - so we can assure that it gets corrected? Here's a valid example, from de-dupe: dedupe ... Note it is the request handler for "/update", not the "update handler." See: https://cwiki.apache.org/confluence/display/solr/De-Dup

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Hi Susheel, Emir, yes I check, and I have one result in c1 and in c2 with the same query fid:34520196 http://xxx.xxx.xxx.xxx:/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2 { "responseHeader":{ "status":0, "QTime":1, "params":{ "fl":"fid,cc*,st", "ind

Re: Query behavior difference.

2016-01-06 Thread Jack Krupansky
The motivation for the constant-score rewrite is simply performance. As per the Javadoc: "*This method is faster than the BooleanQuery rewrite methods when the number of matched terms or matched documents is non-trivial. Also, it will never hit an errant BooleanQuery.TooManyClauses exception.*" S

Solr server not starting

2016-01-06 Thread agonn Qurdina
Hi, I am using Solr server with Echoprint service (https://github.com/echonest/echoprint-server). The first time I started it everything worked perfectly. This is the way I started it: java -Dsolr.solr.home=/home/echoprint-server/solr/solr/solr/ -Djava.awt.headless=true -Xmx2048m -Xms2048m

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
I'll suggest if you can setup some some test data locally and try this out. This will confirm your understanding. Thanks, Susheel On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina wrote: > Hi Susheel, Emir, > > yes I check, and I have one result in c1 and in c2 with the same query > fid:34520196

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
I have a dev' server, I will do some test on it... Le 06/01/2016 17:31, Susheel Kumar a écrit : I'll suggest if you can setup some some test data locally and try this out. This will confirm your understanding. Thanks, Susheel On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina wrote: Hi Susheel

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Same result on my dev' server, it seems that collection param haven't effect on the query... Q: I don't see on the solr 5.4 doc, the "collection" param for select handler, is it always present in 5.4 version ? Le 06/01/2016 17:38, Bruno Mannina a écrit : I have a dev' server, I will do some

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread esther . quansah
Hi Bruno, You might consider using the JoinQueryParser. Details here : https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser Best, Esther > Le 6 janv. 2016 à 08:48, Bruno Mannina a écrit : > > Same result on my dev' server, it seems that collection param

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Hi Ester, yes, i saw it, but if I use: q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2) I have only the result from the collection used in the select/c1 Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit : Hi Bruno, You might consider using the JoinQueryParser. D

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Bruno, Use join like so: {!join from=f1 to=f2 fromIndex=c2} On c1 On Wed, 6 Jan 2016, 22:30 Bruno Mannina wrote: > Hi Ester, > > yes, i saw it, but if I use: > > q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2) > > I have only the result from the collection used in the selec

Re: Solr server not starting

2016-01-06 Thread Erick Erickson
I doubt we'll be much help, it's probably best to talk to the echoprint people, assuming any are still available. I took a quick look at the project and the Solr implementation is from 4+ years ago... Best, Erick On Wed, Jan 6, 2016 at 8:11 AM, agonn Qurdina wrote: > Hi, > > > > I am using Solr

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
:( not work for me http://my_adress:my_port/solr/c1/select?q={!join from=fid to=fid fromIndex=c2}fid:34520196&wt=json the result is always the same, it answer only for c1 34520196 has result in both collections Le 06/01/2016 18:16, Binoy Dalal a écrit : Bruno, Use join like so: {!join from=

Re: core,Collection,Shard,Replication

2016-01-06 Thread Erick Erickson
bq: But when indexing a document in one shard,it gets reflected in every shard of that collection This is a misunderstanding (and I'm being a bit pedantic here). Each shard contains a portion of the entire corpus. Say you have 1M docs and 2 shards. Each shard will have very close to 500K documents

Re: Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Erick Erickson
The mail server is quite aggressive about removing attachments, none of yours came through. Perhaps put them somewhere else and provide a link? Best, Erick On Wed, Jan 6, 2016 at 3:22 AM, Gian Maria Ricci - aka Alkampfer < alkamp...@nablasoft.com> wrote: > I’ve issued a command to create some co

Re: solr 5.2.0 need to build high query response

2016-01-06 Thread Erick Erickson
Right, the point of the newSearcher and firstSearcher events is to smooth out spikes in response time, and it looks like that's happening now. >From here, you're in to query tuning. Without knowing your queries, your schema, how many docs on a server, memory allocation and all that it's impossibl

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Esther-Melaine Quansah
Ok, so join won’t work. Distributed search is your answer. This worked for me: http://localhost:8983/solr/temp/select?shards=localhost:8983/solr/job,localhost:8983/solr/temp&q=*:* so for you i

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
Hi Bruno, I just tested on 5.4 for your sake and it works fine. You are somewhere goofing up. Please create a new simple schema different from your use case with 2-3 fields with 2-3 documents and test this out independently on your current problem. That's what i can make suggestion and did same

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Shawn Heisey
On 1/6/2016 2:41 AM, Bruno Mannina wrote: > I try to use this request without having both results: > > http://my_adress:my_port/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json > > > this request returns only C1 results and if I do: > > http://my_adress:my_port/solr/C2/select?collection=C1,C2

RE: Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Gian Maria Ricci - aka Alkampfer
Thanks Erick, I did not remember that image attached are not permitted. Here is the links to the two images http://screencast.com/t/HQnWDV6GDoe http://screencast.com/t/qUpt0HxF -- Gian Maria Ricci Cell: +39 320 0136949 -Original Message- From: Erick Erickson [mailto:erickerick...

Re: Memory Usage increases by a lot during and after optimization .

2016-01-06 Thread Shawn Heisey
On 1/5/2016 11:50 PM, Zheng Lin Edwin Yeo wrote: > Here is the new screenshot of the Memory tab of the Resource Monitor. > https://www.dropbox.com/s/w4bnrb66r16lpx1/Resource%20Monitor.png?dl=0 > > Yes, I found that the value under the "Working Set" column is much higher > than the others. Also, the

I cannot create replica in Solr

2016-01-06 Thread persoy
Hi I'm using Solr clouds. I created one solr and zookeeper instances. Then I want to create replica for Solr. I'm using Windows 7 But the following errors are throwing : This is warning message: OverseerCollectionProcessor.processMessage : create , { "name":"contracts", "fromApi":"true", "

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Yeah ! it works with your method ! thanks a lot Esther ! Le 06/01/2016 19:15, Esther-Melaine Quansah a écrit : Ok, so join won’t work. Distributed search is your answer. This worked for me: http://localhost:8983/solr/temp/select?shards=localhost:8983/solr/job,localhost:8983/solr/temp&q=*:*

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Hi Shawn, thanks for this info, I use solr alone on my own server. Le 06/01/2016 20:13, Shawn Heisey a écrit : On 1/6/2016 2:41 AM, Bruno Mannina wrote: I try to use this request without having both results: http://my_adress:my_port/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json thi

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Hi, is it possible that was the problem wrote by Shawn and you have SolrCloud mode (with zookeeper) ? The solution gives by Esther works fine so it's ok for me :) ** Are you running in SolrCloud mode (with zookeeper)? If you're not, then the collection parameter doesn't do anything, and

Re: I cannot create replica in Solr

2016-01-06 Thread Erick Erickson
It looks like you haven't uploaded the configset to Zookeeper so it can be found by the create command. See: https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files. Best, Erick On Wed, Jan 6, 2016 at 1:33 PM, persoy wrote: > Hi > I'm using Solr clouds. I c

Re: Issue with Join

2016-01-06 Thread William Bell
Can a committer take a look at this please? On Wed, Dec 30, 2015 at 4:35 PM, William Bell wrote: > Yeah I ran it. It appears to only have an issue if the field for the core > on the outside is MultiValue, and the inside field is single value, and the > order of the field has the code not as the

Re: Query behavior difference.

2016-01-06 Thread Modassar Ather
Thanks for your responses. Best, Modassar On Wed, Jan 6, 2016 at 9:27 PM, Jack Krupansky wrote: > The motivation for the constant-score rewrite is simply performance. As per > the Javadoc: > > "*This method is faster than the BooleanQuery rewrite methods when the > number of matched terms or ma

Parent Child Schema Design

2016-01-06 Thread Pranaya Behera
Hi, I have read yonik.com/solr-nested-objects/ which states that there is no need for additional schema changes other than having a _root_ which is already present in the schema.xml. But it never specified on for the child elements what would the schema look like. And the post actually us

Re: Memory Usage increases by a lot during and after optimization .

2016-01-06 Thread Zheng Lin Edwin Yeo
Hi Shawn, Thank you for your explanation. Yes, both of the top two processes are Solr. I have two Solr processes on one machine now, as the second one is a replica of the first one. In the future, the plan is to have them on separate machine. >Subtracting SHR from RES (or in your case, Shareabl