JSON facet performance for aggregations

2017-04-30 Thread Mikhail Ibraheem
Hi, I am trying to do aggregation with JSON faceting but performance is very bad for one of the requests: json.facet={ studentId:{ type:terms, limit:-1, field:"studentId", facet:{ x:"sum(grades)" } } }
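For reference, the request above can also be assembled programmatically. This is a minimal Python sketch, not the poster's actual client code: the base URL, the collection name `students`, and the `q=*:*` / `rows=0` parameters are assumptions for illustration. Only the `json.facet` body is taken from the message.

```python
import json
from urllib.parse import urlencode

# Hypothetical endpoint; the thread does not name the host or the collection.
SOLR_SELECT_URL = "http://localhost:8983/solr/students/select"


def build_facet_query(field="studentId", metric="sum(grades)"):
    """Return query parameters for a terms facet with one nested aggregation."""
    facet = {
        field: {
            "type": "terms",
            "field": field,
            "limit": -1,             # return every bucket, as in the original request
            "facet": {"x": metric},  # per-bucket aggregation
        }
    }
    return {
        "q": "*:*",                  # assumed: match all documents
        "rows": 0,                   # assumed: only the facet output is wanted
        "json.facet": json.dumps(facet),
    }


params = build_facet_query()
url = SOLR_SELECT_URL + "?" + urlencode(params)
print(url)
```

Dropping the nested `facet` entry from the dictionary gives a count-only terms facet, which makes it easy to time the two variants against each other.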

Re: JSON facet performance for aggregations

2017-04-30 Thread Vijay Tiwary
JSON facets on string fields run a lot slower than on numeric fields. Try and see if you can represent studentId as a numeric field. On Apr 30, 2017 1:19 PM, "Mikhail Ibraheem" wrote: > Hi, > > I am trying to do aggregation with JSON faceting but performance is very > bad for one of the requests: >

pagination of results of grouping by more than one field

2017-04-30 Thread Mikhail Ibraheem
Hi, I have a problem: I need to group by X and Y, aggregate on Z, and paginate over the results. The results aren't flat; they come back as a hierarchy, so how can we flatten them in order to paginate over each combination of X,Y, like: Computers, Computer Laptops, 5.684790733920929
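One client-side way to approach this is to walk the nested buckets of a JSON facet response and emit one row per (X, Y) combination, then slice the flat list. The Python sketch below assumes a two-level terms facet whose facet names are `x` and `y` and whose metric is named `z`; those names and all sample data except the one row quoted in the message are made up for illustration.

```python
def flatten_buckets(facets, outer, inner, metric):
    """Return a list of (outer value, inner value, metric) rows."""
    rows = []
    for outer_bucket in facets.get(outer, {}).get("buckets", []):
        for inner_bucket in outer_bucket.get(inner, {}).get("buckets", []):
            rows.append(
                (outer_bucket["val"], inner_bucket["val"], inner_bucket.get(metric))
            )
    return rows


def page(rows, offset, limit):
    """Return one page of the flattened rows."""
    return rows[offset:offset + limit]


# Illustrative "facets" section of a response (values are invented, apart from
# the Computers / Computer Laptops row quoted in the message).
sample = {
    "count": 3,
    "x": {"buckets": [
        {"val": "Computers", "count": 2, "y": {"buckets": [
            {"val": "Computer Laptops", "count": 1, "z": 5.684790733920929},
            {"val": "Computer Desktops", "count": 1, "z": 1.5},
        ]}},
        {"val": "Phones", "count": 1, "y": {"buckets": [
            {"val": "Smartphones", "count": 1, "z": 2.0},
        ]}},
    ]},
}

rows = flatten_buckets(sample, "x", "y", "z")
print(page(rows, 0, 2))
```

Note that this paginates only after the full hierarchy has been fetched, so it may not suit very large result sets.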

RE: JSON facet performance for aggregations

2017-04-30 Thread Mikhail Ibraheem
Hi Vijay, It is already a numeric field. There is a huge difference between json and flat here. Do you know the reason for this? Is there a way to improve it? -Original Message- From: Vijay Tiwary [mailto:vijaykr.tiw...@gmail.com] Sent: Sunday, April 30, 2017 9:58 AM To: solr-user@lucene.apach

RE: JSON facet performance for aggregations

2017-04-30 Thread Vijay Tiwary
Please enable doc values and try. There is a bug in the source code which causes json facet on string field to run very slow. On numeric fields it runs fine with doc value enabled. On Apr 30, 2017 1:41 PM, "Mikhail Ibraheem" wrote: > Hi Vijay, > It is already numeric field. > It is huge differen
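If the collection uses a managed schema, doc values can be switched on through the Schema API's `replace-field` command. The Python sketch below only builds the request body. The endpoint, the collection name `students`, and the field type name `double` are assumptions; the real field definition is not shown in the thread.

```python
import json

# Hypothetical endpoint; the body below would be POSTed here as JSON.
SCHEMA_URL = "http://localhost:8983/solr/students/schema"


def replace_field_command(name, field_type):
    """Build a Schema API command that redefines a field with docValues on."""
    return {
        "replace-field": {
            "name": name,
            "type": field_type,   # assumed field type name
            "docValues": True,
        }
    }


body = json.dumps(replace_field_command("studentId", "double"))
print(body)
```

`replace-field` replaces the whole field definition, so any other attributes the field currently has (for example `indexed` or `stored`) would need to be repeated in the command, and existing documents have to be reindexed before the doc values exist.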

RE: JSON facet performance for aggregations

2017-04-30 Thread Mikhail Ibraheem
1- studentId has docValue = true. It is of type double which is 2- If we just facet without aggregation it finishes in good time (60ms): json.facet={ studentId:{ type:terms, limit:-1, field:" studentId " } } Thanks -Original Message- From: Vija

Re: Both main and replica are trying to access solr_gc.log.0.current file

2017-04-30 Thread Zheng Lin Edwin Yeo
I'm starting Solr with this command: bin\solr.cmd start -cloud -p 8983 -s solr\node1\solr -m 8g -z "localhost:9981,localhost:9982,localhost:9983" bin\solr.cmd start -cloud -p 8984 -s solr\node2\solr -m 8g -z "localhost:9981,localhost:9982,localhost:9983" Regards, Edwin On 30 April 2017 at 13:52

Re: JSON facet performance for aggregations

2017-04-30 Thread Yonik Seeley
It is odd there would be quite such a big performance delta. What version of solr are you using? What is the fieldType of "grades"? -Yonik On Sun, Apr 30, 2017 at 5:15 AM, Mikhail Ibraheem wrote: > 1- > studentId has docValue = true . it is of type double which is name="double" class="solr.Trie

Re: Poll: Master-Slave or SolrCloud?

2017-04-30 Thread Yonik Seeley
On Tue, Apr 25, 2017 at 1:33 PM, Otis Gospodnetić wrote: > I think I saw mentions (maybe on user or dev MLs or JIRA) about > potentially, in the future, there only being SolrCloud mode (and dropping > SolrCloud name in favour of Solr). I personally never saw this actually happening, and not becau

RE: JSON facet performance for aggregations

2017-04-30 Thread Mikhail Ibraheem
Hi Yonik, We are using Solr 6.5. Both studentId and grades are double: We have 1.5 million records. Thanks Mikhail -Original Message- From: Yonik Seeley [mailto:ysee...@gmail.com] Sent: Sunday, April 30, 2017 1:04 PM To: solr-user@lucene.apache.org Subject: Re: JSON facet performance

Re: Poll: Master-Slave or SolrCloud?

2017-04-30 Thread Shawn Heisey
On 4/25/2017 3:13 PM, Otis Gospodnetić wrote: > Could one run *only* embedded ZK on some SolrCloud nodes, sans any data? > It would be equivalent of dedicated Elasticsearch nodes, which is the > current ES best practice/recommendation. I've never heard of anyone being > scared of running 3 dedicat

Re: Poll: Master-Slave or SolrCloud?

2017-04-30 Thread Ganesh M
We use ZooKeeper for Hadoop / HBase, so we use the same ensemble for Solr too. We are using SolrCloud on EC2 instances with 6 collections containing 4 shards and 2 replicas. We followed one of the blogs on the internet fo

Re: Solr performance on EC2 linux

2017-04-30 Thread Jeff Wartes
I’d like to think I helped a little with the metrics upgrade that got released in 6.4, so I was already watching that and I’m aware of the resulting performance issue. This was 5.4 though, patched with https://github.com/whitepages/SOLR-4449 - an index we’ve been running for some time now. Mgan

Re: Spatial Search: can not use FieldCache on a field which is neither indexed nor has doc values: latitudeLongitude_0_coordinate

2017-04-30 Thread David Smiley
Frederick, RE LatLonType: Weird. Is the dynamic field "_coordinate" defined? It should be; ensure it has indexed=true on it. I forget if indexed needs to be set on that or on the LLT field that refers to it, but to be sure, set it on both. RE LatLonPointSpatialField: You should use this for sure

Step By Step guide to create Solr Cloud in Solr 6.x

2017-04-30 Thread Nilesh Kamani
Hello All, Sorry to bother you all again. I am having a hard time understanding Solr terminology. Is there any step-by-step guide to creating a SolrCloud cluster in Solr 6.x? I have two servers on my Google Cloud and have installed Solr on both of them. I would like to create one collection, shard1 on se

Slow indexing speed when collection size is large

2017-04-30 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr 6.4.2. I would like to check: if there are a lot of collections in my Solr which have very large index sizes, will the indexing speed be affected? Currently, I have created a new collection in a Solr instance which has several collections with very large index sizes, and the indexing speed is

Re: Step By Step guide to create Solr Cloud in Solr 6.x

2017-04-30 Thread Nilesh Kamani
UPDATE - Able to get shard1 on server and shard2 on server 2 and core on server 1 in the cluster. How can I add another node/core, which is on server 2, to the cluster? On Sun, Apr 30, 2017 at 9:48 PM, Nilesh Kamani wrote: > Hello All, > > Sorry to bother you all again. I am having hard time unde

Re: Step By Step guide to create Solr Cloud in Solr 6.x

2017-04-30 Thread Nilesh Kamani
UPDATE - After restarting the server, I can see that the issue has been resolved for now. On Sun, Apr 30, 2017 at 11:12 PM, Nilesh Kamani wrote: > UPDATE - > > Able to get shard1 on server and shard2 on server 2 and core on server 1 > in the cluster. > > How can I add another node/core to cluster

Building Solr greater than 6.2.1

2017-04-30 Thread Ryan Yacyshyn
Hi all, I'm trying to build Solr 6.5.1 but it is failing. I'm able to successfully build 6.2.1. I've tried 6.4.0, 6.4.2, and 6.5.1 but the build fails. I'm not sure what the issue could be. I'm running `ant server` in the solr dir and this is where it fails: ivy-configure: [ivy:configure] :: lo

BooleanQuery and WordDelimiterFilter

2017-04-30 Thread Avi Steiner
Hi, I have a question regarding the use of query parser and BooleanQuery. I have 3 documents indexed. Doc1 contains the words huntman's and huntman. Doc2 contains the word huntman's. Doc3 contains the word huntman. When I search for huntman's I get Doc1 and Doc2. When I search for +huntman's I get D

Re: Building Solr greater than 6.2.1

2017-04-30 Thread Alexandre Rafalovitch
Make sure your Java is on the latest update. Seriously. Also, if still failing, try blowing away your Ivy cache. Regards, Alex On 1 May 2017 6:34 AM, "Ryan Yacyshyn" wrote: > Hi all, > > I'm trying to build Solr 6.5.1 but it's is failing. I'm able to > successfully build 6.2.1. I've tried 6.4.