Re: Sharding configuration

2014-10-30 Thread Anca Kopetz
Hi, You are right, it is a mistake in my phrase, for the tests with 4 shards/ 4 instances, the latency was worse (therefore *bigger*) than for the tests with one shard. In our case, the query rate is high. Thanks, Anca On 10/30/2014 03:48 PM, Shawn Heisey wrote: On 10/30/2014 4:32 AM, Anca

Re: Sharding configuration

2014-10-30 Thread Anca Kopetz
shards as long as the underlying machine has the sufficient amount of parallelism. Again, this advice is for a small number of shards, if you had a lot more (hundreds) of shards and significant volume of requests, things start to become a bit more fuzzy with other limits kicking in. On 28 Oct 201

Sharding configuration

2014-10-28 Thread Anca Kopetz
Hi, We have a SolrCloud configuration of 10 servers, no sharding, 20 millions of documents, the index has 26 GB. As the number of documents has increased recently, the performance of the cluster decreased. We thought of sharding the index, in order to measure the latency. What is the best approa

Re: How to boost documents ?

2014-01-09 Thread Anca Kopetz
Hi, I tested the BoostQueryParser and it works on the simplified example. But we need to keep the edismax Query parser, so I tried the following query and it seems to work (I defined a local bf='' for qq). &q=beautiful Christmas tree &mm=2 &qf=title^12 description^2 &defType=edismax &bf=map(que

Re: PeerSync Recovery fails, starting Replication Recovery

2014-01-09 Thread Anca Kopetz
iller wrote: Sounds like you need to raise your ZooKeeper connection timeout. Also, make sure you are using a concurrent garbage collector as a side note - stop the world pauses should be avoided. Just good advice :) - Mark On Dec 18, 2013, at 5:48 AM, Anca Kopetz <mailto:anca.kop...@kel

Re: How to boost documents ?

2014-01-06 Thread Anca Kopetz
... Is this a bug ? Thank you, Anca On 12/30/2013 02:30 PM, Anca Kopetz wrote: Hi, Thank you for your response. When I try the URL you sent me, I get the following error message : org.apache.solr.search.SyntaxError: Infinite Recursion detected parsing query 'beautiful Christmas tree'

Re: How to boost documents ?

2013-12-30 Thread Anca Kopetz
his original post [1] to a similar discussion. [1] http://search-lucene.com/m/nK6t9j1fuc2 On Monday, December 16, 2013 12:19 PM, Anca Kopetz <mailto:anca.kop...@kelkoo.com> wrote: Hi, How to boost documents that contain all search terms in several of its fields ? Below you cand fin

Re: PeerSync Recovery fails, starting Replication Recovery

2013-12-20 Thread Anca Kopetz
recovery, as I explained in the reply to Mark's mail. Best regards, Anca Kopetz On 12/19/2013 06:39 PM, Daniel Collins wrote: Are you using a NRT solution, how often do you commit? We see similar issues with PeerSync, but then we have a very active NRT system and we soft-commit sub-secon

Re: PeerSync Recovery fails, starting Replication Recovery

2013-12-20 Thread Anca Kopetz
ection timeout. Also, make sure you are using a concurrent garbage collector as a side note - stop the world pauses should be avoided. Just good advice :) - Mark On Dec 18, 2013, at 5:48 AM, Anca Kopetz <mailto:anca.kop...@kelkoo.com> wrote: Hi, In our SolrCloud cluster (2 shards, 8

Re: PeerSync Recovery fails, starting Replication Recovery

2013-12-19 Thread Anca Kopetz
Thanks; Furkan KAMACI 18 Aralık 2013 Çarşamba tarihinde Anca Kopetz adlı kullanıcı şöyle yazdı: Hi, In our SolrCloud cluster (2 shards, 8 replicas), the replicas go from time to time into recovering state, and it takes more than 10 minutes to finish to recover. In logs, we see that

PeerSync Recovery fails, starting Replication Recovery

2013-12-18 Thread Anca Kopetz
ache.solr.cloud.ZkController:publish:1021  - numShards not found on descriptor - reading it from system property 2013-12-05 14:01:22,860 [RecoveryThread] INFO  org.apache.solr.cloud.RecoveryStrategy:doRecovery:495  - Finished recovery process. core=fr_green Best regards, Anca Kopetz -- Anca Kopetz Software enginee

How to boost documents ?

2013-12-16 Thread Anca Kopetz
sts of qf, but the results vary a lot. Is there a way to add a boost on all search fields, the same way we do with pf on one field : pf=title:2^3.0 ? Thank you, Anca -- Anca Kopetz Software engineer E anca.kop...@kelkoo.com  Y!Messenger kelkooancak T +33 (0)4 56 09 07 55    A 4/6 Rue

Solution for "MM ignored in edismax queries with operators" ?

2013-11-25 Thread Anca Kopetz
2&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13822482. Any feedback is welcome. Best regards, Anca Kopetz Kelkoo SAS Société par Actions Simplifiée Au capital de € 4.168.964,30 Siège social : 8, rue du Sentier 75002 Paris 425 093 069 RCS Paris Ce message et les pièces jo

Re: Filter cache pollution during sharded edismax queries

2013-10-18 Thread Anca Kopetz
Hi Ken, Have you managed to find out why these entries were stored into filterCache and if they have an impact on the hit ratio ? We noticed the same problem, there are entries of this type : item_+(+(title:western^10.0 | ... in our filterCache. Thanks, Anca On 07/02/2013 09:01 PM, Ken Krugle

Re: "optimize" index : impact on performance [Republished]

2013-08-05 Thread Anca Kopetz
will test them. Best regards, Anca On 08/05/2013 10:42 AM, Anca Kopetz wrote: Please read and follow this note about thread hijacking: http://people.apache.org/~hossman/#threadhijack<http://people.apache.org/%7Ehossman/#threadhijack><http://people.apache.org/%7Ehossman/#threadhijac

"optimize" index : impact on performance [Republished]

2013-08-05 Thread Anca Kopetz
Hi, [I am sending again my message to the mailing list, as well as Shawn's reply. Thanks Shawn for your explanations] We are trying to improve the performance of our Solr Search application in terms of QPS (queries per second). We tuned SOLR settings (e.g. mergeFactor=3), launched several ben

"optimize" index : impact on performance

2013-08-02 Thread Anca Kopetz
nodes on 8 dedicated servers; 2 shards / 4 replicas Hardware configuration: 2 Processors (16CPU cores) per server; 24GB of memory; 6GB allocated to JVM Index: 13M documents, 15GB Search algorithm : grouping, faceting, filter queries Solr version 4.4 Best regards, A

Problem with CoreAdminRequest/GET_CORE_STATUS in SolrCloud

2013-07-03 Thread Anca Kopetz
Hi, We have a solr cloud cluster with 2 different collections, each collection having many nodes. We try to get the status of each collection using CoreAdminRequest. The code gets all live nodes from the cluster and sends a request to each node until it gets a valid response. We would like to ha