Nutch + Solr - Indexer causes java.lang.OutOfMemoryError: Java heap space

2014-09-07 Thread glumet
Hello everyone, I have configured my two servers to run in distributed mode (with Hadoop), and my configuration for the crawling process is Nutch 2.2.1 with HBase (as storage) and Solr. Solr is run by Tomcat. The problem is that every time I try to do the last step - I mean when I want to index data from HBase

Re: SolrCloud : node recovery fails with "No registered leader was found"

2014-09-07 Thread heaven
Seeing the same thing after a crash of one ZK node (from 5): {code} org.apache.solr.common.SolrException: No registered leader was found after waiting for 4000ms , collection: crm-prod slice: shard1 at org.apache.solr.common.cloud.ZkStateReader.getLeaderRetry(ZkStateReader.java:545)

Re: Query ReRanking question

2014-09-07 Thread Erick Erickson
Joel: I find that whenever I say something totally wrong publicly, I remember the correction really, really well... Thanks for straightening that out! Erick On Sat, Sep 6, 2014 at 12:58 PM, Joel Bernstein wrote: > This following query: > > http://localhost:8080/solr/select?q=malaysian airline c

New cloud - replica in recovering state?

2014-09-07 Thread Jakov Sosic
Hi guys, I'm trying to set up a new Solr cloud with two cores, each with two shards and two replicas. This is my solr.xml: But when I start everything, I can see 4 cores (each for 1 shard) are green in solr01:8080/solr/#/~cloud, but replicas are in yellow, REC

ANNOUNCE: Solr Reference Guide for Solr 4.10

2014-09-07 Thread Chris Hostetter
The Lucene PMC is pleased to announce that there is a new version of the Solr Reference Guide for Solr 4.10. The 511 page PDF serves as the definitive user's manual for Solr 4.10. It can be downloaded from the Apache mirror network: https://www.apache.org/dyn/closer.cgi/lucene/solr/ref-guid

Re: Query ReRanking question

2014-09-07 Thread Joel Bernstein
Ok, just reviewed the code. The ReRankQParserPlugin always tracks the scores from the main query. So this explains things. Speaking of explaining things, the ReRankQParserPlugin also works with Lucene's explain. So if you use debugQuery=true we should see that the score from the initial query

Re: Query ReRanking question

2014-09-07 Thread Joel Bernstein
Oops, wrong usage pattern. It should be: 1) Main query is sorted by a field (scores tracked silently in the background). 2) Reranker is reranking docs based on the score from the main query. Joel Bernstein Search Engineer at Heliosearch On Sun, Sep 7, 2014 at 4:43 PM, Joel Bernstein wrote: >
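The pattern discussed in this thread can be sketched as a set of request parameters. This is only a minimal sketch, assuming the `{!rerank}` local-params syntax (`reRankQuery`, `reRankDocs`, `reRankWeight`) available since Solr 4.9; the helper name `build_rerank_params`, the sort field `price`, and the query strings are hypothetical and not taken from the thread.

```python
from urllib.parse import urlencode


def build_rerank_params(main_query, rerank_query, sort_field=None,
                        rerank_docs=200, rerank_weight=3.0):
    """Build Solr request parameters for a re-ranked query.

    Hypothetical helper: it only assembles parameters and does not
    contact a Solr server.
    """
    params = {
        "q": main_query,
        # The rerank parser reads the secondary query from the $rqq parameter.
        "rq": "{!rerank reRankQuery=$rqq reRankDocs=%d reRankWeight=%s}"
              % (rerank_docs, rerank_weight),
        "rqq": rerank_query,
        # debugQuery=true asks Solr to include explain output in the response.
        "debugQuery": "true",
    }
    if sort_field is not None:
        # Assumed field name; the thread describes a main query sorted by a field.
        params["sort"] = "%s asc" % sort_field
    return params


params = build_rerank_params("malaysian airline", "crashed", sort_field="price")
query_string = urlencode(params)  # append to a /select URL of your own server
```

Whether sorting and re-ranking combine the way you expect is best checked against the explain output on your own Solr version.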

Re: statuscode list

2014-09-07 Thread Koji Sekiguchi
Hi Jan, (2014/09/05 21:01), Jan Verweij - Reeleez wrote: > Hi, If I'm correct you will get a statuscode="0" in the response if you use XML messages for updating the solr index. I think by statuscode="0" you mean status=0 here. > Is there a list of possible other statuscodes you can re
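For readers unsure where `status=0` appears, here is a minimal sketch of reading it from an XML update response. The sample response below is an assumption modeled on the usual `responseHeader` layout (the `QTime` value is made up), and `response_status` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Assumed shape of a successful XML update response; QTime is invented.
SAMPLE_RESPONSE = """<response>
  <lst name="responseHeader">
    <int name="status">0</int>
    <int name="QTime">12</int>
  </lst>
</response>"""


def response_status(xml_text):
    """Return the integer status from a Solr XML response header."""
    root = ET.fromstring(xml_text)
    node = root.find("./lst[@name='responseHeader']/int[@name='status']")
    if node is None or node.text is None:
        raise ValueError("no status element found in responseHeader")
    return int(node.text)


status = response_status(SAMPLE_RESPONSE)
```

A status of 0 indicates success; this sketch makes no claim about which non-zero values a given Solr version can return.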

Re: New cloud - replica in recovering state?

2014-09-07 Thread Erick Erickson
I really recommend you use the new-style core discovery, if for no other reason than this style is deprecated in 5.0. See: https://wiki.apache.org/solr/Solr.xml%204.4%20and%20beyond FWIW, Erick On Sun, Sep 7, 2014 at 8:51 AM, Jakov Sosic wrote: > Hi guys, > > > I'm trying to set up new solr clou

[ANN] Heliosearch 0.07 released

2014-09-07 Thread Yonik Seeley
http://heliosearch.org/download
Heliosearch v0.07 Features
o Heliosearch v0.07 is based on (and contains all features of) Lucene/Solr 4.10.0
o An optimized Terms Query with native code performance enhancements for efficiently matching multiple terms in a field. http://heliosearch.org/s

Re: Performance of Boolean query with hundreds of OR clauses.

2014-09-07 Thread Yonik Seeley
Solr 4.10 has added a {!terms} query that should speed up these cases. Benchmarks here: http://heliosearch.org/solr-terms-query/ -Yonik http://heliosearch.org - native code faceting, facet functions, sub-facets, off-heap data On Tue, Aug 19, 2014 at 2:57 PM, SolrUser1543 wrote: > I am using Sol
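To illustrate the difference, here is a minimal sketch that builds the same filter both ways: as a long boolean OR query and as a `{!terms}` query, assuming the `{!terms f=<field>}v1,v2,...` syntax with the default comma separator. The helper names and the field `id` are hypothetical.

```python
def boolean_or_filter(field, values):
    """Classic form: one OR clause per value, e.g. id:(1 OR 2 OR 3)."""
    return "%s:(%s)" % (field, " OR ".join(values))


def terms_filter(field, values):
    """Terms-query form: {!terms f=field}v1,v2,v3 (comma-separated values)."""
    for value in values:
        if "," in value:
            # The default separator is a comma, so values containing one
            # would be split incorrectly; this sketch simply rejects them.
            raise ValueError("value contains the separator: %r" % value)
    return "{!terms f=%s}%s" % (field, ",".join(values))


ids = ["1", "2", "3"]
or_form = boolean_or_filter("id", ids)
terms_form = terms_filter("id", ids)
```

Either string could be sent as an `fq` parameter; the terms form does not score matches by relevance, which suits filtering on hundreds of values.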

Re: How to implement multilingual word components fields schema?

2014-09-07 Thread Ilia Sretenskii
Thank you for the replies, guys! Using a field-per-language approach for multilingual content is the last thing I would try, since my actual task is to implement search functionality that offers roughly the same capabilities for every known world language. The closest references are t

Deleted Collections not updated in Zookeeper

2014-09-07 Thread RadhaJayalakshmi
Hi, Issue in brief: I am facing a strange issue where collections that have been deleted in Solr still have references in ZooKeeper, so the SolrCloud console still shows the deleted collections in a down state. Issue in detail: I am using Solr 4.5.1 and

Re: Deleted Collections not updated in Zookeeper

2014-09-07 Thread Anshum Gupta
Hi Radha, This is strange, as the Collections API delete command is supposed to clean up ZK. Do you see any errors in your Solr logs? Does the response from the call include any errors/exceptions? On Sun, Sep 7, 2014 at 11:32 PM, RadhaJayalakshmi < rlakshminaraya...@inautix.co.in> wrote: > Hi,