Re: Getting started with Solr

2015-03-01 Thread Baruch Kogan
OK, got it, works now. Maybe you can advise on something more general? I'm trying to use Solr to analyze HTML data retrieved with Nutch. I want to crawl a list of webpages built according to a certain template, and analyze certain fields in their HTML (identified by a span class and consisting of

Correct connection methodology for Zookeeper/SolrCloud?

2015-03-01 Thread Julian Perry
Hi, I'm really after best-practice guidelines for making queries to an index on a Solr cluster. I'm not calling from Java. I have Solr 4.10.2 up and running, and it seems stable. I have about 6 indexes/collections and am running SolrCloud with two Solr instances (both currently running on the same dev.

Conditional invocation of HTMLStripCharFactory

2015-03-01 Thread SolrUser1543
Is it possible to make a conditional invocation of an HTMLStripCharFactory? I want to decide when to enable or disable it according to the value of a specific field in my document. E.g. when the value of field A is true, then enable the filter on field B, or disable it otherwise.

RE: Is it possible to use multiple index data directory in Apache Solr?

2015-03-01 Thread Susheel Kumar
Under Solr/example folder, you will find "multicore" folder under which you can create multiple core/index directory folders and edit the solr.xml to specify each of the new core/directory. When you start Solr under examples directory, use command line like below to load Solr and then you sho

filtering tfq() function query to specific part of collection not the whole documents

2015-03-01 Thread Ali Nazemian
Hi, I was wondering whether it is possible to filter the tfq() function query to a specific selection of the collection. Suppose I want to count all occurrences of the term "test" in documents with fq=category:2: how can I handle such a query with the tfq() function query? It seems applying fq=category:2 in a "select" query w
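For readers following this thread, here is a minimal Python sketch of how the fq=category:2 filter mentioned above is passed alongside a main query in a "select" request. It only illustrates the request parameters. It does not claim to settle how the function query interacts with the filter. The field name "text" and the helper name filtered_select_params are assumptions for illustration and do not come from the original message.

```python
from urllib.parse import urlencode


def filtered_select_params(main_query, filter_query, rows=0):
    """Return URL-encoded /select parameters with a filter query attached."""
    # q selects and scores documents; fq restricts the matching set.
    return urlencode([("q", main_query), ("fq", filter_query), ("rows", rows)])


# "text" is a hypothetical field name; category:2 is the filter from the message.
params = filtered_select_params("text:test", "category:2")
print(params)
```

The resulting string can be appended after "select?" on the collection's URL.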

Re: [ANNOUNCE] Luke 4.10.3 released

2015-03-01 Thread Dmitry Kan
Hi Tomoko, I have just created the pivot branch off of the current master. Let's move our discussion there: https://github.com/DmitryKey/luke/tree/pivot-luke Thanks, Dmitry On Fri, Feb 27, 2015 at 7:53 PM, Tomoko Uchida wrote: > Hi Dmitry, > > In my environment, I cannot produce this pivots's

Re: Is it possible to use multiple index data directory in Apache Solr?

2015-03-01 Thread Alexandre Rafalovitch
On 1 March 2015 at 01:03, Shawn Heisey wrote: > How exactly does ES split the index files when multiple paths are > configured? I am very curious about exactly how this works. Google is > not helping me figure it out. I even grabbed the ES master branch and > wasn't able to trace how path.data

Re: About solr recovery

2015-03-01 Thread Erick Erickson
Several. One is if your network has trouble and Zookeeper times out a Solr node. Can you describe your problem though? Or is this just an informational question? Because I'm not quite sure how to respond helpfully here. Best, Erick On Fri, Feb 27, 2015 at 10:37 PM, 龚俊衡 wrote: > HI, > > Our producti

Re: Correct connection methodology for Zookeeper/SolrCloud?

2015-03-01 Thread Erick Erickson
bq: I could just set up a load balancer on the two Solr instances and let client query requests use the load balancer to find a working instance. That's all you need to do. The client shouldn't have to really even be aware that Zookeeper exists, there's really no need to query ZK and route your re
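As a rough illustration of the approach described in this reply, the Python sketch below builds a query URL that a non-Java client could send to a load balancer in front of the two Solr instances, with no ZooKeeper lookup on the client side. The load balancer hostname, the collection name "collection1", and the helper name build_select_url are hypothetical. Only the general /select request shape is assumed from Solr itself.

```python
from urllib.parse import urlencode


def build_select_url(base_url, collection, params):
    """Build a /select URL for one collection behind the load balancer."""
    query = urlencode(dict(params, wt="json"))  # ask Solr for a JSON response
    return f"{base_url.rstrip('/')}/{collection}/select?{query}"


# Hypothetical load balancer address fronting the two Solr instances.
LOAD_BALANCER = "http://solr-lb.example.com:8983/solr"

url = build_select_url(LOAD_BALANCER, "collection1", {"q": "*:*", "rows": 10})
print(url)
```

The client would then issue an ordinary HTTP GET against this URL (for example with urllib.request.urlopen) and parse the JSON body. Which Solr instance answers is left entirely to the load balancer.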

Integrating Solr with Nutch

2015-03-01 Thread Baruch Kogan
Hi, guys, I'm working through the tutorial here . I've run a crawl on a list of webpages. Now I'm trying to index them into Solr. Solr's installed, runs fine, indexes .json, .xml, whatever, returns queries. I've edited the N

RE: Integrating Solr with Nutch

2015-03-01 Thread Markus Jelsma
Hello Baruch! You are now pointing to a directory of segments, not a specific segment. You must either point to a directory with the -dir option: bin/nutch solrindex http://127.0.0.1:8983/solr/ crawl/crawldb -linkdb crawl/linkdb -dir crawl/segments/ Or point to a segment: bin/nutch solrin

backport Heliosearch features to Solr

2015-03-01 Thread Yonik Seeley
As many of you know, I've been doing some work in the experimental "heliosearch" fork of Solr over the past year. I think it's time to bring some more of those changes back. So here's a poll: Which Heliosearch features do you think should be brought back to Apache Solr? http://bit.ly/1E7wi1Q (li

Using HDFS with Solr

2015-03-01 Thread Jou Sung-Shik
Hello. I have a question about using HDFS with Solr. I saw that when one of the shard nodes is gone, another node takes over its shards, as in this graph from the admin console. *(10.62.65.46 is Gone)* +- shard 1-10.62.65.48 (active) collection-hdfs-

solr5 - where does solr5 look for schema files?

2015-03-01 Thread Gulliver Smith
I am running the out-of-the-box solr5 as instructed in the tutorial. The solr documentation has no useful documentation about the schema file argument to create core. I have a schema.xml that I was using for a solr 4 installation by manually editing the core directories as root. When playing with

Re: solr5 - where does solr5 look for schema files?

2015-03-01 Thread Erick Erickson
You haven't stated it explicitly, but I think you're running SolrCloud, right? In which case... the configs are all stored in ZooKeeper, and you don't edit them there. The startup scripts automate the "upconfig" step that pushes your configs to Zookeeper. Thereafter, they are read from Zookeeper b

Re: backport Heliosearch features to Solr

2015-03-01 Thread Otis Gospodnetic
Hi Yonik, Now that you joined Cloudera, why not everything? Otis -- Monitoring * Alerting * Anomaly Detection * Centralized Log Management Solr & Elasticsearch Support * http://sematext.com/ On Sun, Mar 1, 2015 at 4:50 PM, Yonik Seeley wrote: > As many of you know, I've been doing some work i

Re: backport Heliosearch features to Solr

2015-03-01 Thread Yonik Seeley
On Sun, Mar 1, 2015 at 7:18 PM, Otis Gospodnetic wrote: > Hi Yonik, > > Now that you joined Cloudera, why not everything? Everything is on the table, but from a practical point of view I wanted to verify areas of user interest/support before doing the work to get things back. Even when there is

SOLR Backup and Restore - Solr 3.6.1

2015-03-01 Thread abhi Abhishek
Hello, we have Solr 3.6.1 in our environment. We are trying to analyse backup and recovery solutions for it. Is there a way to compress the backup taken? We have explored the replicationHandler with the backup command, but as our index is in the hundreds of GB we would like a solution that p

Re: solr cloud does not start with many collections

2015-03-01 Thread Damien Kamerman
I still see the same cloud startup issue with Solr 5.0.0. I created 4,000 collections from scratch and then attempted to stop/start the cloud. node1: WARN - 2015-03-02 18:09:02.371; org.eclipse.jetty.server.handler.RequestLogHandler; !RequestLog WARN - 2015-03-02 18:10:07.196; org.apache.solr.cl