Sorting in SOLR

2015-03-02 Thread Rajesh
Hi, I want to sort my results based on my query string. For example, If I query by "Test Street", My results should be sorted in such a way that the records containing both words should come first and the records containing Test should come and the records containing street should be displayed.

Re: About solr recovery

2015-03-02 Thread 龚俊衡
Hi, Erick, I found problem, because we have only one shard on server with heavy update request (over 250/s) but solr just only use one core of cpu. > On Mar 2, 2015, at 01:25, Erick Erickson wrote: > > Several. One is if your network has trouble and Zookeeper times out a Solr > node. > > Can

Deploying Solr5.0.0 on Windows using Task Schedular

2015-03-02 Thread John Jenniskens
Hello, I'm deploying Solr5.0.0. on Windows 2008 server. I have deployed V4.9.0 by running Jetty as a Windows Service. I read that Tomcat or Jetty are not supported anymore. I'm planning to add a task to the task scheduler to start the Solr server at system boot time. So I call the bin\solr.bat

Validate data Indexed and versioning

2015-03-02 Thread marotosg
Hi, I am trying to define a way of validating if my index has the same content than my database. I am indexing a very complex denormalized version of the database with many items and nested documents. I have an indexation service which pulls records from a staging table(created based on a ETL pro

Re: Conditional invocation of HTMLStripCharFactory

2015-03-02 Thread Dmitry Kan
Hi, This is an interesting question. Just an idea: make a custom extension of https://github.com/apache/lucene-solr/blob/trunk/lucene/analysis/common/src/java/org/apache/lucene/analysis/charfilter/HTMLStripCharFilterFactory.java class and examine the field value before executing the HTMLStripChar

Re: Does shard splitting double host count

2015-03-02 Thread tuxedomoon
Shawn, in light of Garth's response below "You can't just add a new core to an existing collection. You can add the new node to the cloud, but it won't be part of any collection. You're not going to be able to just slide it in as a 4th shard to an established collection of 3 shards." how is it

Re: Is it possible to use multiple index data directory in Apache Solr?

2015-03-02 Thread Shawn Heisey
On 3/1/2015 9:33 AM, Alexandre Rafalovitch wrote: > On 1 March 2015 at 01:03, Shawn Heisey wrote: >> How exactly does ES split the index files when multiple paths are >> configured? I am very curious about exactly how this works. Google is >> not helping me figure it out. I even grabbed the ES

Re: Does shard splitting double host count

2015-03-02 Thread Shawn Heisey
On 3/2/2015 6:12 AM, tuxedomoon wrote: > Shawn, in light of Garth's response below > > "You can't just add a new core to an existing collection. You can add the > new node to the cloud, but it won't be part of any collection. You're not > going to be able to just slide it in as a 4th shard to an

Re: Deploying Solr5.0.0 on Windows using Task Schedular

2015-03-02 Thread Shawn Heisey
On 3/2/2015 4:27 AM, John Jenniskens wrote: > I'm deploying Solr5.0.0. on Windows 2008 server. > I have deployed V4.9.0 by running Jetty as a Windows Service. > I read that Tomcat or Jetty are not supported anymore. > I'm planning to add a task to the task scheduler to start the Solr server at >

Re: solr cloud does not start with many collections

2015-03-02 Thread Shawn Heisey
On 3/2/2015 12:54 AM, Damien Kamerman wrote: > I still see the same cloud startup issue with Solr 5.0.0. I created 4,000 > collections from scratch and then attempted to stop/start the cloud. > > node1: > WARN - 2015-03-02 18:09:02.371; > org.eclipse.jetty.server.handler.RequestLogHandler; !Reques

Create field date using name file

2015-03-02 Thread Mirko Torrisi
Hi folks, Hopefully this is an easy question but I couldn't do it after several hours.. I created a new field (adding indexed="true" stored="true"/>) and I'd like to use name file value to fill out it. The name files are like: TEXT_CRE_MMGG_X-XXX-XXX.txt or TEXT_CRE_MMGG_X-XXX.txt (w

Re: Create field date using name file

2015-03-02 Thread Mirko Torrisi
I forgot to add that the txt files are divided in directory following this rule: //MM/**files**. Regards, Mirko

Re: Create field date using name file

2015-03-02 Thread Alexandre Rafalovitch
So, what do you have already? Did you get as far as having a Solr document that has a full name as one of the fields and you just want to extract the date portion into a separate field? If so, your best bet is probably a custom UpdateRequestProcessor chain that will 1) Copy filename field to date

RE: Validate data Indexed and versioning

2015-03-02 Thread Reitzel, Charles
First, I would invest the largest effort towards developing good test cases and a good test harness for your ETL software itself. If validation in production does encounter errors, it should be considered a bug in your code! So be sure to always add these cases to your test harness. Also, th

SolrCloud default shard assignment order not correct

2015-03-02 Thread spillane
I'm trying to build a SolrCloud with 4 leader/replica pairs. Starting my first host with this tomcat6.conf setting CATALINA_OPTS="${CATALINA_OPTS} -DnumShards=4 -Dbootstrap_confdir=/data/solr/rockbands/conf -Dcollection.configName=rockbands_conf -DzkHost=zk1:2181,zk2:2181,zk3:2181" and then star

How can I index a "dictionary"?

2015-03-02 Thread tacno
Hello guys. I'm indexing any fields in solr without any problems. Ex: http://192.168.1.27:8000/event/xml/2/solr 2 Title text 2015-03-02T15:53:01.435232Z user that create But now, i have pictures in this data, and I want to put an "array" of fields with different values. Example: h

Re: SolrCloud default shard assignment order not correct

2015-03-02 Thread Shawn Heisey
On 3/2/2015 11:39 AM, spillane wrote: > and then starting up host2, host3 and host4 in order I am expecting to see a > cloud graph like this > > shard1 - host1 ip > rockbands --> shard2 - host2 ip > shard3 - host3 ip >

solr bug 6143 (facet count and CollapsingQParserPlugin)

2015-03-02 Thread Tang, Rebecca
We use the CollapsingQParser to group possible duplicate records. We are running into the issue reported by bug 6143. CollapsingQParser only supports facet.truncate but it returns counts that confuses our customers. What we need is group.facets. I wanted to check if a "new feature" bug has b

Slow cross-core joins

2015-03-02 Thread Matt B
I've recently inherited a Solr instance that is required to perform numerous joins between two cores, usually as filter queries, similar to the one below: q=firstName=Matt&fq=-({!to=emailAddress toIndex=accounts type=join fromIndex=lists from=listValue}list_id:38f2-351b-11e4-9579-001e67654bc

Re: Slow cross-core joins

2015-03-02 Thread Gopal Patwa
You could give a try for this join contrib patch https://issues.apache.org/jira/browse/SOLR-4787 On Mon, Mar 2, 2015 at 12:04 PM, Matt B wrote: > I've recently inherited a Solr instance that is required to perform > numerous joins between two cores, usually as filter queries, similar to the >

Re: SolrCloud default shard assignment order not correct

2015-03-02 Thread spillane
Since the order is consistently 1,4,2,3 it sounds like I can start the leaders in 1,4,2,3 order and then replicas in 1,4,2,3 order and expect the relationships to stick leader1 -> replica1 leader4 -> replica4 leader2 -> replica2 leader3 -> replica3 -- View this message in context: http://luce

Slow highlighting on Solr 5.0.0

2015-03-02 Thread Matt Hilt
Short form: While testing Solr 5.0.0 within our staging environment, I noticed that highlight enabled queries are much slower than I saw with 4.10. Are there any obvious reasons why this might be the case? As far as I can tell, nothing has changed with the default highlight search component or i

Re: How can I index a "dictionary"?

2015-03-02 Thread Ahmet Arslan
Hi Tacno, Sounds like material for block join : https://cwiki.apache.org/confluence/display/solr/Other+Parsers Ahmet On Monday, March 2, 2015 9:05 PM, tacno wrote: Hello guys. I'm indexing any fields in solr without any problems. Ex: http://192.168.1.27:8000/event/xml/2/solr 2 Title tex

Re: How can I index a "dictionary"?

2015-03-02 Thread Alexandre Rafalovitch
The question is what you want to find when you search for photo information and what happens if you search for several keywords that come from different pictures. If you don't care and just want to get the common parent record, you need to flatten that photo information into multivalue fields to b

Re: Validate data Indexed and versioning

2015-03-02 Thread Shawn Heisey
On 3/2/2015 4:32 AM, marotosg wrote: > Is there any general approach to check if your indexed document matches the > database row?. No, there's nothing out of the box that will do this. You'll need to write such an application yourself. You might want to leverage the /export handler in your appl

Re: Suggestion on indexing complex xml

2015-03-02 Thread Vishal Swaroop
Thanks for your time and suggestions Alex... a) So, if I use xslt... then SOLR output result will be xml, or there is a trick to get json also b) I am trying to figure out xslt template for my xml (as below) input to differentiate "parameter1" & "parameter2" as some elements are common (e.g. name

Unable to show the indexed content in Solr 5.0

2015-03-02 Thread Zheng Lin Edwin Yeo
Hi, The content field is unable to be shown during searching, even though the following line has been added to the schema using curl from the resource named in 'managedSchemaResourceName'. I'm using the schema from ManagedIndexSchemaFactory. As the ExtractRequestHandler has already been define

Re: Slow cross-core joins

2015-03-02 Thread William Bell
I agree that join is slow. Adding fq on LocalParams is good. Has this been added to {!lucene} and other calls like join ? On Mon, Mar 2, 2015 at 2:00 PM, Gopal Patwa wrote: > You could give a try for this join contrib patch > > https://issues.apache.org/jira/browse/SOLR-4787 > > > > On Mon, Ma

Fwd: Unable to show the indexed content in Solr 5.0

2015-03-02 Thread Zheng Lin Edwin Yeo
Hi, The content field is unable to be shown during searching, even though the following line has been added to the schema using curl from the resource named in 'managedSchemaResourceName'. I'm using the schema from ManagedIndexSchemaFactory. As the ExtractRequestHandler has already been define

Re: solr cloud does not start with many collections

2015-03-02 Thread Damien Kamerman
Still no luck starting solr with 40s zkClientTimeout. I'm not seeing any expired sessions... There must be a way to start solr with many collections. It runs fine.. until a restart is required. On 3 March 2015 at 03:33, Shawn Heisey wrote: > On 3/2/2015 12:54 AM, Damien Kamerman wrote: > > I st