Re: Collations are not working fine.

2015-02-17 Thread Nitin Solanki
Hey James Dyer, Sorry for the late response; I was away for a couple of days. I have tried out Rajesh Hazari's configuration, which he pasted in his mail. It seems to be working. I feel that it is working because of reducing the *25* to *5*, by which collations

Re: Release date for Solr 5

2015-02-17 Thread CKReddy Bhimavarapu
Hi, Can I get a developer version to test and run for now? On Tue, Feb 17, 2015 at 12:45 PM, Anshum Gupta wrote: > There's a vote going on for the 3rd release candidate of Solr / Lucene 5.0. > If everything goes smooth and the vote passes, the release should happen in > about 4-5 days. > > On

Re: Collations are not working fine.

2015-02-17 Thread Nitin Solanki
Hey Rajesh, Sorry for the late response; I was away for a couple of days. I have tried out the configuration which you sent me. Thanks a lot. It seems to be working. I feel that it is working because of reducing the *25* to *5*, by which fewer collations come and spell

Re: Collations are not working fine.

2015-02-17 Thread Nitin Solanki
Hi Charles, Will you please send the configuration which you tried? It will help to solve my problem. Have you sorted the collations on hits or frequencies of suggestions? If you did, then please assist me. On Mon, Feb 16, 2015 at 7:59 PM, Reitzel, Charles < charles.reit...@tiaa-cr

Re: Release date for Solr 5

2015-02-17 Thread Anshum Gupta
You can either checkout the release branch and build it yourself from: http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_5_0 or download it from the RC here: http://people.apache.org/~anshum/staging_area/lucene-solr-5.0.0-RC3-rev1659987 You should remember that this is a release can

Re: Release date for Solr 5

2015-02-17 Thread Shalin Shekhar Mangar
You can help by testing out the release candidate available from: http://people.apache.org/~anshum/staging_area/lucene-solr-5.0.0-RC3-rev1659987 Note that this is *NOT* an official release. On Tue, Feb 17, 2015 at 1:43 PM, CKReddy Bhimavarapu wrote: > Hi, > Can i get any developer version to t

Re: Weird Solr Replication Slave out of sync

2015-02-17 Thread Dmitry Kan
Hi, This sounds quite strange. Do you see any error messages either in the solr admin's replication page or in the master's OR slave's logs? When we had issues with slave replicating from the master, they related to slave running out of disk. I'm sure there could be a bunch of other reasons for fa

Sort collation on hits.

2015-02-17 Thread Nitin Solanki
Hi all, I want to sort the collations on hits in descending order. How can I do that?

Re: Solr suggest is related to second letter, not to initial letter

2015-02-17 Thread Volkan Altan
First of all thank you for your answer. Example Url: doc 1 suggest_field: galaxy samsung s5 phone doc 2 suggest_field: shoe adidas 2 hiking http://localhost:8983/solr/solr/suggest?q=galaxy+s The result for which I am waiting is just like the one indicated below. But; the ‘’Galaxy shoe’’ isn’t

Possibility of Indexing without feeding again in Solr 4.10.2

2015-02-17 Thread dinesh naik
Hi all, How can I re-index in Solr without importing the data again? Is there a way to re-index only a few documents? -- Best Regards, Dinesh Naik

Better way of copying/backup of index in Solr 4.10.2

2015-02-17 Thread dinesh naik
What is the best way for copying/backup of index in Solr 4.10.2? -- Best Regards, Dinesh Naik

unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Dmitry Kan
Hi, We are currently comparing the RAM consumption of two parallel Solr clusters with different solr versions: 4.10.2 and 4.3.1. For comparable index sizes of a shard (20G and 26G), we observed 9G vs 5.6G RAM footprint (reserved RAM as seen by top), 4.3.1 being the winner. We have not changed th

RE: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Markus Jelsma
We have seen an increase between 4.8.1 and 4.10. -Original message- > From:Dmitry Kan > Sent: Tuesday 17th February 2015 11:06 > To: solr-user@lucene.apache.org > Subject: unusually high 4.10.2 vs 4.3.1 RAM consumption > > Hi, > > We are currently comparing the RAM consumption of two

Re: Possibility of Indexing without feeding again in Solr 4.10.2

2015-02-17 Thread Gora Mohanty
On 17 February 2015 at 15:18, dinesh naik wrote: > Hi all, > How to can do re-indexing in Solr without importing the data again? > Is there a way to do re-indexing only for few documents ? > If you have a unique ID for your documents, updating the index with that ID will update just that documen
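The reply above can be illustrated with a minimal Python sketch that prepares a JSON body containing only the documents to be re-sent. It assumes the schema's unique key is named `id` and that the body would be posted to the core's `/update` handler with a JSON content type; the helper name, the sample document, and the core placeholder are hypothetical and not taken from the thread.

```python
import json


def build_update_payload(docs, id_field="id"):
    """Serialize documents for a JSON update request.

    Every document must carry the unique key (assumed to be "id" here),
    so that re-sending it replaces only the document with that key.
    """
    for doc in docs:
        if id_field not in doc:
            raise ValueError(f"document is missing unique key {id_field!r}: {doc}")
    return json.dumps(docs)


# Hypothetical document: only this one is sent again, not the whole data set.
payload = build_update_payload([{"id": "doc-42", "title": "corrected title"}])
print(payload)
# The payload would then be POSTed to something like
# http://localhost:8983/solr/<core>/update?commit=true (Content-Type: application/json).
```

The sketch only builds the request body; sending it and committing are left to whatever HTTP client is already in use.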

Re: Better way of copying/backup of index in Solr 4.10.2

2015-02-17 Thread Gora Mohanty
On 17 February 2015 at 15:19, dinesh naik wrote: > > What is the best way for copying/backup of index in Solr 4.10.2? Please take a look at https://cwiki.apache.org/confluence/display/solr/Backing+Up Regards, Gora

Re: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Dmitry Kan
Have you found an explanation to that? On Tue, Feb 17, 2015 at 12:12 PM, Markus Jelsma wrote: > We have seen an increase between 4.8.1 and 4.10. > > -Original message- > > From:Dmitry Kan > > Sent: Tuesday 17th February 2015 11:06 > > To: solr-user@lucene.apache.org > > Subject: unusual

RE: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Markus Jelsma
I would have shared it if i had one :) -Original message- > From:Dmitry Kan > Sent: Tuesday 17th February 2015 11:40 > To: solr-user@lucene.apache.org > Subject: Re: unusually high 4.10.2 vs 4.3.1 RAM consumption > > Have you found an explanation to that? > > On Tue, Feb 17, 2015 at

Re: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Dmitry Kan
;) ok. Currently I'm trying parallel GC options, mentioned here: http://comments.gmane.org/gmane.comp.jakarta.lucene.solr.user/101377 At least the saw-tooth RAM chart is starting to shape up. On Tue, Feb 17, 2015 at 12:55 PM, Markus Jelsma wrote: > I would have shared it if i had one :) > > ---

spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Nitin Solanki
Hello Everyone, I am confused about the difference between spellcheck.count and spellcheck.alternativeTermCount in Solr. Could anyone explain it in detail?

Re: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Toke Eskildsen
On Tue, 2015-02-17 at 11:05 +0100, Dmitry Kan wrote: > Solr: 4.10.2 (high load, mass indexing) > Java: 1.7.0_76 (Oracle) > -Xmx25600m > > > Solr: 4.3.1 (normal load, no mass indexing) > Java: 1.7.0_11 (Oracle) > -Xmx25600m > > The RAM consumption remained the same after the load has stopped on t

Re: unusually high 4.10.2 vs 4.3.1 RAM consumption

2015-02-17 Thread Dmitry Kan
Thanks Toke! Now I consistently see the saw-tooth pattern on two shards with new GC parameters, next I will try your suggestion. The current params are: -Xmx25600m -XX:+UseParNewGC -XX:+ExplicitGCInvokesConcurrent -XX:+UseConcMarkSweepGC -XX:MaxTenuringThreshold=8 -XX:CMSInitiatingOccupancyFract

Block Join Query Parsers regular expression feature workaround req

2015-02-17 Thread Sankalp Gupta
Hi I need to have a query in which I need to choose only those parent docs none of whose children's field is having the specified value. i.e. I need something like this: http://localhost:8983/solr/core1/select?*q={!parent which=contentType:parent}childField:NOT value1* The problem is* NOT operato

Re: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Nitin Solanki
Any help please? On Tue, Feb 17, 2015 at 4:57 PM, Nitin Solanki wrote: > Hello Everyone, > I got confusion between spellcheck.count and > spellcheck.alternativeTermCount in Solr. Any help in details? >

Re: Block Join Query Parsers regular expression feature workaround req

2015-02-17 Thread Mikhail Khludnev
Try searching all children, removing those that have value1 with a dash, then join the remaining ones: q={!parent which=contentType:parent}contentType:child -contentType:value1 If the space in the nested query causes a problem, try to escape it or wrap it in v=$subq. On Tue, Feb 17, 2015 at 4:13 PM, Sankalp Gupta

Re: Too many merges, stalling...

2015-02-17 Thread Shawn Heisey
On 2/16/2015 8:12 PM, ralph tice wrote: > Recently I turned on INFO level logging in order to get better insight > as to what our Solr cluster is doing. Sometimes as frequently as > almost 3 times a second we get messages like: > [CMS][qtp896644936-33133]: too many merges; stalling... > > Less fr

Discrepancy between Full import and Delta import query

2015-02-17 Thread Aniket Bhoi
Hi Folks, I am running Solr 3.4 and using DIH for importing data from a SQL server backend. The query for Full import and Delta import is the same ie both pull the same data. Full and Delta import query: SELECT KB_ENTRY.ADDITIONAL_INFO ,KB_ENTRY.KNOWLEDGE_REF ID,SU_ENTITY_TYPE.REF ENTRY_TYPE_RE

Collations are not using suggestions to build collations

2015-02-17 Thread Nitin Solanki
Hi, I want to build collations using suggestions of the query. But collations are being built without using the suggestions; they are using their own suggestions *(misspellingsAndCorrections)* and I don't know where these suggestions are coming from. You can see the result in the below response fo

Re: Having a spot of trouble setting up /browse

2015-02-17 Thread Erik Hatcher
And FYI, out of the box with Solr 5.0, using the data driven config (the default when creating a collection with `bin/solr create -c …`), /browse is wired in by default with no templates explicit in the configuration as they are baked into the VrW library itself. But yeah, what Alexandre said -

Using TimestampUpdateProcessorFactory and updateRequestProcessorChain

2015-02-17 Thread Shu-Wai Chow
Hi, all. I’m trying to insert a field into Solr called last_modified, which holds a timestamp of the update. Since this is a cloud setup, I'm using the TimestampUpdateProcessorFactory to update the updateRequestProcessorChain. solrconfig.xml: last_modified la

RE: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Dyer, James
See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.count and the following section, for details. Briefly, "count" is the # of suggestions it will return for terms that are *not* in your index/dictionary. "alternativeTermCount" are the # of alternatives you want returned for terms t

Re: Block Join Query Parsers regular expression feature workaround req

2015-02-17 Thread Sankalp Gupta
Hi Mikhail, It won't solve my problem. For ex: Suppose my docs are like this: city1 city2 city2 city3 Now if I want* a query to return me all the users not having any address* related to *city1* (i.e. only userid=2 should be in the result)a

Re: Using TimestampUpdateProcessorFactory and updateRequestProcessorChain

2015-02-17 Thread Ahmet Arslan
Hi, You are using "/update" when registering, but using "/update/extract" when invoking. Ahmet On Tuesday, February 17, 2015 6:28 PM, Shu-Wai Chow wrote: Hi, all. I’m trying to insert a field into Solr called last_modified, which holds a timestamp of the update. Since this is a cloud setu

Re: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Nitin Solanki
Hi James, How can you say that "count" doesn't use the index/dictionary? Then where do the suggestions come from? On Tue, Feb 17, 2015 at 10:29 PM, Dyer, James wrote: > See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.count and > the following section, for details. > > Brie

Re: Too many merges, stalling...

2015-02-17 Thread Shawn Heisey
On 2/17/2015 7:47 AM, Shawn Heisey wrote: > The first message simply indicates that you have reached more > simultaneous merges than CMS is configured to allow (3 by default), so > it will stall all of them except one. The javadocs say that the one > allowed to run will be the smallest, but I have

Re: Using TimestampUpdateProcessorFactory and updateRequestProcessorChain

2015-02-17 Thread Chris Hostetter
: Hi, : : You are using "/update" when registering, but using "/update/extract" when invoking. : : Ahmet if your goal is that *every* doc will get a last_modified, regardless of how it is indexed, then you don't need to set the "update.chain" default on every requestHandler -- instead just ma

Re: Solr 4.8.1 : Response Code 500 when creating the new request handler

2015-02-17 Thread Chris Hostetter
: 1. Look further down in the stack trace for the "caused by" that details : > the specific cause of the exception. : I am still not able to find the cause of this. Jack is referring to the log file from your server ... sometimes there are more details there. : Sorry i but don't know it is non-

RE: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Dyer, James
Here is an example to illustrate what I mean... - query q=text:(life AND hope)&spellcheck.count=10&spellcheck.alternativeTermCount=5 - suppose at least one document in your dictionary field has "life" in it - also suppose zero documents in your dictionary field have "hope" in them - The spellchec
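The example request in this message can be assembled programmatically. The following is a minimal Python sketch that only builds the query string; the helper name is hypothetical, and `spellcheck=true` is added on the assumption that the spellcheck component is not already enabled by default on the handler.

```python
from urllib.parse import parse_qs, urlencode


def spellcheck_params(query, count, alternative_term_count):
    """Build a query string that sets both spellcheck limits explicitly.

    As described in the thread, "count" applies to terms that are not in the
    index/dictionary, "alternativeTermCount" to terms that are.
    """
    return urlencode({
        "q": query,
        "spellcheck": "true",  # assumption: not already a handler default
        "spellcheck.count": count,
        "spellcheck.alternativeTermCount": alternative_term_count,
    })


qs = spellcheck_params("text:(life AND hope)", 10, 5)
print(qs)
```

Building the parameters through `urlencode` also takes care of escaping the parentheses, colon, and spaces in the query.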

Re: Block Join Query Parsers regular expression feature workaround req

2015-02-17 Thread Kydryavtsev Andrey
How about finding all parents which have at least one child with address:city1 and then negating that? Something like (not sure about the syntax at all) q=-{!parent which=userid:*}address:city1 17.02.2015, 20:21, "Sankalp Gupta" : > Hi Mikhail, > > It won't solve my problem. > For ex: > Suppose my docs are like this: >
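The logic behind this suggestion (take the parents that have at least one matching child, then exclude them) can be shown outside Solr with plain Python sets. This is only an illustration of the set logic, not of Solr syntax; the data layout, where user 1 has addresses in city1 and city2 and user 2 in city2 and city3, is an assumption reconstructed from the example in the thread.

```python
def parents_without_child_value(children_by_parent, value):
    """Return the parents none of whose children carry the given value."""
    # Step 1: parents with at least one matching child (the block join part).
    matching = {
        parent
        for parent, child_values in children_by_parent.items()
        if value in child_values
    }
    # Step 2: negate, i.e. all parents minus the matching ones.
    return set(children_by_parent) - matching


# Assumed example data: parent userid -> address cities of its child docs.
addresses = {1: ["city1", "city2"], 2: ["city2", "city3"]}
print(parents_without_child_value(addresses, "city1"))
```

Filtering the children first and joining afterwards would instead keep user 1, because that user still has a child in city2; this is why the negation has to be applied to the parents.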

Re: Block Join Query Parsers regular expression feature workaround req

2015-02-17 Thread Mikhail Khludnev
Sankalp, would you mind posting the debugQuery=on output? Without it, it's hard to see what the problem is. However, it's worth mentioning that Andrey's suggestion seems really promising. On Tue, Feb 17, 2015 at 8:19 PM, Sankalp Gupta wrote: > Hi Mikhail, > > It won't solve my problem. > For ex: > S

Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread O. Olson
At this time the latest released version of Solr is 4.10.3. Is there any way we can get the source code for this release version? I tried to check out the Solr code from http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_4_10/ In the commit log, I see a number of revisions but nothing m

RE: Collations are not working fine.

2015-02-17 Thread Reitzel, Charles
Hi Nitin, I was trying many different options for a couple different queries. In fact, I have collations working ok now with the Suggester and WFSTLookup. The problem may have been due to a different dictionary and/or lookup implementation and the specific options I was sending. In general

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread Hrishikesh Gadre
Hi, You can get the released code base here https://github.com/apache/lucene-solr/releases Thanks Hrishikesh On Tue, Feb 17, 2015 at 2:20 PM, O. Olson wrote: > At this time the latest released version of Solr is 4.10.3. Is there anyway > we can get the source code for this release version? >

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread Hrishikesh Gadre
Also the version number is encoded (at least) in the build file https://github.com/apache/lucene-solr/blob/817303840fce547a1557e330e93e5a8ac0618f34/lucene/common-build.xml#L32 Hope this helps. Thanks Hrishikesh On Tue, Feb 17, 2015 at 2:25 PM, Hrishikesh Gadre wrote: > Hi, > > You can get the

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread O. Olson
Thank you Hrishikesh. Funny how GitHub is not mentioned on http://lucene.apache.org/solr/resources.html I think common-build.xml is what I was looking for. Thank you Hrishikesh Gadre-3 wrote > Also the version number is encoded (at least) in the build file > > https://github.com/apache/luce

CSV entry as multiple documents

2015-02-17 Thread Henrique Oliveira
Hi all, I was wondering if there is a way to tell Solr to treat a CSV entry as multiple documents instead of one document. For instance, suppose that a CSV file has 4 fields and a single entry: t1,v1,v2,v3 2015-01-01T01:00:59Z,0.3,0.5,0.7 I want Solr to update its index like it were 3 different

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread Shawn Heisey
On 2/17/2015 3:20 PM, O. Olson wrote: > At this time the latest released version of Solr is 4.10.3. Is there anyway > we can get the source code for this release version? > > I tried to checkout the Solr code from > http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_4_10/ In the > comm

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread Mike Drob
The SVN source is under tags, not branches. http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_10_3/ On Tue, Feb 17, 2015 at 4:39 PM, O. Olson wrote: > Thank you Hrishikesh. Funny how GitHub is not mentioned on > http://lucene.apache.org/solr/resources.html > > I think common-build.

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread O. Olson
Thank you Mike. This is what I was looking for. I apparently did not understand what tags were. Mike Drob wrote > The SVN source is under tags, not branches. > > http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_10_3/ -- View this message in context: http://lucene.472066.n3.na

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread O. Olson
Thank you Shawn. I have not updated my version in a while, so I prefer to do it to 4.10 first, rather than go directly to 5.0. I'd be working on it towards the end of this week. -- View this message in context: http://lucene.472066.n3.nabble.com/Checkout-the-source-Code-to-the-Release-Version-o

Solrcloud sizing

2015-02-17 Thread Dominique Bejean
One of our customers needs to index 15 billion documents in a collection. As this volume is not usual for me, I need some advice about SolrCloud sizing (how many servers, nodes, shards, replicas, how much memory, ...) Some inputs : - Collection size : 15 billion documents - Collection update : 8 mi

Re: CSV entry as multiple documents

2015-02-17 Thread Anshum Gupta
Hi Henrique, Solr supports posting a csv with multiple rows. Have a look at the documentation in the ref. guide here: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-CSVFormattedIndexUpdates On Tue, Feb 17, 2015 at 2:44 PM, Henr

Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Simon Cheng
Hi, I don't know whether it is my setup or any other reasons. But the fact is that a very simple sort is not working in my Solr 4.7 environment. The query is very simple : http://localhost:8983/solr/bibs/select?q=author:soros&fl=id,author,title&sort=title+asc&wt=xml&start=0&indent=true And the o

Re: CSV entry as multiple documents

2015-02-17 Thread Alexandre Rafalovitch
I think the question asked was a bit different. It was about having one row/document split into multiple with some fields replicated and some mapped. JSON (single-document format) has a split command which might be similar to what's being asked. CSV has a split command as well, but I think it is m

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Alexandre Rafalovitch
What's the field definition for your "title" field? Is it just string or are you doing some tokenizing? It should be a string or a single token cleaned up (e.g. lower-cased) using KeywordTokenizer. In the example schema, you will normally see the original field tokenized and the sort field separat

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Simon Cheng
Hi Alex, It's simply defined like this in the schema.xml : and it is cloned to the other multi-valued field o_title : Should I simply change the type to be "string" instead? Thanks again, Simon. On Wed, Feb 18, 2015 at 12:00 PM, Alexandre Rafalovitch wrote: > What's the field defi

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Alexandre Rafalovitch
If you are not searching against the "title" field directly, you can change it to string. If you do, create a separate one, specifically for sorting. You should be able to use docValues with that field even in Solr 4.7. Remember to re-index. Regards, Alex. Sign up for my Solr resources ne

Re: CSV entry as multiple documents

2015-02-17 Thread Henrique Oliveira
Yes, Alexandre is right about my question. To make it clear, a CSV that looks like: t1,v1,v2,v3 2015-01-01T01:59:00Z,0.3,0.5,0.7 2015-01-01T02:00:00Z,0.4,0.5,0.8 would be the same as indexing t1,v 2015-01-01T01:59:00Z,0.3 2015-01-01T01:59:00Z,0.5 2015-01-01T01:59:00Z,0.7 2015-01-01T02:00:00Z,0.4 2
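The transformation described here can also be done on the client before the data reaches Solr. The following Python sketch is one possible pre-processing step, not a Solr feature; it assumes the timestamp column is named `t1`, that every other column holds a value, that the value column names are distinct, and that the output field is called `v`.

```python
import csv
import io


def split_rows(csv_text, key_field="t1", value_field="v"):
    """Turn each CSV row into one document per value column.

    The key column (assumed to be "t1") is repeated in every document;
    each remaining column contributes one document with a single value field.
    """
    documents = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        for column, value in row.items():
            if column != key_field:
                documents.append({key_field: row[key_field], value_field: value})
    return documents


sample = "t1,v1,v2,v3\n2015-01-01T01:59:00Z,0.3,0.5,0.7\n"
for doc in split_rows(sample):
    print(doc)
```

The resulting documents can then be written back out as a two-column CSV or sent in any other update format. Values stay strings here; converting them to numbers is left to the schema.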

Re: CSV entry as multiple documents

2015-02-17 Thread Alexandre Rafalovitch
What's your business use case? You don't need the split command, as you already have those values in separate fields. You could copyField them to a single multiValued field, but you would still have one document per original CSV line. Why do you need multiple documents out of one big CSV entry? R

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Simon Cheng
Hi Alex, It's okay after I added a new field "s_title" in the schema and re-indexed. But how can I ignore the articles ("A", "An", "The") in the sorting? As you can see from the example below: http://localhost:8983/solr/bibs/select?q=singapore&fl=id,title&sort=s_title+asc&wt=xml&sta

Re: Using TimestampUpdateProcessorFactory and updateRequestProcessorChain

2015-02-17 Thread Shu-Wai Chow
> if your goal is that *every* doc will get a last_modified, regarldess of > how it is indexed, then you don't need to set the "update.chain" default > on every requestHandler -- instead just mark your > updateRequestProcessorChain as the default... > > > > last_modifi

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-17 Thread Alexandre Rafalovitch
As I mentioned before, you could use the string type if you just want the title as it is. Or you can use a custom type to normalize the indexed value, as long as you end up with a single token. So, if you want to strip leading A/An/The, you can use KeywordTokenizer, combined with whatever post-processing y
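The normalization described here (keep the whole title as a single token, lower-case it, strip a leading article) can be sketched in Python to show what the sort field would end up containing. This is an illustration of the idea only, not the Solr analysis chain itself; the function name and the sample titles are hypothetical.

```python
import re

# A leading "a", "an" or "the" followed by whitespace, in any letter case.
_LEADING_ARTICLE = re.compile(r"^(?:a|an|the)\s+", re.IGNORECASE)


def sort_key(title):
    """Normalize a title into a single sortable token.

    Mirrors the idea from the thread: no tokenizing into words, lower-casing,
    and removal of one leading article.
    """
    normalized = title.strip().lower()
    return _LEADING_ARTICLE.sub("", normalized)


# Hypothetical titles for illustration.
titles = ["The Singapore Story", "A History of Singapore", "Singapore Grip"]
for title in sorted(titles, key=sort_key):
    print(title)
```

Because the pattern requires whitespace after the article, titles such as "Analysis of ..." or "Theory of ..." are left untouched.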

Re: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-17 Thread Nitin Solanki
Thanks James, I tried the same thing spellcheck.count=10&spellcheck.alternativeTermCount=5. And I got 5 suggestions of both "life" and "hope" but not like this * The spellchecker will try to return you up to 10 suggestions for "hope", but only up to 5 suggestions for "life". *

Re: Solrcloud sizing

2015-02-17 Thread Erick Erickson
Well, it's really impossible to say, you have to prototype. Here's something explaining this a bit: https://lucidworks.com/blog/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/ This is a major undertaking. Your question is simply impossible to answer without prototyping as in

Confirm Solr index corruption

2015-02-17 Thread Thomas Mathew
Hi All, I use Solr 4.4.0 in a master-slave configuration. Last week, the master server ran out of disk (logs got too big too quick due to a bug in our system). Because of this, we weren't able to add new docs to an index. The first thing I did was to delete a few old log files to free up disk spac

Re: Solrcloud sizing

2015-02-17 Thread Dominique Bejean
Thank you Erick. This was also my own opinion. 2015-02-18 7:12 GMT+01:00 Erick Erickson : > Well, it's really impossible to say, you have to prototype. Here's > something > explaining this a bit: > > https://lucidworks.com/blog/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer

Re: Boosting by calculated distance buckets

2015-02-17 Thread David Smiley
Raav, You may need to actually subscribe to the solr-user list. Nabble seems to not be working too well. p.s. I’m on vacation this week so I can’t be very responsive First of all... it's not clear you actually want to *boost* (since you seem to not care about the relevancy score), it seems you wa

Collations problem even term is available in documents.

2015-02-17 Thread Nitin Solanki
Hi, I am misspelling the query "hota hai" as "hota hain". In the collations, "hota hai" does not appear; instead, "hot main", "home have", etc. are returned. I have 37 documents where "hota hai" is present. *URL: *localhost:8983/solr/wikingram/spell?q=gram_ci:"hota hain"&wt=json&indent=true&