Re: Solr metrics in Codahale metrics and Graphite?

2013-04-22 Thread Dmitry Kan
Hello Walter, Have you had a chance to get something working with graphite, codahale and solr? Has anyone else tried these tools with Solr 3.x family? How much work is it to set things up? We have tried zabbix in the past. Even though it required lots of up front investment on configuration, it

Re: Test harness can not load existing index data in Solr 4.2

2013-04-22 Thread zhu kane
I think the problem should be EmbeddedSolrServer can't load existing index data. Any committer can help confirm whether it's a bug or not. Thank you. Kane On Mon, Apr 15, 2013 at 7:28 PM, zhu kane wrote: > I'm extending Solr's *AbstractSolrTestCase* for unit testing. > > I have existing 's

Re: Dynamically loading Elevation Info

2013-04-22 Thread Saroj C
Thanks Ravi and Eric. Will try these options. Thanks and Regards, Saroj Kumar Choudhury Experience certainty. IT Services Business Solutions Outsourcing

Re: Solr 4.2 Startup Detects Corrupt Log And is Really Slow to Start

2013-04-22 Thread Umesh Prasad
Sorry for late reply. I was trying to change our indexing pipeline and do explicit intermediate commits for each core. That turned out to be a bit more work that I have time for. So, I do want to explore hard commits. I tried :/solr//*update?commit=true* . But there is no impact on Txn Log si

Re: Too many close, count -1

2013-04-22 Thread Chris Hostetter
: Can you tell what operations cause this to happen? ie: what does your configuration look like? are you using any custom plugins? what types of features of solr do you use (faceting, grouping, highlighting, clustering, dih, etc...) ? -Hoss

Re: Too many close, count -1

2013-04-22 Thread Yonik Seeley
Can you tell what operations cause this to happen? I've added a comment to https://issues.apache.org/jira/browse/SOLR-4749 where we're looking at some related issues around CoreContainer, but perhaps it should get it's own issue. -Yonik http://lucidworks.com On Mon, Apr 22, 2013 at 7:57 PM, yri

RE: Bug? JSON output changes when switching to solr cloud

2013-04-22 Thread David Parks
Thanks Yonik! That was fast! We switched over to XML for the moment and will switch back to JSON when 4.3 comes out. Dave -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Monday, April 22, 2013 8:18 PM To: solr-user@lucene.apache.org Su

SSLInitializationException on startup

2013-04-22 Thread Van Tassell, Kristian
I'm configuring a number of servers to support Solr 4.2 and have come across one that will not start. This is a pre-existing application server (running Tomcat) and I'm not quite sure what to look for. Has anyone seen this before and solved it? Thanks in advance! INFO: Creating new http client

Re: Support of field variants in solr

2013-04-22 Thread Alexandre Rafalovitch
To route different languages, you could use different request handlers and do different alias mapping. There are two alias mapping: On the way in for eDisMax: https://wiki.apache.org/solr/ExtendedDisMax#Field_aliasing_.2BAC8_renaming On the way out: https://wiki.apache.org/solr/CommonQueryParameter

RE: Export Index and Re-Index XML

2013-04-22 Thread Kalyan Kuram
Thank you all very much for your help.I do have field configured as stored and index,i did read the FAQ from wiki,I think SolrEntityProcessor is what i think needed.I am trying to index the data from Adobe CQ and its a push based indexing and pain to index data from a very large repository.I thi

Too many close, count -1

2013-04-22 Thread yriveiro
Hi, Reviewing the solr's log I found this message. The solr version is 4.2.1, running in a tomcat 7 4973652:SEVERE: Too many close [count:-1] on org.apache.solr.core.SolrCore@5795a627. Please report this exception to solr-user@lucene.apache.org 5003386:SEVERE: REFCOUNT ERROR: unreferenced org.ap

Re: Export Index and Re-Index XML

2013-04-22 Thread Jack Krupansky
Any fields which have stored values can be read and output, but indexed-only, non-stored fields cannot be read or exported. Even if they could be, their values are post-analysis, which means that there is a good chance that they cannot be run through term analysis again. It is always best to k

Re: Export Index and Re-Index XML

2013-04-22 Thread Shawn Heisey
On 4/22/2013 5:07 PM, Kalyan Kuram wrote: Hi All I am new to solr and i wanted to know if i can export the Index as XML and then re-index back into Solr, The reason i need to do this is i misconfigured fieldtype and to make it work i need to re-index the content The best option is to do the i

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Timothy Potter
Hi Sudhakar, Unfortunately, we don't know the underlying cause and I lost the logs that could have helped diagnose further. FWIW, I think this is an extreme case as I've lost nodes before and haven't had any discrepancies after recovering. In my case, it was a perfect storm of high throughput inde

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Sudhakar Maddineni
We had encountered similar issue few days back with 4.0- Beta version. We have 6 node - 3 shard cluster setup.And, one of our replica servers[tomcat] was not responding to any requests because it reached the max no of the threads[200 -default]. To temporarily fix the issue, we had to restart the se

Export Index and Re-Index XML

2013-04-22 Thread Kalyan Kuram
Hi AllI am new to solr and i wanted to know if i can export the Index as XML and then re-index back into Solr,The reason i need to do this is i misconfigured fieldtype and to make it work i need to re-index the content Kalyan

Re: ranking score by fields

2013-04-22 Thread Erick Erickson
You can sometimes use the highlighter component to do this, but it's a little tricky... But note your syntax isn't doing what you expect. (field1:apache solr) parses as field1:apache defaultfield:solr. You want field1:(apache solr) &debug=all is your friend for these kinds of things, especial

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Erick Erickson
bq: However what will happen to that 10 nodes when I specify replication factor? I think they just sit around doing nothing. Best Erick On Mon, Apr 22, 2013 at 7:24 AM, Furkan KAMACI wrote: > Sorry but if I have 10 shards and a collection with replication factor of 1 > and if I start up 30 nod

Re: Error creating collection

2013-04-22 Thread Erick Erickson
What version of Sor? More context for the stack trace? You might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Mon, Apr 22, 2013 at 5:33 AM, yriveiro wrote: > I get this exception when I try to create a new collection. someone have any > idea that what's going on?

Re: Soft Commit and Document Cache

2013-04-22 Thread Niran Fajemisin
Thanks Shawn and Mark! That was very helpful. -Niran > > From: Shawn Heisey >To: solr-user@lucene.apache.org >Sent: Monday, April 22, 2013 5:30 PM >Subject: Re: Soft Commit and Document Cache > > >On 4/22/2013 4:16 PM, Niran Fajemisin wrote: >> A quick (and ho

Re: Soft Commit and Document Cache

2013-04-22 Thread Shawn Heisey
On 4/22/2013 4:16 PM, Niran Fajemisin wrote: A quick (and hopefully simply) question: Does the document cache (or any of the other caches for that matter), get invalidated after a soft commit has been performed? All Solr caches are invalidated when you issue a commit with openSearcher set to

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Mark Miller
No worries, thanks for the info. Let me know if you gain any more insight! I'd love to figure out what happened here and address it. And I'm especially interested in knowing if you lost any updates if you are able to determine that. - Mark On Apr 22, 2013, at 5:02 PM, Timothy Potter wrote: >

Re: Soft Commit and Document Cache

2013-04-22 Thread Mark Miller
Yup - all of the top level caches are. It's a trade off - don't NRT more than you need to. - Mark On Apr 22, 2013, at 6:16 PM, Niran Fajemisin wrote: > Hi all, > > A quick (and hopefully simply) question: Does the document cache (or any of > the other caches for that matter), get invalidated

Soft Commit and Document Cache

2013-04-22 Thread Niran Fajemisin
Hi all, A quick (and hopefully simply) question: Does the document cache (or any of the other caches for that matter), get invalidated after a soft commit has been performed? Thanks, Niran

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-22 Thread Dyer, James
This doesn't make a lot of sense to me as in both cases the very first collation it tries is the one it is returning. So you're getting a very optimized spellcheck in both cases. But it does have to issue both queries 2 times: the first time, it tries the user's main query anding there are no

Solr dynamic fields scalability

2013-04-22 Thread jhuffaker
Hi All, I was curious how lucene/solr scale as the total number of non-stored fields grow. So, for example, if my average document has 50 fields on it, but the total number of fields in the system is upwards of 100k and I query on one of those fields: Will I see runtime that is proportional to th

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-22 Thread SandeepM
Chocolat Factry 0 77 1 0 8 615 chocolate 6544 5 9 15 6 factory 23614 factor 5128

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Timothy Potter
I ended up just nuking the index on the replica with less docs and restarting it - which triggered the snap pull from the leader. So now I'm in sync and have better processes in place to capture the information if it happens again, which given some of the queries my UI team develops, is highly like

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Mark Miller
What do you know about the # of docs you *should*? Do you have that mean when taking the bad replica out of the equation? - Mark On Apr 22, 2013, at 4:33 PM, Mark Miller wrote: > Bummer on the log loss :( > > Good info though. Somehow that replica became active without actually > syncing? Th

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Mark Miller
Bummer on the log loss :( Good info though. Somehow that replica became active without actually syncing? This is heavily tested (though not with OOM's I suppose), so I'm a little surprised, but it's hard to speculate how it happened without the logs. Specially, the logs from the node that is of

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-22 Thread Dyer, James
On both queries, set "spellcheck.extendedResults=true" and also "spellcheck.collateExtendedResults=true", then post the full spelling response. Also, how long does each query take on average with spellcheck turned off? James Dyer Ingram Content Group (615) 213-4311 -Original Message-

Solr Cloud 4.2 - Distributed Requests failing with NPE

2013-04-22 Thread Sudhakar Maddineni
Hi, We recently upgraded our solr version from 4.1 to 4.2 and started seeing below exceptions when running distributed queries: Any idea what we are missing here - http:// /solr/core1/select?q=*%3A*&wt=json&indent=true&shards=/solr/core1 http:// /solr/core1/select?q=*%3A*&wt=json&indent=true&sha

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-22 Thread SandeepM
James, Thanks. That was very helpful. That helped me understand count and alternativeTermCount a bit more. I also have the following case as pointed out earlier... My query: http://host/solr/select?q=&spellcheck.q=chocolat%20factry&spellcheck=true&df=spell&fl=&indent=on&wt=xml&rows=10&version=2

Support of field variants in solr

2013-04-22 Thread Timo Schmidt
Hi together, i am timo and work for a solr implementation company. During the last projects we came to know that we need to be able to generate different variants of a document.   Example 1 (Language):   To handle all documents in one solr core, we need a field variant for each language.   con

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Timothy Potter
nm - can't read my own output - the leader had more docs than the replica ;-) On Mon, Apr 22, 2013 at 11:42 AM, Timothy Potter wrote: > Have a little more info about this ... the numDocs for *:* fluctuates > between two values (difference of 324 docs) depending on which nodes I > hit (distrib=tru

Re: Dynamically loading Elevation Info

2013-04-22 Thread Chris Hostetter
: In-Reply-To: <1366609851170-4057812.p...@n3.nabble.com> : References: <1366383543826-4057312.p...@n3.nabble.com> : : <1366609851170-4057812.p...@n3.nabble.com> : Subject: Dynamically loading Elevation Info https://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists Wh

Re: updating documents unintentionally adds extra values to certain fields

2013-04-22 Thread Chris Hostetter
: I am using solr 4.2, and have set up spatial search config as below : : http://wiki.apache.org/solr/SpatialSearch#Schema_Configuration : : But everything I make an update to a document, : http://wiki.apache.org/solr/UpdateJSON#Updating_a_Solr_Index_with_JSON : : more values of the *_coordinat

Re: Dynamically loading Elevation Info

2013-04-22 Thread Ravi Solr
If you place the elevate.xml in the data directory of your index it will be loaded every time a commit happens. Thanks Ravi Kiran Bhaskar On Mon, Apr 22, 2013 at 7:38 AM, Erick Erickson wrote: > I believe (but don't know for sure) that the QEV file is re-read on > core reload, which the same a

Re: Rogue query killed several replicas with OOM, after recovering - match all docs query problem

2013-04-22 Thread Timothy Potter
Have a little more info about this ... the numDocs for *:* fluctuates between two values (difference of 324 docs) depending on which nodes I hit (distrib=true) 589,674,416 589,674,092 Using distrib=false, I found 1 shard with a mis-match: shard15: { leader = 32,765,254 replica = 32,764,930 d

spellcheck: change in behavior and QTime

2013-04-22 Thread SandeepM
I am using the same setup (solrconfig.xml and schema.xml) as stated in my prior message: http://lucene.472066.n3.nabble.com/DirectSolrSpellChecker-vastly-varying-spellcheck-QTime-times-tt4057176.html#a4057389 I am using SOLR 4.2.1 . Just wanted to report something wierd that I am seeing and would l

Re: SolrCloud Leaders

2013-04-22 Thread Jack Krupansky
Leader election will result from nodes coming up and going down as well as changes in network connectivity and even simply responsiveness between the nodes. A "quorum" is always needed. There may be other reasons as well that I don't know about. The point was simply that it is not a "leader" v

Re: SolrCloud Leaders

2013-04-22 Thread Otis Gospodnetic
If the current leader dies, somebody's got to take over. Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Apr 22, 2013 at 9:41 AM, Furkan KAMACI wrote: > Hi Jack; > > You said: "An hour from now some other replica may be the leader" > > What is the criteria to change a leade

Re: SolrCloud Leaders

2013-04-22 Thread Furkan KAMACI
Hi Jack; You said: "An hour from now some other replica may be the leader" What is the criteria to change a leader of a shard? 2013/4/15 Jack Krupansky > All nodes are replicas in SolrCloud since there are no masters. It's a > fully distributed model. A leader is also a replica. A leader is si

Re: ComplexPhraseQParserPlugin not working with solr 4.2

2013-04-22 Thread Ahmet Arslan
Hi ilay, Can you try ComplexPhrase-4.2.1.zip, it supposed to work with 4.2. --- On Mon, 4/22/13, ilay raja wrote: > From: ilay raja > Subject: Re: ComplexPhraseQParserPlugin not working with solr 4.2 > To: solr-user@lucene.apache.org, solr-...@lucene.apache.org > Date: Monday, April 22, 2013

Re: fuzzy search issue with PatternTokenizer Factory

2013-04-22 Thread Jack Krupansky
Once again, fuzzy search is completely independent of your analyzer or pattern tokenizer. Please use the Solr Admin UI Analysis page to debug whether the terms are what you expect. And realize that fuzzy search has a maximum editing distance of 2 and that includes case changes. -- Jack Krupans

Re: Bug? JSON output changes when switching to solr cloud

2013-04-22 Thread Yonik Seeley
Thanks David, I've confirmed this is still a problem in trunk and opened https://issues.apache.org/jira/browse/SOLR-4746 -Yonik http://lucidworks.com On Sun, Apr 21, 2013 at 11:16 PM, David Parks wrote: > We just took an installation of 4.1 which was working fine and changed it to > run as sol

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Jack Krupansky
"replicationFactor=2 will stop assigning nodes to that particular collection after there's a leader + 1 replica" They are both replicas, right? I mean, at any given moment one of the replicas will also have a role of "leader", but it's still a replica - in SolrCloud, that is, as opposed to ol

Re: is phrase search possible in solr

2013-04-22 Thread Jack Krupansky
"I want queries within double quotes to be ..." Just to be clear (as already stated), you do not get to set the semantics of quotes, which are set by the query parser and the analyzer for the field - if you want a different semantics, copy the data to another field and use that different seman

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Jan Høydahl
2) Does this mean that if you have one physical server with one Solr instance, and you try to create a collection with numShards=2&maxShardsPerNode=2 then it will succeed, putting three shards on the same node? If you then add another node, you still need to move one shard over to the new no

Re: Dynamically loading Elevation Info

2013-04-22 Thread Erick Erickson
I believe (but don't know for sure) that the QEV file is re-read on core reload, which the same app that modifies the elevator.xml file could trigger with an http request, see: http://wiki.apache.org/solr/CoreAdmin#RELOAD At least that's what I would try first. Best Erick On Mon, Apr 22, 2013 a

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Furkan KAMACI
Sorry but if I have 10 shards and a collection with replication factor of 1 and if I start up 30 nodes what happens to that last 10 nodes? I mean: 10 nodes as leader 10 nodes as replica if I don't specify replication factor there was going to be a round robin system that assigns other 10 machine

RE: Stats facet on int/tint fields

2013-04-22 Thread Michael Ryan
Sounds like this could be https://issues.apache.org/jira/browse/SOLR-2976. -Michael -Original Message- From: vinothkumar raman [mailto:vinothkr.k...@gmail.com] Sent: Monday, April 22, 2013 5:54 AM To: solr-user@lucene.apache.org; solr-...@lucene.apache.org Subject: Stats facet on int/tin

Re: is phrase search possible in solr

2013-04-22 Thread Erick Erickson
bq: wherein if I have a query in double quotes it simply ignores all the tokenizers and analyzers. Nope. In general you're quite right, you need to re-index whenever you change your schema... You could define the query part of your field to just use KeywordTokenizerFactory, but that would affect _

Re: Solr cloud and batched updates

2013-04-22 Thread Erick Erickson
Thanks Yonik! You see how behind the times I get On Sun, Apr 21, 2013 at 5:07 PM, Timothy Potter wrote: > That's awesome! Thanks Yonik. > > Tim > > On Sun, Apr 21, 2013 at 1:30 PM, Yonik Seeley wrote: >> On Sun, Apr 21, 2013 at 11:57 AM, Timothy Potter >> wrote: >>> There's no problem here

Re: Where to use replicationFactor and maxShardsPerNode at SolrCloud?

2013-04-22 Thread Erick Erickson
1) Imagine you have lots and lots and lots of different Solr indexes and a 50 node cluster. Further imagine that one of those indexes has 2 shards, and a leader + shard is adequate to handle the load. You need some way to limit the number of nodes your index gets distributed to, that's what replica

Stats facet on int/tint fields

2013-04-22 Thread vinothkumar raman
I have a schema like this I wanted to find the average price faceted on cat. So was using the stats facet to get the average on the fields like this http://solr-serv/solr/latest/select?q=*%3A*&wt=xml&indent=true&stats=true&rows=0&stats.field=price&stats.facet=cat Which throws an exception li

The overseer is stucks

2013-04-22 Thread yriveiro
Hi,My overseer has enqueued more than 1 task and apparently is stuck. Exists any way to force to do the enqueued tasks?A screenshot of the overseer queue here - Best regards -- View this message in context: http://lucene.472066.n3.nabble.com/The-overse

Severe errors in log

2013-04-22 Thread yriveiro
I have got this in my logs. What's that mean? ConcurrentLRUCache was not destroyed prior to finalize(),​ indicates a bug -- POSSIBLE RESOURCE LEAK!!! - Best regards -- View this message in context: http://lucene.472066.n3.nabble.com/Severe-errors-in-log-tp4057860.html Sent from the Solr -

Error creating collection

2013-04-22 Thread yriveiro
I get this exception when I try to create a new collection. someone have any idea that what's going on? org.apache.solr.common.SolrException: Error CREATEing SolrCore 'RPS_12': Could not get shard_id for core: RPS_12 coreNodeName:192.168.20.48:8983_solr_RPS_12 - Best regards -- View this me

Re: ComplexPhraseQParserPlugin not working with solr 4.2

2013-04-22 Thread ilay raja
I was able to solve the previous problem of not loading COmplexPhraseQParserPlugin. Still I am able to run this with defType=complexphrase: java.lang.NoSuchMethodError: org.apache.solr.search.QueryParsing.getQueryParserDefaultOperator(Lorg/apache/solr/schema/IndexSchema;Ljava/lang/String;)Lorg/apac

Re: Pros and cons of using RAID or different RAIDS?

2013-04-22 Thread Toke Eskildsen
On Mon, 2013-04-22 at 02:04 +0200, Shawn Heisey wrote: > Aside from cost, the main reason that I have not seriously investigated > SSD drives is because I have not come across a solution for any level of > RAID (even RAID1) with SSDs that exposes TRIM to the operating system. > Without reliable TRI

Re: Max http connections in CloudSolrServer

2013-04-22 Thread J Mohamed Zahoor
On 18-Apr-2013, at 9:43 PM, Shawn Heisey wrote: > Are you using the Jetty included with Solr, or a Jetty installed separately? I am using the Jetty that comes with Solr. > The Jetty included with Solr has a maxThreads value of 1 in its config. > The default would be closer to 200, an

RE: external values source

2013-04-22 Thread Maciej Liżewski
Hi Timothy, Thank you for your answer - it is really helpful. Just to clarify - when using ValueSource then flow is something like this: - user sends query - solr calls ValueSource to prepare values for every document (this part is cached in ExternalFileField implementation I guess) - solr runs

ranking score by fields

2013-04-22 Thread Каскевич Александр
Hi. I want to make subject but don't know exactly how can I do it. Example. I have index with field1, field2, field3. I make a query like: (field1:apache solr) OR (field2:apache solr) OR (field3:apache solr) And I want to know: is it found this doc by field1 or by field2 or by field3? I try to mak

Re: ComplexPhraseQParserPlugin not working with solr 4.2

2013-04-22 Thread Ahmet Arslan
Hi ilay, You cannot load this plugin via lib directives. e.g. You need to embbed this jar into solar.war file. (by unzip and zip) There should be a ReadMe file inside the latest attachment in Jira. -- On Sat, 4/20/13, ilay raja wrote: > From: ilay raja > Subject: ComplexPhraseQParserPlug

Re: fuzzy search issue with PatternTokenizer Factory

2013-04-22 Thread meghana
Jack, the regex will split tokens by anything expect alphabets , numbers, '&' , '-' and ns: (where n is number from 0 to , e.g 4323s: ) Lets say for example my text is like below. *this is nice* day & sun 53s: is risen. * Then pattern tokenizer should create tokens as *this is nice day