Re: SolrJ dependencies

2012-06-13 Thread Thijs
Done, see: https://issues.apache.org/jira/browse/SOLR-3541 On 12-6-2012 18:39, Sami Siren wrote: On Tue, Jun 12, 2012 at 4:22 PM, Thijs wrote: Hi I just checked out and built Solr and Lucene from branches/lucene_4x. I wanted to upgrade my custom client to this new version (using SolrJ). So I copied

Re: Solr PHP highload search

2012-06-13 Thread Erick Erickson
How much memory are you giving the JVM? Have you put a performance monitor on the running process to see what resources have been exhausted (i.e. are you I/O bound? CPU bound?) Best Erick On Tue, Jun 12, 2012 at 3:40 AM, Alexandr Bocharov wrote: > Hi, all. > > I need advice for configuring Solr

Re: Solr PHP highload search

2012-06-13 Thread Alexandr Bocharov
Thank you for the help :) I'm giving the JVM 2048M on each node. CPU load is jumping between 70% and 90%. Memory usage climbs to the maximum during testing (probably the cache is filling). I didn't monitor I/O. I'd like to see answers to my other questions. 2012/6/13 Erick Erickson > How much memory are you giving

Re: Exception when optimizing index

2012-06-13 Thread Robert Muir
On Thu, Jun 7, 2012 at 5:50 AM, Rok Rejc wrote: >   - java.runtime.name: OpenJDK Runtime Environment >   - java.runtime.version: 1.6.0_22-b22 ... > > As far as I see from the JIRA issue I have the patch attached (as mentioned > I have a trunk version from May 12). Any ideas? > it's not guaranteed that

Re: Solr PHP highload search

2012-06-13 Thread Erick Erickson
Consider just looking at it with jconsole (should be in your Java release) to get a sense of the memory usage/collection. How much physical memory do you have overall? Because this is not what I'd expect. Your CPU load is actually reasonably high, so it doesn't look like you're swapping. By and

Re: Sharding in SolrCloud

2012-06-13 Thread Lenzner
Mark Miller wrote on 12.06.2012 19:19:01: > > > On Jun 12, 2012, at 3:39 AM, lenz...@gfi.ihk.de wrote: > > > Hello, > > > > we tested SolrCloud in a setup with one collection, two shards and one > > replica per shard and it works quite fine with some example data. > > Now, we plan to set

Re: Different sort for each facet

2012-06-13 Thread Christopher Gross
Hmm, it seems that if I leave off the initial "facet.sort=index" then it will sort each by index by default, and I can use the "f.people.facet.sort=count" as expected. I thought I tried that yesterday, but I suppose it slipped my mind in my sleep-deprived state. Thanks Jack! -- Chris On Tue, J

LockObtainFailedException after trying to create cores on second SolrCloud instance

2012-06-13 Thread Daniel Brügge
Hi, I am struggling with creating multiple collections on a 4-instance SolrCloud setup: I have 4 virtual OpenVZ instances, with SolrCloud installed on each, and a standalone ZooKeeper also running on one of them. Loading the Solr configuration into ZK works fine. Then I start up the 4 in

Re: LockObtainFailedException after trying to create cores on second SolrCloud instance

2012-06-13 Thread Daniel Brügge
BTW: I am running the Solr instances using -Xms512M -Xmx1024M, so not so little memory. Daniel On Wed, Jun 13, 2012 at 4:28 PM, Daniel Brügge < daniel.brue...@googlemail.com> wrote: > Hi, > > am struggling around with creating multiple collections on a 4 instances > SolrCloud > setup: > > I have

Re: Different sort for each facet

2012-06-13 Thread Jack Krupansky
I'm glad that you have something working, but you shouldn't have to remove that facet.sort=index. I tried the following and it works with the Solr 3.6 example after I indexed with exampledocs/books.json: http://localhost:8983/solr/select/?q=*:*&facet=true&facet.field=name&facet.field=genre_s&
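The combination discussed in this thread (a global facet.sort=index with a per-field f.people.facet.sort=count override) can be sketched as a request URL. This is only an illustration, not the query either poster ran: the field names "people" and "genre_s" and the base URL are taken from the messages above, and the helper name build_facet_url is hypothetical.

```python
from urllib.parse import urlencode

# Global default: sort facet values in index (lexicographic) order.
# Per-field override: sort only the "people" facet by count.
params = [
    ("q", "*:*"),
    ("facet", "true"),
    ("facet.field", "people"),
    ("facet.field", "genre_s"),
    ("facet.sort", "index"),
    ("f.people.facet.sort", "count"),
]


def build_facet_url(base, params):
    """Join the base select URL and the URL-encoded parameters."""
    return base + "?" + urlencode(params)


url = build_facet_url("http://localhost:8983/solr/select/", params)
print(url)
```

A list of pairs is used instead of a dict so that facet.field can be repeated once per faceted field.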

Re: [DIH] Multiple repeat XPath stmts

2012-06-13 Thread alesp
TNX. A lifesaver... -- View this message in context: http://lucene.472066.n3.nabble.com/DIH-Multiple-repeat-XPath-stmts-tp499770p3989439.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Getting maximum / minimum field value - slow query

2012-06-13 Thread rafal.gwizd...@gmail.com
What is more, I tried to get the maximum value using a stats query. This time the response time was about 30 seconds and the server ate 1.5 GB of memory when calculating the response. But there were no statistics in the response: 0 27578 *.* true Id 0 What's wrong here?

Re: LockObtainFailedException after trying to create cores on second SolrCloud instance

2012-06-13 Thread Mark Miller
That's an interesting data dir location: NativeFSLock@/home/myuser/ data/index/write.lock Where are the other data dirs located? Are you sharing one drive or something? It looks like something already has a writer lock - are you sure another Solr instance is not running somehow? On Wed, Jun 13, 20

Re: LockObtainFailedException after trying to create cores on second SolrCloud instance

2012-06-13 Thread Casey Callendrello
What command are you using to create the cores? I had this sort of problem, and it was because I'd accidentally created two cores with the same instanceDir within the same SOLR process. Make sure you don't have that kind of collision. The easiest way is to specify an explicit instanceDir and dataD
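The kind of collision described here (two cores pointing at the same instanceDir) can be checked for before creating cores. The sketch below is only an assumption-laden illustration: the core names and paths are hypothetical, and it inspects a plain dictionary rather than any real Solr configuration or API.

```python
import os
from collections import defaultdict


def find_dir_collisions(cores):
    """Return {directory: [core names]} for directories used by more than one core.

    `cores` maps a core name to its instanceDir (hypothetical input format).
    """
    by_dir = defaultdict(list)
    for name, instance_dir in cores.items():
        # Normalize so "/opt/solr/a/" and "/opt/solr/a" count as the same directory.
        by_dir[os.path.normpath(instance_dir)].append(name)
    return {d: names for d, names in by_dir.items() if len(names) > 1}


# Hypothetical core definitions: collection1 and collection2 collide.
cores = {
    "collection1": "/opt/solr/cores/shared/",
    "collection2": "/opt/solr/cores/shared",
    "collection3": "/opt/solr/cores/three",
}
collisions = find_dir_collisions(cores)
print(collisions)
```

Any directory reported here would have two cores competing for the same index and write lock.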

Re: Getting maximum / minimum field value - slow query

2012-06-13 Thread Jack Krupansky
Try the query without the sort to get the number of rows, then do a second query using a "start" equal to the number of rows. That should get you the last row/document. -- Jack Krupansky -Original Message- From: rafal.gwizd...@gmail.com Sent: Wednesday, June 13, 2012 3:07 PM To: solr

Solr1.4 and threads ....

2012-06-13 Thread Benson Margulies
We've got a tokenizer which is quite explicitly coded on the assumption that it will only be called from one thread at a time. After all, what would it mean for two threads to make interleaved calls to the hasNext() function? Yet, a customer of ours with a gigantic instance of Solr 1.4 reports i

Re: Sharding in SolrCloud

2012-06-13 Thread Erick Erickson
Hmmm, are you sure SolrCloud fits your needs? You say that you think everything will fit on one shard and are worried about bulk updates. In that case I should think regular Solr master/slave (rather than cloud) might be a better fit. Using Cloud and all that goes with it for a single shard is cert

Re: FilterCache - maximum size of document set

2012-06-13 Thread Erick Erickson
Hmmm, I think you may be looking at the wrong thing here. Generally, a filterCache entry will be maxDocs/8 (plus some overhead), so in your case they really shouldn't be all that large, on the order of 3M/filter. That shouldn't vary based on the number of docs that match the fq, it's just a bitset.
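The maxDocs/8 rule of thumb can be worked through as arithmetic. This is a rough sketch under stated assumptions: the document count of 24 million is not from the thread and was chosen only because it yields about 3 MB per entry, the 16000 entry count is the filterCache maxSize mentioned later in this thread, and the worst case assumes every cached entry is a full bitset.

```python
def filter_entry_bytes(max_doc):
    """Size of one bitset entry: one bit per document, rounded up to whole bytes."""
    return (max_doc + 7) // 8


# Assumption: roughly 24 million documents, which gives about 3 MB per filter.
max_doc = 24_000_000
entry_bytes = filter_entry_bytes(max_doc)

# Upper bound if the cache filled up with 16000 full-bitset entries.
cache_entries = 16_000
worst_case_bytes = entry_bytes * cache_entries

print(entry_bytes)       # bytes per cached filter
print(worst_case_bytes)  # bytes for a completely full cache
```

The entry size depends only on the number of documents in the index, not on how many documents match the fq, which is the point made above.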

Re: Solr1.4 and threads ....

2012-06-13 Thread Robert Muir
On Wed, Jun 13, 2012 at 4:38 PM, Benson Margulies wrote: > > Does this suggest anything to anyone? Other than that we've > misanalyzed the logic in the tokenizer and there's a way to make it > burp on one thread? it might suggest the different tokenstream instances refer to some shared object tha

Re: Getting maximum / minimum field value - slow query

2012-06-13 Thread Erik Hatcher
A large start value is probably worse performing than the sort (see SOLR-1726). Once the sort field is cached, it'll be quick from then on. Put in a warming query in solrconfig for new and/or firstSearcher that does this sort and the cache will be built in advance of queries at least.
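The sort-based approach can be sketched as a pair of query strings: sort on the field, ask for a single row, and read the value from that row. The field name "Id" is an assumption based on the earlier message in this thread, and the helper names are hypothetical.

```python
from urllib.parse import urlencode


def extreme_value_query(field, direction):
    """Build query parameters that return the single document with the
    largest ("desc") or smallest ("asc") value of `field`."""
    if direction not in ("asc", "desc"):
        raise ValueError("direction must be 'asc' or 'desc'")
    return urlencode({
        "q": "*:*",
        "sort": f"{field} {direction}",
        "rows": 1,
        "fl": field,  # return only the field of interest
    })


def max_value_query(field):
    return extreme_value_query(field, "desc")


def min_value_query(field):
    return extreme_value_query(field, "asc")


print(max_value_query("Id"))
print(min_value_query("Id"))
```

Issuing the same sort as a warming query would populate the cache for the sort field before user traffic arrives.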

Re: FilterCache - maximum size of document set

2012-06-13 Thread Pawel Rog
Thanks for your response. Yes, maybe you are right. I thought that filters could be larger than 3M. Do all kinds of filters use a BitSet? Moreover, maxSize of filterCache is set to 16000 in my case. There are evictions during daytime traffic but not during night traffic. The version of Solr I use is 3.5. I

Regarding number of documents

2012-06-13 Thread Swetha Shenoy
Hi, I have a data config file that contains the data import query. If I just run the import query against MySQL, I get a certain number of results. I assume that if I run the full-import, I should get the same number of documents added to the index, but I see that it's not the case and the number

Re: Regarding number of documents

2012-06-13 Thread Swetha Shenoy
Note: I don't see any errors in the logs when I run the index. On Wed, Jun 13, 2012 at 5:48 PM, Swetha Shenoy wrote: > Hi, > > I have a data config file that contains the data import query. If I just > run the import query against MySQL, I get a certain number of results. I > assume that if I ru

Re: Regarding number of documents

2012-06-13 Thread Afroz Ahmad
Could it be that you are getting records that are not unique? If so, Solr would just overwrite the non-unique documents. Thanks Afroz On Wed, Jun 13, 2012 at 4:50 PM, Swetha Shenoy wrote: > Note: I don't see any errors in the logs when I run the index. > > On Wed, Jun 13, 2012 at 5:48 PM, S

Re: Regarding number of documents

2012-06-13 Thread Swetha Shenoy
That makes sense. But I added a new entry that showed up in the MySQL results and not in the Solr search results. The count of documents also did not increase after the addition. How can a new entry show up in MySQL results and not as a new document? On Wed, Jun 13, 2012 at 6:26 PM, Afroz Ahmad w

Re: Regarding number of documents

2012-06-13 Thread Jack Krupansky
Check the ID for that latest record and try to query it in Solr. One way you can get multiple records in an RDBMS query is via join. In that case, each of the records could have the same value in the column(s) that you are using for your unique key field in Solr. -- Jack Krupansky -Origi
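The effect described in this thread (a join producing several rows with the same unique key, which then overwrite each other in the index) can be simulated in a few lines. The rows and column names below are made up for illustration, and the dictionary only stands in for the index; it is not how Solr stores documents.

```python
from collections import Counter


def index_size_after_import(rows, key):
    """Simulate an import where a later row with the same unique key
    replaces the earlier one. Returns the resulting document count."""
    index = {}
    for row in rows:
        index[row[key]] = row
    return len(index)


def duplicated_keys(rows, key):
    """Return {key value: occurrences} for keys that appear more than once."""
    counts = Counter(row[key] for row in rows)
    return {k: n for k, n in counts.items() if n > 1}


# Hypothetical join result: id 1 appears twice because it has two tags.
rows = [
    {"id": 1, "title": "first", "tag": "a"},
    {"id": 1, "title": "first", "tag": "b"},
    {"id": 2, "title": "second", "tag": "a"},
]

print(len(rows))                            # rows returned by the SQL query
print(index_size_after_import(rows, "id"))  # documents left in the index
print(duplicated_keys(rows, "id"))          # keys responsible for the gap
```

Running the equivalent duplicate check against the real import query would show whether the row count and the document count differ for this reason.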

Re: Regarding number of documents

2012-06-13 Thread Gora Mohanty
On 14 June 2012 04:51, Swetha Shenoy wrote: > That makes sense. But I added a new entry that showed up in the MySQL > results and not in the Solr search results. The count of documents also did > not increase after the addition. How can a new entry show up in MySQL > results and not as a new docum

Re: Unexpected DIH behavior for onError attribute

2012-06-13 Thread Gora Mohanty
On 13 June 2012 10:45, Pranav Prakash wrote: > My DIH Config file goes as follows. We have two db hosts, one of which > contains blocks of content and the other contain transcripts of those > content blocks. The makeDynamicTranscript function is used to create row > names like transcript_en, trans

Re: LockObtainFailedException after trying to create cores on second SolrCloud instance

2012-06-13 Thread Daniel Brügge
Will check later to use different data dirs for the core on each instance. But because each Solr sits in its own OpenVZ instance (a virtual server, respectively) they should be totally separated. At least as far as I understand virtualization. Will check and get back here... Thanks. On Wed