Re: problem with tutorial

2014-12-16 Thread Tomoko Uchida
Hi, At first, you should check "solr.log" file in your Solr running directory (maybe JVMs stacktraces in it.) You might able to find clues from logs. If you cannot solve the problem, post to mailing list again with your exact command (options too), and stacktrace log. Thanks, Tomoko 2014-12-16 1

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-16 Thread Ere Maijala
Do you have the jts libraries (e.g. jts-1.13.jar) in Solr's classpath (quoting from https://wiki.apache.org/solr/SolrAdaptersForLuceneSpatial4 "it needs to be in WEB-INF/lib in Solr's war file, basically")? --Ere 13.12.2014, 1.54, solr-user kirjoitti: I did find out the cause of my problems.

Re: Solr hangs on distributed updates

2014-12-16 Thread Peter Keegan
> A distributed update is streamed to all available replicas in parallel. Hmm, that's not what I'm seeing with 4.6.1, as I tail the logs on leader and replicas. Mark Miller comments on this last May: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201404.mbox/%3CetPan.534d8d6d.74b0dc51.

Re: Solr hangs on distributed updates

2014-12-16 Thread Shalin Shekhar Mangar
On Tue, Dec 16, 2014 at 11:34 AM, Peter Keegan wrote: > > > A distributed update is streamed to all available replicas in parallel. > > Hmm, that's not what I'm seeing with 4.6.1, as I tail the logs on leader > and replicas. Mark Miller comments on this last May: > > > http://mail-archives.apache.

Re: Solr hangs on distributed updates

2014-12-16 Thread Peter Keegan
> As of 4.10, commits/optimize etc are executed in parallel. Excellent - thanks. On Tue, Dec 16, 2014 at 6:51 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > > On Tue, Dec 16, 2014 at 11:34 AM, Peter Keegan > wrote: > > > > > A distributed update is streamed to all available replicas

Re: first time user

2014-12-16 Thread Jack Krupansky
My Solr Deep Dive e-book has full details and lots of examples for CSV indexing: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, December

Re: first time user

2014-12-16 Thread onyourmark
Thanks Jack. Can I ask, does it give a solution to my problem of the semicolons in the text and as delimiters? Bill On Tue, Dec 16, 2014 at 9:19 PM, Jack Krupansky-2 [via Lucene] < ml-node+s472066n4174529...@n3.nabble.com> wrote: > > My Solr Deep Dive e-book has full details and lots of examples

Re: first time user

2014-12-16 Thread Jack Krupansky
I believe the solution is simply that you need to become much more familiar with the capabilities of the tools that you are using. Asking for a specific solution isn't necessarily the best approach - it runs the risk of what we can "an XY problem", where you are asking us one thing, but the real

RE: WordBreakSolrSpellChecker Usage

2014-12-16 Thread Dyer, James
Matt, Seeing the response, my guess is you have "point" in your index, and that it has a higher frequency than "rockpoint". By default the spellchecker will never try to correct something that exists in your index. Adding "spellcheck.onlyMorePopular=true" might help, but only if the correctio

Re: All documents indexed into the same shard despite different prefix in id field

2014-12-16 Thread Will Miller
Thanks Chris... I changed the test and assigned a unique number to each document as the prefix and the documents did index across the two shards. I then increased the data set to include documents from all 6 expected shard keys and I do see them being indexed across both shards. I was just luck

Re: My new lemmatizer interfers with the highlighter

2014-12-16 Thread Erlend Garåsen
Thanks Ahmet, I think I have solved the problem, but I didn't replace the line you suggested. Instead I added the createToken method with AttributeSource.State as a parameter and overrode the reset method. I cannot reproduce the problem anymore. BTW, what's the purpose of AttributeSource.St

Re: Solr join not working in slorCloud env

2014-12-16 Thread Erick Erickson
Joins do are not supported when the various cores are not on the same node, see: https://wiki.apache.org/solr/DistributedSearch, the line: Doesn't support Join -- (see https://issues.apache.org/jira/browse/LUCENE-3759) Best, Erick On Mon, Dec 15, 2014 at 7:19 PM, ArnabK wrote: > For Eg: I have

Re: WordBreakSolrSpellChecker Usage

2014-12-16 Thread Matt Mongeau
James, Thanks so much, you were spot on and it's great to understand why I was getting the results I was. Solving this has been a breath of fresh air and I appreciate greatly the advice and assistance you have given! - Matt On Tue, Dec 16, 2014 at 9:24 AM, Dyer, James wrote: > > Matt, > > Seein

ApacheCon 2015 (April) UIMA Track

2014-12-16 Thread Marshall Schor
We are planning a UIMA Track at the next ApacheCon conference (being held in Austin Texas, April 13-17th, 2015). Topics / areas where talks are solicited include: - UIMA itself (including its subprojects), new features, directions, etc., which could be of interest to people using UIMA. - In

OutOfMemoryError

2014-12-16 Thread Trilok Prithvi
We are getting OOME pretty often (every hour or so). We are restarting nodes to keep up with it. Here is our setup: SolrCloud 4.10.2 (2 shards, 2 replicas) with 3 zookeepers. Each node has: 16GB RAM 2GB JVM (Xmx 2048, Xms 1024) ~100 Million documents (split among 2 shards - ~50M on each shard) So

Re: OutOfMemoryError

2014-12-16 Thread Shawn Heisey
On 12/16/2014 9:55 AM, Trilok Prithvi wrote: We are getting OOME pretty often (every hour or so). We are restarting nodes to keep up with it. Here is our setup: SolrCloud 4.10.2 (2 shards, 2 replicas) with 3 zookeepers. Each node has: 16GB RAM 2GB JVM (Xmx 2048, Xms 1024) ~100 Million documents

Identical query returning different aggregate results

2014-12-16 Thread David Smith
I have a prototype SolrCloud 4.10.2 setup with 13 collections (of 1 replica, 1 shard each) and a separate 1-node Zookeeper 3.4.6.   The very first app test case I wrote is failing intermittently in this environment, when I only have 4 documents ingested into the cloud. I dug in and found when I q

Re: OutOfMemoryError

2014-12-16 Thread Alexandre Rafalovitch
What's your queries look like? Especially FQs, facets, sort, etc. All of those things require caches of various sorts. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https

ANNOUNCE: CFP and Travel Assistance now open for ApacheCon North America 2015

2014-12-16 Thread Chris Hostetter
(NOTE: cross posted to several lucene lists, if you have replies, please confine them to general@lucene) -- Forwarded message -- In case you've missed it: - ApacheCon North America returns to Austin, Texas, 13-17 April 2015 http://apachecon.com/ - Call for Papers open until

Re: OutOfMemoryError

2014-12-16 Thread Trilok Prithvi
Thanks Shawn. We will increase the JVM to 4GB and see how it performs. Alexandre, Our queries are simple (with strdist() function in almost all the queries). No facets, or sorts. But we do a lot of data loads. We index data a lot (several documents, ranging from 10 - 10 documents) and we uploa

splitshard the collection time out:900s

2014-12-16 Thread Trilok Prithvi
500 900395 splitshard the collection time out:900s org.apache.solr.common.SolrException: splitshard the collection time out:900s at org.apache.solr.handler.admin.CollectionsHandler.handleResponse(CollectionsHandler.java:368) at org.apache.solr.handler.admin.CollectionsHandler.handleSplitShardA

Re: splitshard the collection time out:900s

2014-12-16 Thread Trilok Prithvi
Sorry... I sent without explaining the situation. We did splitshard: solr/admin/collections?action=SPLITSHARD&collection=anotherCollection&shard=shard1 and we got the above error. Any idea? On Tue, Dec 16, 2014 at 10:41 AM, Trilok Prithvi wrote: > > > > 500 > 900395 > > > splitshard the c

Re: Identical query returning different aggregate results

2014-12-16 Thread Alexandre Rafalovitch
Facet counts include deleted documents until the segments merge. Could that be an issue? Regards, Alex On 16/12/2014 12:18 pm, "David Smith" wrote: > I have a prototype SolrCloud 4.10.2 setup with 13 collections (of 1 > replica, 1 shard each) and a separate 1-node Zookeeper 3.4.6. > The ver

Re: Identical query returning different aggregate results

2014-12-16 Thread David Smith
Alex, Good suggestion, but in this case, no.  This example is from a cleanroom type test environment where the collections have very recently been created, there are only 4 documents total across all collections, and no delete's have been issued. Kind regards, David On Tuesday, December

Re: splitshard the collection time out:900s

2014-12-16 Thread Joseph Obernberger
Shard splits can take a long time - the 900 seconds is just the REST timeout. The split is still taking place. On Tue, Dec 16, 2014 at 12:43 PM, Trilok Prithvi wrote: > > Sorry... I sent without explaining the situation. > > We did splitshard: > > solr/admin/collections?action=SPLITSHARD&collect

Re: splitshard the collection time out:900s

2014-12-16 Thread Anshum Gupta
As Joseph mentioned, the shard split is still running the background. In case it fails (it shouldn't), if you're running Solr 4.8 or newer, I would recommend using the ASYNC calls for long running collections API calls as they have an accompanying REQUESTSTATUS Api call that gets you a confirmatio

Re: Identical query returning different aggregate results

2014-12-16 Thread Erick Erickson
bq: Facet counts include deleted documents until the segments merge Whoa! Facet counts do _not_ require segment merging to be accurate. What merging does is remove the _term_ information associated with deleted documents, and removes their contribution to the TF/IDF scores. David: Hmmm, what happ

Re: SolrCloud Collection creation timeout

2014-12-16 Thread E S J
Hi Shanaka, Try out this, http://:/solr/admin/collections?action=CREATE&name=&replicationFactor=<# OF REPLICATION>&numShards=<# OF SHARDS>&collection.configName=&maxShardsPerNode=<# OF MAX SHARDS>&wt=json&indent=2 ex : http://solr1.internal:7070/solr/admin/collections?action=CREATE&name=c-ins&re

Re: Identical query returning different aggregate results

2014-12-16 Thread David Smith
Hi Erick, Thanks for your reply. My test environment only has one shard and one replica per collection.  So, I think there is no possibility of replicas getting out of sync.  Here is how I create each (month-based) collection: http://192.168.59.103:8983/solr/admin/collections?action=CREATE&name=2

Re: Identical query returning different aggregate results

2014-12-16 Thread Erick Erickson
Ah, OK. I didn't get that when I read your first e-mail... Hmmm, this is still a puzzle then. Tail the respective Solr logs, you _should_ be seeing the sub-query go to each of them and the sub-query _should_ carry along all of the faceting information. Or this might just be a flat bug... Best, Er

Re: OutOfMemoryError

2014-12-16 Thread Trilok Prithvi
Shawn, looks like the JVM bump did the trick. Thanks! On Tue, Dec 16, 2014 at 10:39 AM, Trilok Prithvi wrote: > > Thanks Shawn. We will increase the JVM to 4GB and see how it performs. > > Alexandre, > Our queries are simple (with strdist() function in almost all the > queries). No facets, or sor

splitshard the collection time out:900s

2014-12-16 Thread Randy Castro
Hello, I'm experiencing the exact same issue. Unfortunately I'm using Solr 4.7 so the async call is not available to me. The only thing I could find in the log is the following entry: solr.log.1:INFO - 2014-12-16 21:49:02.783; org.apache.solr.handler.admin.CollectionsHandler; Splitting shard

Re: Identical query returning different aggregate results

2014-12-16 Thread Chris Hostetter
sounds like this bug... https://issues.apache.org/jira/browse/SOLR-6154 ...in which case it has nothing to do with your use of multiple collections, it's just dependent on wether or not the first node to respond happens to have a doc in every "range bucket" .. any bucket missing (because of y

Solr Node Resource allocation.

2014-12-16 Thread Elan Palani
Hello.. Let’s assume I have a SolrCloud with 10 node cluster . most of the time I want to New collection created with 2 shards and 2 replicas on (4 different nodes) using CloudSolrServer we could randomly find the nodes to allocate. but Is there a way to pick and choose the set of nodes base

Re: Identical query returning different aggregate results

2014-12-16 Thread David Smith
Chris, Yes, your suggestion worked.  Changing the parameter in my query from  ...f.eventDate.facet.mincount=1... to ...f.eventDate.facet.mincount=0... worked around the problem. And I agree that SOLR-6154 describes what I observed almost exactly.  Once 5.0 is available, I'll test this again

Re: Solr Node Resource allocation.

2014-12-16 Thread Erick Erickson
Identifying the "lease used" cores... no tools integrated with Solr that I know of. But once you do figure out what machines to use, the collections API CREATE command has a createNodeSet which will put the new collection on the specified nodes. And the ADDREPLICA command also allows you to speci

Re: Identical query returning different aggregate results

2014-12-16 Thread Erick Erickson
Wow, advancing senility... _I'm_ actually the person that committed that fix... Siiihhh. On Tue, Dec 16, 2014 at 5:38 PM, David Smith wrote: > Chris, > > Yes, your suggestion worked. Changing the parameter in my query from > > ...f.eventDate.facet.mincount=1... > > > to > > ...f.eventDate.f

Partial match autosuggest (match a word occurring anywhere in a field)

2014-12-16 Thread bbarani
Hi, I am trying to figure out a way to implement partial match autosuggest but it doesn't work in some cases. When I search for iphone 5s, I am able to see the below results. title_new:Apple iPhone 5s - 16GB - Gold but when I search for iphone gold (in title_new field), I am not able to see th

Re: Partial match autosuggest (match a word occurring anywhere in a field)

2014-12-16 Thread Ahmet Arslan
Hi BBrani, Yes it is possible. Create another field, say edgytext_partial, use whitespace tokenises this time. And query on both edgytext and edgytext_partial. you can even apply different boosts. Ahmet On Wednesday, December 17, 2014 2:44 AM, bbarani wrote: Hi, I am trying to figure ou

questions about BlockJoinParentQParser

2014-12-16 Thread Michael Sokolov
I'm trying to use BJPQP and ran into a few little gotchas that I'd like to share with y'all in case you have any advice. First I ran into an NPE that probably should be handled better - maybe just an exception with a better message. The framework I'm working in makes it slightly annoying to u

Re: Join in SOLR

2014-12-16 Thread Rajesh
Thanks Mikhail. As per what you have mentioned can I get a list of sub entities with this new Zipper join. Because in existing DIH I'm getting a list for individual fields of the sub entities. And also, I've not found DIH 5 jar anywhere. Is it still in development. -- View this message in con

Re: Get groups which has the number of elements greater than...

2014-12-16 Thread andreic9203
Hello again lboutros, The faceting seems to bring only the pivot field with the associated counting. It's ok, but it's not what I want. Do you know a way to bring also the documents? I don't know, look at this example Faceting-Pivot(DecisionTree)Faceting