Solr 4.7.2 not recovering - "ClusterState says we are the leader, but locally we don't think so"

2015-03-19 Thread Guy Moshkowich
Hi, one morning my Solr server broke with this message below, it didn't recover on its own - had to restart it - Is that a 4.7.2 known issue? My topology is very simple: single Solr with a single shard replica, and an embedded ZK (-zkrun). Could it be related to a 4.8 fix: SOLR-5799: When regis

IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
Hi, When I started solr in cloud mode(interactive) and chose 2 nodes, it started and in the cloud-view screen it showed some different ip with url 169.254.5.207:7574, when clicked on that, it says page not found. When I modified url to localhost(http://localhost:7574/solr/#/~cloud) it worked(loade

Re: index duplicate records from data source into 1 document

2015-03-19 Thread Derek Poh
Hi Erick Am I right to saywe need todo the combine of duplicate records into 1 before feeding it to Solr to index? I am coming from Endecawhich support the combine of duplicate records into 1 recordduring indexing. Was wondering if Solr support this. -Derek On 3/18/2015 11:21 PM, Erick Eri

Start stop solr started in solr cloud mode

2015-03-19 Thread davidphilip cherian
Hi, I started solr in cloud mode (interactive set up). 3 nodes, 3 shards and 1 replica and a collection. I stopped it using ./solr stop -all. How do I get the same above cloud mode setup to start? "./solr -c start" started the new solr cloud instance all together where as I was looking for the pr

Re: IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
I think this is because of change in network ip address. I got it. Thanks. On Thu, Mar 19, 2015 at 1:32 PM, davidphilip cherian < davidphilipcher...@gmail.com> wrote: > Hi, > > When I started solr in cloud mode(interactive) and chose 2 nodes, it > started and in the cloud-view screen it showed s

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Alxeandre, Number of segment counts are different but document counts are same. With (soft commit - 300 and hardcommit - 6000) = No. of segment - 43 AND With (soft commit - 6 and hardcommit - 6) = No. of segment - 31 I dont' have any idea related to segment count

Documents cannot be searched immediately when indexed using REST API with Solr Cloud

2015-03-19 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr Cloud now, with 2 shards known as shard1 and shard2, and when I try to index rich-text documents using REST API or the default Documents module in Solr Admin UI, the documents that are indexed do not appear immediately when I do a search. It only appears after I restarted the So

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Erick.. I read your Article. Really nice... Inside that you said that for bulk indexing. Set soft commit = 10 mins and hard commit = 15sec. Is it also okay for my scenario? On Thu, Mar 19, 2015 at 1:53 AM, Erick Erickson wrote: > bq: As you said, do commits after 6 seconds >

Re: Documents cannot be searched immediately when indexed using REST API with Solr Cloud

2015-03-19 Thread Liu Bo
Hi Edvin Please review your commit/soft-commit configuration, "soft commits are about visibility, hard commits are about durability" by a wise man. :) If you are doing NRT index and searching, your probably need a short soft commit interval or commit explicitly in your request handler. Be a

Re: how to store _text field

2015-03-19 Thread Mirko Torrisi
Hi Erick, I'm sorry for this delay but I've just seen this reply. I'm using the last version of solr and the default setting is to use the new kind of indexing, it doesn't use schema.xml and for that I have no idea about how set "store" for this field. The content is grabbed because I've obtai

Connection pool shutdown error

2015-03-19 Thread phiroc
Hello, I am trying to use the 4.9.1 SOLR Core API and the 1.3.2.RELEASE version of the Spring Data SOLR API, to connect to a SOLR server, but to no avail. When I run Java application, I get the following errors: --- Exception in thread "main" org.springframework.data.s

Re: Connection pool shutdown error

2015-03-19 Thread Andrea Gazzarini
I bet the problem is how the SolrServer instance is used within Spring Repository. I think somewhere you should alternatively - explicitly close the client each time. - reuse the same instance (and finally close that) But being a Spring newbie I cannot give you further information. Best, Andre

Re: IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread Shawn Heisey
On 3/19/2015 2:02 AM, davidphilip cherian wrote: > When I started solr in cloud mode(interactive) and chose 2 nodes, it > started and in the cloud-view screen it showed some different ip with url > 169.254.5.207:7574, when clicked on that, it says page not found. When I > modified url to localhost(

Re: How to configure Solr to use ZooKeeper ACLs in order to protect it's content

2015-03-19 Thread Dmitry Karanfilov
Looks like it is still broken. The fixed name of system property zkCredentialsProvider and zkACLProvider are only impacted on the zkcli.sh script (org.apache.solr.cloud.ZkCLI). So using command bellow, I'm able to *bootstrap *and *upconfig *to the Zookeeper with appropriate credentials and ACLs:

Re: Solr Deleted Docs Issue

2015-03-19 Thread Shawn Heisey
On 3/19/2015 12:24 AM, vicky desai wrote: > I fail to understand why this deleted docs are not removed from index on > merging. Is there a good documentation which explains how exactly is merging > done? > > What can I do to solve this problem other than optimization? Deleted docs *are* removed by

Re: index duplicate records from data source into 1 document

2015-03-19 Thread Shawn Heisey
On 3/19/2015 2:09 AM, Derek Poh wrote: > Am I right to saywe need todo the combine of duplicate records into 1 > before feeding it to Solr to index? > > I am coming from Endecawhich support the combine of duplicate records > into 1 recordduring indexing. Was wondering if Solr support this. If you

Re: IP Address assgined to solr instance during the Cloud mode start

2015-03-19 Thread davidphilip cherian
Hi Shawn, Thanks you for the detailed explanation. On Thu, Mar 19, 2015 at 7:31 PM, Shawn Heisey wrote: > On 3/19/2015 2:02 AM, davidphilip cherian wrote: > > When I started solr in cloud mode(interactive) and chose 2 nodes, it > > started and in the cloud-view screen it showed some different i

Re: CloudSolrServer : Could not find collection : gettingstarted

2015-03-19 Thread Adnan Yaqoob
Erick Does the Solr admin UI>>cloud view show the gettingstarted collection? The "graph" view might help. It _sounds_ like somehow you didn't actually create the collection. [Adnan]- Yes What steps did you follow to create the collection in SolrCloud? It's possible you have the wrong ZK root some

Re: Start stop solr started in solr cloud mode

2015-03-19 Thread Adnan Yaqoob
David starting 1st node bin\solr start -cloud -p 8983 -s C:\Java\solr-5.0.0\example\cloud\node1\solr starting 2nd node -- bin\solr -cloud -p 7574 -s C:\Java\solr-5.0.0\example\cloud\node2\solr -z localhost:9983 The third would be similar to

Re: Solr returns incorrect results after sorting

2015-03-19 Thread kumarraj
*if the number of documents in one group is more than one then you cannot ensure that this document reflects the main sort Is there a way the top record which is coming up in the group is considered for sorting? We require to show the record from 212(even though price is low) in both the cases o

Re: Solr returns incorrect results after sorting

2015-03-19 Thread jim ferenczi
Then you just have to remove the group.sort especially if your group limit is set to 1. Le 19 mars 2015 16:57, "kumarraj" a écrit : > *if the number of documents in one group is more than one then you cannot > ensure that this document reflects the main sort > > Is there a way the top record whic

Re: CloudSolrServer : Could not find collection : gettingstarted

2015-03-19 Thread Chris Hostetter
: Does the Solr admin UI>>cloud view show the gettingstarted collection? : The "graph" view might help. It _sounds_ like somehow you didn't : actually create the collection. : [Adnan]- Yes if you can see the collection in the admin ui, can you please use the "Dump" menu option in the "Cloud" sec

Re: data import

2015-03-19 Thread abhishek tiwari
Hi , - architecture : master (1) - slave(3) solrconfig: 500 15000 false schema : < field name="selling_price" type="tfloat" indexed="true" stored="true" /> < field name="third_price" type="tfloat" indexed="true" stored="true" /> < field name="discount_percentage" type

Re: Have anyone used Automatic Phrase Tokenization (AutoPhrasingTokenFilterFactory) ?

2015-03-19 Thread James Strassburg
Sorry, I've been a bit unfocused from this list for a bit. When I was working with the APTF code I rewrote a big chunk of it and didn't include the inclusion of the original tokens as I didn't need it at the time. That feature could easily be added back in. I will see if I can find a bit of time fo

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Erick Erickson
That or even hard commit to 60 seconds. It's strictly a matter of how often you want to close old segments and open new ones. On Thu, Mar 19, 2015 at 3:12 AM, Nitin Solanki wrote: > Hi Erick.. > I read your Article. Really nice... > Inside that you said that for bulk indexing. Set s

Re: Documents cannot be searched immediately when indexed using REST API with Solr Cloud

2015-03-19 Thread Erick Erickson
The post jar issues a hard commit (openSearcher=true) as part of the operation. As Liu says, you are probably not committing the changes after ingestion. You can issue this from a browser: .solr/collection/update?commit=true to force a commit manually. Best, Erick On Thu, Mar 19, 2015 at 3:5

Re: how to store _text field

2015-03-19 Thread Erick Erickson
Hmm, not all that sure. That's one thing about schemaless indexing, it has to guess. It does the best it can, but it's quite possible that it guesses wrong. If this is a "mananged schema", you can use the REST API commands to make whatever field you want. Or you can start over with a concrete sche

Re: index duplicate records from data source into 1 document

2015-03-19 Thread Erick Erickson
bq: Am I right to saywe need todo the combine of duplicate records into 1 before feeding it to Solr to index? That's what I'd do. As Shawn says, if you simply fire them both at Solr the more recent one will replace the older one. Best, Erick On Thu, Mar 19, 2015 at 7:44 AM, Shawn Heisey wrote:

Re: data import

2015-03-19 Thread Shawn Heisey
On 3/19/2015 11:47 AM, abhishek tiwari wrote: > 500 You're doing soft commits as often as twice a second. You have configured 500 milliseconds here. This might have something to do with your slow indexing speed. A soft commit is less expensive than a full hard commit, but soft commits are *NO

Spatial Search killing Solr process

2015-03-19 Thread Henrique O. Santos
Hello all, I have a Solr 4.10.3 collection with ~55 million documents (index size about 6GB) with a LatLonType field and a dynamic field for storing the coordinates, like stated here https://wiki.apache.org/solr/SpatialSearch#Schema_Configuration

Re: CloudSolrServer : Could not find collection : gettingstarted

2015-03-19 Thread Chris Hostetter
: Chris, : Please find attached Dump nothing jumps out at me as looking odd, but i'm not the expert on this stuff either -- hopefully someone else can take a look. can you provide us with some more detials on what exactly you've done? you said ... : > : What steps did you follow to create th

Re: Spatial Search killing Solr process

2015-03-19 Thread david.w.smi...@gmail.com
Hi Henrique, Please see the Solr reference guide instead of the “community wiki” you referenced: https://cwiki.apache.org/confluence/display/solr/Spatial+Search (you can download one for 4.10; the online link is always for the latest). For spatial filtering, *especially* at-scale, you really sho

ApacheCon NA 2015 in Austin, Texas

2015-03-19 Thread Uwe Schindler
Dear Apache Lucene/Solr enthusiast, In just a few weeks, we'll be holding ApacheCon in Austin, Texas, and we'd love to have you in attendance. You can save $300 on admission by registering NOW, since the early bird price ends on the 21st. Register at http://s.apache.org/acna2015-reg ApacheCon

Re: Spatial Search killing Solr process

2015-03-19 Thread Henrique O. Santos
Thanks, David. I’m looking at it now. > On Mar 19, 2015, at 4:51 PM, david.w.smi...@gmail.com wrote: > > Hi Henrique, > > Please see the Solr reference guide instead of the “community wiki” you > referenced: > https://cwiki.apache.org/confluence/display/solr/Spatial+Search (you can > download o

Re: Facet pivot sorting while combining Stats Component With Pivots in Solr 5

2015-03-19 Thread Yonik Seeley
On Fri, Mar 13, 2015 at 1:43 PM, Dominique Bejean wrote: > Thank you for the response > > This is something Heliosearch can do. Ionic Seeley, created a JIRA ticket > to back port this feature to Solr 5. Oh, I'm charged now, am I? ;-) I'ts been committed, and will be in Solr 5.1 Here's an examp

Re: CloudSolrServer : Could not find collection : gettingstarted

2015-03-19 Thread Timothy Potter
Are you using a SolrJ client from 4.x to connect to a Solr 5 cluster? On Wed, Mar 18, 2015 at 1:32 PM, Adnan Yaqoob wrote: > I'm getting following exception while trying to upload document on > SolrCloud using CloudSolrServer. > > Exception in thread "main" org.apache.solr.common.SolrException:

Solr hangs / LRU operations are heavy on cpu

2015-03-19 Thread Sergey Shvets
Hi, we have quite a problem with Solr. We are running it in a config 6x3, and suddenly solr started to hang, taking all the available cpu on the nodes. In the threads dump noticed things like this can eat lot of CPU time - org.apache.solr.search.LRUCache.put​(LRUCache.java:116) - org.a

Re: CloudSolrServer : Could not find collection : gettingstarted

2015-03-19 Thread Adnan Yaqoob
Yes. Just before your email I was able to figure out. My project was set to user solrj 4.10.3 everything was working fine except cloud so I didn't noticed. After I switched to Solrj 5 it's working now Thanks everyone for supporting

Re: Documents cannot be searched immediately when indexed using REST API with Solr Cloud

2015-03-19 Thread Zheng Lin Edwin Yeo
Thank you for the information. Yes, the program is working correctly now and I can search for the documents immediately after issuing commit=true. Regards, Edwin On 20 March 2015 at 04:07, Erick Erickson wrote: > The post jar issues a hard commit (openSearcher=true) as part of the > operation

Re: Unable to index rich-text documents in Solr Cloud

2015-03-19 Thread Zheng Lin Edwin Yeo
Hi Shawn, Yes, I'm using the /update/extract handler. I'm not sure about the shards.qt parameter too. Regards, Edwin On 19 March 2015 at 13:18, Shawn Heisey wrote: > On 3/18/2015 1:22 AM, Zheng Lin Edwin Yeo wrote: > > I'm having some issues with indexing rich-text documents from the Solr > >

Re: Solr hangs / LRU operations are heavy on cpu

2015-03-19 Thread Umesh Prasad
It might be because LRUCache by default will try to evict its entries on each call to put and putAll. LRUCache is built on top of java's LinkedHashMap. Check the javadoc of removeEldestEntry

Re: data import

2015-03-19 Thread Midas A
Hi Shawn , Thanks for replying .. I need clarity on following points a) Making store false in schema for few fields will improve indexing time ? b) Does soft commit and hard commit configuration depends on hard ware ? c) Should i do merge factor , Rambuffersize configuration ? and how should i dec

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
On Fri, Mar 20, 2015 at 1:35 AM, Erick Erickson wrote: > That or even hard commit to 60 seconds. It's strictly a matter of how often > you want to close old segments and open new ones. > > On Thu, Mar 19, 2015 at 3:12 AM, Nitin Solanki > wrote: > > Hi Erick.. > > I read your Articl

Re: index duplicate records from data source into 1 document

2015-03-19 Thread Derek Poh
Oh that is how Solr works... On 3/19/2015 10:44 PM, Shawn Heisey wrote: On 3/19/2015 2:09 AM, Derek Poh wrote: Am I right to saywe need todo the combine of duplicate records into 1 before feeding it to Solr to index? I am coming from Endecawhich support the combine of duplicate records into 1

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Erick, I read mergeFactor Policy for indexing. By default, mergerFactor is 10. As said in document, High value merge factor (e.g., 25): - Pro: Generally improves indexing speed - Con: Less frequent merges, resulting in a collection with more index files which may slow searc