[ANNOUNCE] Apache Gora 0.8 Release

2017-09-20 Thread lewis john mcgibbney
Hi Folks, The Apache Gora team are pleased to announce the immediate availability of Apache Gora 0.8. The Apache Gora open source framework provides an in-memory data model and persistence for big data. Gora supports persisting to - column stores, - key value stores, - document stores,

Seeing very low ingestion performance for a single non-cloud Solr core

2017-09-20 Thread saiks
Hi, Environment: - Solr is running in non-cloud mode on 6.4.2, Sun Java8, Linux 4.4.0-31-generic x86_64 - Ingesting into a single core - SoftCommit = 5 seconds, HardCommit = 10 seconds - System has 16 Cpus and 32 Gb of memory (Solr is given 20 Gb of JVM heap) - text = StandardTokenizer, id = solr.

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Bill Oconnor
I have no clue where that number comes from it does not seem to be in the actual post to the leader as seen in my tcpdump. It is mystery. From: Walter Underwood Sent: Wednesday, September 20, 2017 7:00:53 PM To: solr-user@lucene.apache.org Subject: Re: Replicat

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Walter Underwood
> On Sep 20, 2017, at 6:15 PM, Bill Oconnor wrote: > > I restart using the standard "sudo service solr start/stop" You might look into what that actually does. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog)

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Bill Oconnor
Thanks everyone for the response. I do not think we changed anything other than the JVM memory size. I did leave out one piece of info - one of the host is a replicate in another shard. collection1 -> shard1 -> *h1, h2, h3, h4where star is leader collection2 -> shard1 -> *h5, h3 When

Re: Solr- Data search across multiple vores

2017-09-20 Thread Rick Leir
Harshal, You could send your Solr query to both cores but then you could have problems combining the results because the scores are not absolute: they just give a ranking in their own core. It might be ok, if you are searching on fields which are common to both cores. But I suspect that you wou

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Walter Underwood
1578578283947098112 needs 61 bits. Is it being parsed into a 32 bit target? That doesn’t explain where it came from, of course. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Sep 20, 2017, at 3:35 PM, Erick Erickson wrote: > > The numberformatex

Re: Indexes don't synch when node with old data is returned to cluster

2017-09-20 Thread Erick Erickson
this _should_ be OK. I'd expect the new nodes to see that more than 100 docs have been indexed and do a full sync. However, you can insure this by removing the entire data directory from the nodes that are down (rm -rf data). They'll come back up, do a full sync and start answering queries only af

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Erick Erickson
The numberformatexception is...odd. Clearly that's too big a number for an integer, did anything in the underlying schema change? Best, Erick On Wed, Sep 20, 2017 at 3:00 PM, Walter Underwood wrote: > Rolling restarts work fine for us. I often include installing new configs > with that. Here is

Re: SolrCloud Merge Indexes in Solr is running very slow

2017-09-20 Thread Erick Erickson
My very first question is whether you're _absolutely sure_ that the indexes you're merging have the same hash range. If not you're in for a world of hurt. You might try merging on local disks rather than in HDFS... Best, Erick On Wed, Sep 20, 2017 at 11:05 AM, Avinash Patil wrote: > > > I have

SolrCloud Merge Indexes in Solr is running very slow

2017-09-20 Thread Avinash Patil
I have a lot of data coming in SolrCloud and we create multiple collections dynamically after a collection threshold is reached.Currently to maintain fast search response speeds after 100M docs a new collection is triggered( 300G in HDFS ) . After SolrCloud ( CDH solr 4.10.3) reaches 150 - 200

Re: Replicates not recovering after rolling restart

2017-09-20 Thread Walter Underwood
Rolling restarts work fine for us. I often include installing new configs with that. Here is our script. Pass it any hostname in the cluster. I use the load balancer name. You’ll need to change the domain and the install directory of course. #!/bin/bash cluster=$1 hosts=`curl -s "http://${cl

Indexes don't synch when node with old data is returned to cluster

2017-09-20 Thread Joe Heasly
Hello, We have just moved from solr 4.6 master/slave to 6.4.2 SolrCloud. We have three collections, each with a single shard and a varying number of replicas, all kept by an ensemble of three zooKeepers (on their own hosts). As an ecommerce site, our capacity needs vary so we add and remove r

Replicates not recovering after rolling restart

2017-09-20 Thread Bill Oconnor
Hello, Background: We have been successfully using Solr for over 5 years and we recently made the decision to move into SolrCloud. For the most part that has been easy but we have repeated problems with our rolling restart were server remain functional but stay in Recovery until they stop tr

Re: question about an entry in the log file

2017-09-20 Thread kaveh minooie
Thanks Shalin, that was very helpfull. On 09/20/2017 01:02 PM, Shalin Shekhar Mangar wrote: That log shows that the searcher being opened is the "realtime" searcher as opposed to the "main" searcher. The realtime searcher is quite lightweight. It causes a flush of built index segments from the m

Re: question about an entry in the log file

2017-09-20 Thread Shalin Shekhar Mangar
That log shows that the searcher being opened is the "realtime" searcher as opposed to the "main" searcher. The realtime searcher is quite lightweight. It causes a flush of built index segments from the memory to the disk and opens a new searcher over them. No autowarming or fsync happens for realt

Rescoring from 0 - full

2017-09-20 Thread Dariusz Wojtas
Hi, When I use boosting fuctionality, it is always about adding or multiplicating the score calculated in the 'q' param. I mau use function queries inside 'q', but this may hit performance on calling multiple nested functions. I thaught that 'rerank' could help, but it is still about changing the o

Re: [ANNOUNCE] Apache Solr 7.0.0 released

2017-09-20 Thread Anshum Gupta
It’s strange but something seems to have stripped off all the formatting from the announce mail. Here’s a plain text version of the same and hope this is more readable. 20 September 2017, Apache Solr™ 7.0.0 available Solr is the popular, blazing fast, open source NoSQL search platform from the

[ANNOUNCE] Apache Solr 7.0.0 released

2017-09-20 Thread Anshum Gupta
20 September 2017, Apache Solr™ 7.0.0 available Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, rich document (e

Re: question about an entry in the log file

2017-09-20 Thread kaveh minooie
Hi Erick Thanks for your response. I understand your point, but what I was asking was does solr reopen searchers after a commit call even if the commit was called with openSearcher=false since this is what seems to be happening based on these log entries? Also, it seems that if autocommit is

Strange Behavior When Extracting Features

2017-09-20 Thread Michael Alcorn
Hi all, I'm getting some extremely strange behavior when trying to extract features for a learning to rank model. The following query incorrectly says all features have zero values: http://gss-test-fusion.usersys.redhat.com:8983/solr/access/query?q=added couple of fiber channel&rq={!ltr model=red

Re: Solr replication

2017-09-20 Thread Satyaprashant Bezwada
Thanks Eric, fixed the issue. The IT team corrected the solrconfig.xml but forgot to execute the zkcli.sh script on solr node. After I executed the script its working now. On 9/20/17, 10:20 AM, "Erick Erickson" wrote: WARNING - External email; exercise caution. Your solrconf

Re: Solr replication

2017-09-20 Thread Erick Erickson
Your solrconfig.xml file is mal-formed. The smoking gun is: Exception during parsing file: solrconfig.xml Best, Erick On Tue, Sep 19, 2017 at 4:48 PM, Satyaprashant Bezwada wrote: > Need some inputs or help in resolving replication across solr nodes. We have > installed Solr 6.5 in cloud mode

Re: question about an entry in the log file

2017-09-20 Thread Erick Erickson
First, I would not recommend you call commit from the client. It's usually far better to let your autocommit settings in solrconfig.xml deal with it. When you need to search, you either need to configure with true or set to something other than -1. https://lucidworks.com/2013/08/23/understandin

Re: no search results for specific search in solr 6.6.0

2017-09-20 Thread Erick Erickson
Just go to the admin/analysis page and enter the terms in the "index" box (I usually uncheck the "verbose" checkbox). You will see exactly what element in your analysis chain is doing this. You'll see light gray two-letter codes on the size, e.g. "ST". Hover over it with your mouse, and you should

TermVectors and ExactStatsCache

2017-09-20 Thread Patrick Plante
Hi! I have a SolrCloud 6.6 collection with 3 shards setup where I need the TermVectors TF and DF values when querying. I have configured the ExactStatsCache in the solrConfig: When I query "detector works" in my collection, it returns different docfreq values based on the shard the document

cannot create core when SSL is enabled

2017-09-20 Thread Younge, Kent A - Norman, OK - Contractor
Hello, I am getting an error message when trying to create a core when ssl is enabled ERROR: Certificate for doesn't match any of the subject alternative names: However, if I turn off ssl I can create the core just fine. I have my certificates in the solr-6.5.1 directory should they be plac

Re: Not able to import timestamp data into Solr

2017-09-20 Thread Susheel Kumar
Checkout this article for working with date types and format etc. http://lucene.apache.org/solr/guide/6_6/working-with-dates.html On Wed, Sep 20, 2017 at 6:32 AM, shankhamajumdar < shankha.majum...@lexmark.com> wrote: > Hi, > > I have a field with timestamp data in Cassandra for example - 2017-09

Re: no search results for specific search in solr 6.6.0

2017-09-20 Thread Sascha Tuschinski
Hello Erik and Josh, Thanks for your hints and comments. I found out that the “text_fr” field type didn’t stored the “fraoo” as term. It stored “frao” only. Maybe because of French field type. This field had been automatically created. I’m new to Solr and this is maybe correct. I use “text_gen

Not able to import timestamp data into Solr

2017-09-20 Thread shankhamajumdar
Hi, I have a field with timestamp data in Cassandra for example - 2017-09-20 10:25:46.752000+. I am not able to import the data using Solr DataImportHandler, getting the bellow error in the Solr log. Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: -1 I am abl

Re: Meet CorruptIndexException while shutdown one node in Solr cloud

2017-09-20 Thread wg85907
Hi Erick, Thanks for your advice about having openSearcher set to true unnecessary for my case. For CorruptIndexException issue, I think Solr should handle this quite well too. Because I always shutdown tomcat gracefully. Recently I did a couple of tests about this issue. When kee

Re: Cannot load LTRQParserPlugin inot my core

2017-09-20 Thread alessandro.benedetti
Hi Billy, there is a README.TXT in the contrib/ltr directory. Reading that you find this useful link[1] . >From that useful link you see where the Jar of the plugin is located. Specifically : Taking a look to the contrib and dist structure it seems quite a standard approach to keep the readme i