Re: Commit takes very long with NoSuchFileException

2017-08-03 Thread Nawab Zada Asad Iqbal
I noticed that after the manual commit, two shards had far fewer files than the third shard (which failed on commit). Over time, however, the number of files in the shard with more files continued to decrease. FWIW, each shard has exactly the same number of documents and similar

Metrics in 6.5.1 names and stuff

2017-08-03 Thread Walter Underwood
I’m trying to get what I want out of the metrics reporting in Solr. I want the counts and percentiles for each request handler in each collection. If I have “/srp”, “/suggest”, and “/seo”, I want three sets of metrics. I’m getting a lot of weird stuff. For counts for /srp in an eight node cluste

Re: Error when trying to replace node with Solr 6.6.0

2017-08-03 Thread Björn Häuser
Okay, after digging a little bit through the code, I think the problem is in this line: https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/cloud/OverseerCollectionMessageHandler.java?utf8=%E2%9C%93#L153

Commit takes very long with NoSuchFileException

2017-08-03 Thread Nawab Zada Asad Iqbal
Hi, I have a host with 3 solr processes running, each with one shard only; there are no replicas. I am reindexing some 100 GB of data per solr (or per shard since each solr has one shard). After about 3 hours, I manually committed once. I was able to get through 40 GB in each shard, and the commi

Error when trying to replace node with Solr 6.6.0

2017-08-03 Thread Björn Häuser
Hey folks, today we hit the same error three times: a REPLACENODE call was not successful. Here is our scenario: a 3-node SolrCloud cluster running in Kubernetes on top of AWS. Today we wanted to rotate the underlying storage (increased from 50 GB to 300 GB). After we rotated one node we tried

Re: mixed index with commongrams

2017-08-03 Thread David Hastings
Haven't really looked much into that. Here is a snippet from today's GC log, if you wouldn't mind shedding some light on it: 2017-08-03T11:46:16.265-0400: 3200938.383: [GC (Allocation Failure) 2017-08-03T11:46:16.265-0400: 3200938.383: [ParNew Desired survivor size 1966060336 bytes, new threshold

Re: mixed index with commongrams

2017-08-03 Thread Walter Underwood
How long are your GC pauses? Those affect all queries, so they make the 99th percentile slow with queries that should be fast. The G1 collector has helped our 99th percentile. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Aug 3, 2017, at 8:48 AM,

Re: mixed index with commongrams

2017-08-03 Thread David Hastings
Thanks, that's kind of what I expected. I'm still debating whether the space increase is worth it. Right now I'm at 0.7% of searches taking longer than 10 seconds and 6% taking longer than 1 second, so when I see things like this in the morning it bugs me a bit: 2017-08-02 11:50:48 : 58979/1000 secs : ("Rules

Re: mixed index with commongrams

2017-08-03 Thread Erick Erickson
bq: will that search still return results from the earlier documents as well as the new ones In a word, "no". By definition the analysis chain applied at index time puts tokens in the index and that's all you have to search against for the doc unless and until you re-index the document. You reall
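To illustrate the point about index-time analysis, here is a minimal Python sketch. It is a simplified model and not Lucene's actual CommonGrams implementation: the common-word list, the underscore-joined bigram form, and the function names are all assumptions made for the example. It only shows that a document indexed before the filter was enabled never receives the bigram tokens, so a query for such a token cannot match it.

```python
# Simplified model of common-grams indexing; NOT Lucene's actual implementation.
COMMON_WORDS = {"the", "of", "a"}  # hypothetical common-word list


def plain_tokens(text):
    """Analysis chain before the change: lowercase and split on whitespace."""
    return text.lower().split()


def common_grams_tokens(text):
    """Analysis chain after the change: unigrams plus bigrams around common words."""
    words = text.lower().split()
    tokens = list(words)
    for left, right in zip(words, words[1:]):
        if left in COMMON_WORDS or right in COMMON_WORDS:
            tokens.append(f"{left}_{right}")
    return tokens


# The index only ever holds the tokens produced when each document was indexed.
index = {
    "old_doc": set(plain_tokens("The cat sat")),
    "new_doc": set(common_grams_tokens("The cat sat")),
}


def search(token):
    """Return the ids of documents whose indexed tokens contain the given token."""
    return sorted(doc_id for doc_id, tokens in index.items() if token in tokens)


bigram_hits = search("the_cat")   # only the re-analyzed document has this token
unigram_hits = search("cat")      # both documents have the plain token
print(bigram_hits)
print(unigram_hits)
```

In this model the old document becomes findable by the bigram token only after it is re-indexed through the new chain, which matches the advice in the reply.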

mixed index with commongrams

2017-08-03 Thread David Hastings
Hey all, I have yet to run an experiment to test this but was wondering if anyone knows the answer ahead of time. If I have an index built with documents before implementing the commongrams filter, then enable it, and start adding documents that have the filter/tokenizer applied, will searches that

Re: High CPU utilization on Upgrading to Solr Version 6.3

2017-08-03 Thread Erick Erickson
Atita: Thanks for that update. I've opened SOLR-11188 to track this, please add any details to that JIRA that you can, in particular what bit of code you've identified as the problem. Also, do you have any document you can share that would cause this? I'm wondering if it's sensitive to the partic

Re: Ambiguous response on TrieDateField

2017-08-03 Thread Erick Erickson
Solr only deals with UTC times. My bet: you're seeing the _stored_ value of the time which is PDT. How are you indexing this field? You have to have something hanging around that converts the input to UTC... Best, Erick On Thu, Aug 3, 2017 at 2:48 PM, Imran Rajjad wrote: > Hello, > > I have obse
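As a rough illustration of how one instant can appear under two different calendar days, the following Python sketch converts a local time to the UTC string form with a trailing Z. The fixed UTC-7 offset standing in for PDT, the sample timestamp, and the helper name `to_solr_utc` are assumptions for the example, not values taken from the thread.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the client renders times at a fixed UTC-7 offset (PDT).
PDT = timezone(timedelta(hours=-7), "PDT")


def to_solr_utc(local_dt):
    """Format an aware datetime as a UTC timestamp with a trailing Z."""
    return local_dt.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


# Hypothetical value: 8 PM local time on August 2.
local = datetime(2017, 8, 2, 20, 0, 0, tzinfo=PDT)
utc_string = to_solr_utc(local)

print(local.isoformat())  # local rendering, day 2
print(utc_string)         # UTC rendering, day 3
```

Any evening time in a zone west of UTC rolls over to the next day once expressed in UTC, so two clients that format the same value differently can disagree by a day without either being wrong.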

Re: plus sign in request / looking for + in title

2017-08-03 Thread Erick Erickson
Take a look at your analysis chain. My bet is that the + is being stripped by some part of the chain. See the admin UI>>analysis page. Best, Erick On Aug 3, 2017 06:47, "d.ku...@technisat.de" wrote: > Hey, > > in our title we are having a word named "hd+". > Now I want to do a query right on th

plus sign in request / looking for + in title

2017-08-03 Thread d.ku...@technisat.de
Hey, our titles contain the word "hd+". Now I want to query for exactly this word, but when I do, Solr just looks for "hd" and ignores the plus sign. But I really need to search for the whole string. Of course I did URL-encode the plus sign: q=title:hd%2B Can please a
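For reference, a small Python sketch of what happens to the plus sign at the HTTP layer, using only the standard library. Whether the backslash escape is needed depends on the query parser in use, which is an assumption here; and even a correctly encoded query will not match if the field's analysis chain strips the "+" at index time.

```python
from urllib.parse import parse_qs, urlencode

# An unencoded "+" in a query string is decoded as a space.
raw_plus = parse_qs("q=title:hd+")["q"][0]

# "%2B" survives decoding as a literal plus sign.
encoded_plus = parse_qs("q=title:hd%2B")["q"][0]

# Assumption: the query parser treats "+" as an operator, so it is
# backslash-escaped before the whole value is URL-encoded.
escaped_query = urlencode({"q": "title:hd\\+"})

print(repr(raw_plus))
print(repr(encoded_plus))
print(escaped_query)
```

If the encoded request still behaves like a search for "hd", the remaining suspect is the analyzer rather than the URL encoding.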

Re: Get handler failure

2017-08-03 Thread Chris Ulicny
By 1 replica, I mean a single copy of the shard with no redundancy. We haven't encountered any problems with the testing environment solr instances, that weren't expected. At least that I'm aware of. I do have the logs saved from the time frame the issue occurred in if those would be useful. We'r

Re: Sentence level searching

2017-08-03 Thread Naveen33
Hi Michael, what you are looking for can be achieved in Solr, but not directly. You will have to write a custom query parser that uses the Lucene query parser. In the parser you will have to use span queries. SpanQuery1- your term1, term2, .termN and the range like standard its 50

Re: Get handler failure

2017-08-03 Thread Shawn Heisey
On 8/3/2017 6:30 AM, Chris Ulicny wrote: > I've run into an issue in a test environment where a document exists, but > fails to be retrieved consistently by /get requests. In a series of 10 > requests for the specific document across a few minute timespan, one of the > middle requests returned a nu

Get handler failure

2017-08-03 Thread Chris Ulicny
Hi all, I've run into an issue in a test environment where a document exists, but fails to be retrieved consistently by /get requests. In a series of 10 requests for the specific document across a few minute timespan, one of the middle requests returned a null document. Currently, nothing is upda

Re: Ambiguous response on TrieDateField

2017-08-03 Thread Shawn Heisey
On 8/3/2017 3:48 PM, Imran Rajjad wrote: > I have observed a difference of a day in TrieDateField when queried from Solr > Cloud web interface and SolrJ (Java API) > > Below is the query response from Web Interface > > { > "responseHeader":{ > "zkConnected":true, > "status":0, > "QTime

Re: Solr Pagination

2017-08-03 Thread Vincenzo D'Amore
Don't spend your time reading this, I've just found an answer in the documentation: > *One way to ensure that a document will never be returned more than once, > is to use the uniqueKey field as the primary (and therefore: only > significant) sort criterion. **In this situation, you will be guara
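The quoted guarantee can be sketched with a small in-memory simulation in Python. This is not Solr's API: the two paging functions and the sample ids are hypothetical, and the second function only mimics the idea of resuming after the last unique key seen, which is believed to be the principle behind Solr's cursor-based paging.

```python
def page_by_offset(ids, start, rows):
    """Offset paging: skip `start` documents in sorted order, return `rows`."""
    return sorted(ids)[start:start + rows]


def page_after_id(ids, last_id, rows):
    """Key-based paging: return `rows` documents sorted after `last_id`."""
    remaining = sorted(i for i in ids if last_id is None or i > last_id)
    return remaining[:rows]


# Offset paging while the collection changes between requests.
ids = {"a", "c", "d", "e"}
offset_seen = page_by_offset(ids, 0, 2)
ids.add("b")  # a document is added before the second page is fetched
offset_seen += page_by_offset(ids, 2, 2)

# Key-based paging under the same update.
ids = {"a", "c", "d", "e"}
keyed_seen = page_after_id(ids, None, 2)
ids.add("b")
keyed_seen += page_after_id(ids, keyed_seen[-1], 2)

print(offset_seen)  # contains "c" twice
print(keyed_seen)   # no document appears twice
```

In both runs the late-arriving document "b" is not returned, since it sorts before a position that was already passed; the key-based variant only guarantees that nothing is returned twice.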

Solr Pagination

2017-08-03 Thread Vincenzo D'Amore
Hi all, I have a collection that is frequently updated. Is it possible that a Solr Cloud query returns duplicate documents while paginating? Just to be clear, there is a collection with about 3M documents and a Solr query selects just 500K documents sorted by Id, which are returned simply pagi

SOLR Learning to Rank Questions

2017-08-03 Thread Joao Palotti
Dear all, First of all, I would like to thank you guys for the amazing job with SOLR. In particular, I highly appreciate the learning to rank plugin. It is fantastic work. I have two questions for the LTR people and I hope this mailing list is the right place for that. *1) This is a d

Ambiguous response on TrieDateField

2017-08-03 Thread Imran Rajjad
Hello, I have observed a difference of a day in TrieDateField when queried from Solr Cloud web interface and SolrJ (Java API) Below is the query response from Web Interface { "responseHeader":{ "zkConnected":true, "status":0, "QTime":22, "params":{ "q":"id:01af04e1-83ce-4e

Re: Limiting the number of queries/updates to Solr

2017-08-03 Thread Rick Leir
On 2017-08-02 11:33 PM, Shawn Heisey wrote: On 8/2/2017 8:41 PM, S G wrote: Problem is that peak load estimates are just estimates. It would be nice to enforce them from Solr side such that if a rate higher than that is seen at any core, the core will automatically begin to reject the reques

Re: Solr 4.10.4 export handler NPE

2017-08-03 Thread Lasitha Wattaladeniya
Thank you Erick for the reply. It is not possible to change from version 4.10.4; we have built a lot of functionality around version 4.10. I have decided to use the /select handler to fetch data incrementally and write it to a file. I think that will work. Regards, Lasitha On 2 Aug 2017 23:37, "Erick Ericks

Re: Solr Input and Output format

2017-08-03 Thread Rick Leir
Ranganath, I googled 'getRecordWriter solr' and came up with (among 446 results) this partial stack trace: at org.apache.solr.handler.component.HttpShardHandlerFactory.init(HttpShardHandlerFactory.java:168) at org.apache.solr.handler.component.ShardHandlerFactory.newInstance(ShardHandlerF