Re: tipping point for using solrcloud—or not?

2017-10-02 Thread John Blythe
Nope, NRT is within seconds at most in several cases. Sounds like cloud needs to be whah we plan for. Thanks! On Mon, Oct 2, 2017 at 5:39 PM Erick Erickson wrote: > Short form: Use SolCloud from what you've described. > > NRT and M/S is simply oil and water. The _very_ best you can do when > se

Re: Is there a parsing issue with "OR NOT" or is something else going on? (Solr 6)

2017-10-02 Thread Erick Erickson
Solr does not (and never has) implemented pure boolean logic. See: https://lucidworks.com/2011/12/28/why-not-and-or-and-not/ I think your second clause is evaluated as though it were: ("batman" AND "indiana jones") OR (*:* -"cancer") which is much more what you want. Best, Erick On Mon, Oct 2,

Re: tipping point for using solrcloud—or not?

2017-10-02 Thread Erick Erickson
Short form: Use SolCloud from what you've described. NRT and M/S is simply oil and water. The _very_ best you can do when searching slaves is master's commit interval + slave polling interval + time to transmit the index to the slave + autowarming time on the slave. Now, that said, when you say N

Re: Keeping the index naturally ordered by some field

2017-10-02 Thread Erick Erickson
Have you looked at Streaming and Streaming Expressions? This is pretty much what they were built for. Since you're talking a billion documents, you're probably sharding anyway, in which case I'd guess you're using SolrCloud. That's what I'd be using first if at all possible. Best, Erick On Mon

Authentication error : request has come without principal. failed permission

2017-10-02 Thread Shamik Bandopadhyay
Hi, I'm seeing this random Authentication failure in our Solr Cloud cluster which is eventually rendering the nodes in "down" state. This doesn't seem to have a pattern, just starts to happen out of the blue. I've 2 shards, each having two replicas. They are using Solr basic authentication plugi

Re: Keeping the index naturally ordered by some field

2017-10-02 Thread alexpusch
The reason I'm interested in this is kind of unique. I'm writing a custom query parser and search component. These components go over the search results and perform some calculation over it. This calculation depends on input sorted by a certain value. In this scenario, regular solr sorting is insuf

Re: tipping point for using solrcloud—or not?

2017-10-02 Thread John Blythe
thanks for the responses, guys. erick: we do need NRT in several cases. also in need of HA pending where the line is drawn. we do need it relatively speaking, i.e. w/i our user base. if the largest of our cores falters then our business is completely stopped till we can get everything reindexed.

Re: solr.log rotation

2017-10-02 Thread Shawn Heisey
On 10/2/2017 8:39 AM, Noriyuki TAKEI wrote: > HI,All > > When I restart Solr Service, solr.log is rotated as below. > > solr.log.1 > solr.log.2 > solr.log.3 > ... > > I would like to stop this rotation. To keep Solr startup from rotating the logfile, you'll need to edit the bin/solr or bin\solr.cm

Is there a parsing issue with "OR NOT" or is something else going on? (Solr 6)

2017-10-02 Thread Michael Joyner
Hello all, What is the difference between the following two queries that causes them to give different results? Is there a parsing issue with "OR NOT" or is something else going on? a) ("batman" AND "indiana jones") OR NOT ("cancer") /*only seems to match the and clause*/ parsedquery=Boost

RE: solr.log rotation

2017-10-02 Thread Noriyuki TAKEI
Thanks for your quick reply!! -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: solr cloud without hard commit?

2017-10-02 Thread Erick Erickson
ramBufferSizeMB limits the amount of memory used for indexing and when it's exceeded the buffer is flushed to disk into a new segment. This is independent of hard/soft commits. Soft commits do not _force_ the in-memory structures to be written to a segment and do not update the segments file, hard

search request audit logging

2017-10-02 Thread Michal Hlavac
Hi, I would like to ask how to implement search audit logging. I've implemented some idea but I would like to ask if there is better approach to do this. Requirement is to log username, search time, all request parameters (q, fq, etc.), response data (count, etc) and important thing is to log a

RE: solr.log rotation

2017-10-02 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
My guess would be to edit server/resources/log4j.properties to have log4j.appender.file.MaxBackupIndex=0 -Original Message- From: Noriyuki TAKEI [mailto:nta...@sios.com] Sent: Monday, October 02, 2017 10:39 AM To: solr-user@lucene.apache.org Subject: solr.log rotation HI,All When I res

solr.log rotation

2017-10-02 Thread Noriyuki TAKEI
HI,All When I restart Solr Service, solr.log is rotated as below. solr.log.1 solr.log.2 solr.log.3 ... I would like to stop this rotation. Do you have Any idea? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

ANNOUNCE: Apache Solr Reference Guide for 7.0 released

2017-10-02 Thread Cassandra Targett
The Lucene PMC is pleased to announce that the Solr Reference Guide for 7.0 is now available. This 1,035-page PDF is the definitive guide to using Apache Solr, the search server built on Apache Lucene. The Guide can be downloaded from: https://www.apache.org/dyn/closer.cgi/lucene/solr/ref-guide/a

Re: Solr 7 default Response now JSON instead of XML causing issues

2017-10-02 Thread Emir Arnautović
Hi Roland, I guess you can use defaults in solr config to set wt to XML. Something like: xml You can also use useParams=“xml_out” and in your params.json have xml params defined group xml_out with wt: “xml” HTH, Emir > On 2 Oct 2017, at 13:58, Roland Villemoes wrote: > > H

Solr 7 default Response now JSON instead of XML causing issues

2017-10-02 Thread Roland Villemoes
Hi Default response in Solr 7 is now JSON instead of XML (https://issues.apache.org/jira/browse/SOLR-10494) We are using a system that use the Solr admin/cores api for core status etc. and we can't really change that system. That system expects the XML response. And as far as I can see default

Re: How to Index JSON field Solr 5.3.2

2017-10-02 Thread Emir Arnautović
Hi Sharma, I guess you are looking for nested documents: https://lucene.apache.org/solr/guide/6_6/uploading-data-with-index-handlers.html#UploadingDatawithIndexHandlers-NestedChildDocuments

Re: solr cloud without hard commit?

2017-10-02 Thread alessandro.benedetti
Hi Erick, you said : ""mentions that for soft commit, "new segments are created that will be merged"" Wait, how did that get in there? Ignore it, I'm taking it out. " but I think you were not wrong, based on another mailing list thread message by Shawn, I read : [1] "If you are using the corre

Re: tipping point for using solrcloud—or not?

2017-10-02 Thread Emir Arnautović
Hi John, Your data volume does not require SolrCloud, especially if you isolate core that is related to your business from other cores. You mentioned that the second largest is logs core used for analytics - not sure what sort of logs, but if write intensive logging, you might want to isolate th

How to Index JSON field Solr 5.3.2

2017-10-02 Thread Deeksha Sharma
Hi everyone, I have created a core and index data in Solr using dataImportHandler. The schema for the core looks like this: This is my data in mysql database: md5:"376463475574058bba96395bfb87" rules: {"fileRules":[{"file_id":1321241,"md5":"376463475574058bba96395bfb87",

Re: Distributed IDF configuration query

2017-10-02 Thread alessandro.benedetti
Hi Reth, there are some problem in the debug for the distributed IDF [1] Your case seems different though. It has been a while I experimented that feature but your config seems ok to me. What helped me a lot that time was to debug my Solr instance. [1] https://issues.apache.org/jira/browse/SOLR

Re: Keeping the index naturally ordered by some field

2017-10-02 Thread alessandro.benedetti
Hi Alex, just to explore a bit your question, why do you need that ? Do you need to reduce query time ? Have you tried enabling docValues for the fields of interest ? Doc Values seem to me a pretty useful data structure when sorting is a requirement. I am curious to understand why that was not an o