questions regrading stored fields role in query time

2019-02-26 Thread Saurabh Sharma
Hi all, I am new here on this channel. A few days back we upgraded our SolrCloud to version 7.3 and are doing real-time document posting with a 15-second soft commit and a 2-minute hard commit. As of now we are posting the full document to Solr, which includes data accumulated from various sources. Now w

Re: questions regrading stored fields role in query time

2019-02-26 Thread Emir Arnautović
Hi Saurabh, Welcome to the channel! Storing fields should not affect query performance directly if you use lazy field loading, which is the default setting. And it should not affect it at all if you have enough RAM compared to index size. Otherwise OS caches might be affected by stored fields. The bes

High Availability with two nodes

2019-02-26 Thread Andreas Mock
Hi all, currently we are looking at Apache Solr as a solution for searching. One important component is high availability. I dug around and found out that HA is built in via SolrCloud, which means I have to install ZooKeeper in a production environment, which needs at least three nodes. So, now t

Re: questions regrading stored fields role in query time

2019-02-26 Thread Saurabh Sharma
Hi Emir, I had this question in mind: if I store my only returnable field as docValues in RAM, will my stored documents be referenced while constructing the response after the query? Ideally, as the field asked to be returned (i.e. fl) is already in RAM, documents on disk should not be consulted for t

Re: High Availability with two nodes

2019-02-26 Thread Jörn Franke
I would go for SolrCloud, but for simple active / passive scenarios you can use a simple HTTP load balancer with health checks. > On 26.02.2019 at 10:39, Andreas Mock wrote: > > Hi all, > > currently we are looking at Apache Solr as a solution > for searching. One important component is high
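The active / passive idea in the reply above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the node URLs and the `fake_probe` function are hypothetical stand-ins, and a real probe might instead request Solr's ping handler on each node. It does not address keeping the two indexes in sync.

```python
from typing import Callable, Iterable, Optional


def pick_backend(nodes: Iterable[str],
                 is_healthy: Callable[[str], bool]) -> Optional[str]:
    """Return the first node whose health check passes, or None.

    Nodes are tried in priority order, so the first entry acts as the
    active node and later entries act as passive standbys.
    """
    for node in nodes:
        try:
            if is_healthy(node):
                return node
        except OSError:
            # A refused or timed-out connection counts as unhealthy.
            continue
    return None


def fake_probe(node: str) -> bool:
    """Hypothetical health check that simulates the active node being down."""
    if node == "http://solr-a:8983":
        raise ConnectionRefusedError("active node is down")
    return True


# Hypothetical host names, listed in priority order.
NODES = ["http://solr-a:8983", "http://solr-b:8983"]
chosen = pick_backend(NODES, fake_probe)
print(chosen)
```

Passing the probe in as a function keeps the selection logic testable without a running Solr instance.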

Re: questions regrading stored fields role in query time

2019-02-26 Thread Emir Arnautović
Hi Saurabh, DocValues can be used for retrieving field values (note that order will not be preserved in the case of a multivalued field), but they are also stored in files, just in different structures. Doc values will load some structure in memory, but will also use memory-mapped files to access values (n

AW: High Availability with two nodes

2019-02-26 Thread Andreas Mock
Hi Jörn, thank you. What would this scenario look like? Single Server on both nodes. But how would you keep the indexes in sync? Best regards Andreas > -Original Message- > From: Jörn Franke > Sent: Tuesday, 26 February 2019 11:29 > To: solr-user@lucene.apache.org > Betref

Re: LTR feature based on other collection data

2019-02-26 Thread Kamal Kishore Aggarwal
It looks to me that I can modify the *SolrFeature* class, but I don't know how to create the IndexSearcher and SolrQueryRequest params as per the new request and second collection. @Override public FeatureWeight createWeight(*IndexSearcher searcher*, boolean needsScores, *SolrQueryRequest request*

Re: Spring Boot Solr+ Kerberos+ Ambari

2019-02-26 Thread Rushikesh Garadade
Hi, Thanks for the links. I have followed these steps earlier as well; however, I did not execute the steps for Ranger as I don't want authorization. I didn't get any success. That's why my question is: *Is Ranger mandatory when you just want authentication with Kerberos?* Thank you, Rushikesh Garad

RE: MLT and facetting

2019-02-26 Thread Martin Frank Hansen (MHQ)
Hi Edwin, Here it is: - - text 1 1 true Internal - KMD A/S -Original Message- From: Zheng Lin Edwin Yeo Sent: 26 February 2019 08:24 To: solr-user@lucene.apache.org Subject: Re: MLT and facetting Hi Martin, What is your setting in your /mlt requestHandler in solrconf

%solr_logs_dir% does not like spaces

2019-02-26 Thread Arturas Mazeika
Hi All, I am testing Solr 7.7 (and 7.6) under Windows. My aim is to direct logging to a subdirectory with spaces in its name, inside a directory that also has spaces in its name. If I set on Windows: setx /m SOLR_LOGS_DIR "f:\solr_deployment\logs" and start a Solr instance: F:\solr_deployment\solr-7.7.0\bin\solr.c

Suggester autocomplete for address information

2019-02-26 Thread Kehan Harman
Hi All, I'm new to Solr & the community so feel free to ignore / remove if this is the incorrect mailing list for this query. I'm trying to build an autocomplete using a Solr index for addresses in a format similar to: 123 Smith Street, KEMPSEY, NSW 2440 I'm looking to have these addresses sugg

AW: %solr_logs_dir% does not like spaces

2019-02-26 Thread paul.dodd
Looks like a bug in solr.cmd. You could try eliminating the spaces and/or opening an issue. Instead of ‘Program Files (x86)’ use ‘PROGRA~2’. And don’t have spaces in your subdirectory… NB: Depending on your Windows version you may have another alias for ‘Program Files (x86)’; use «dir /X» t

Re: %solr_logs_dir% does not like spaces

2019-02-26 Thread Arturas Mazeika
Hi Paul, getting rid of space in "program files" is doable, you are right. One way to do it is through - echo %programfiles% ==> C:\Program Files - echo %programfiles(x86)% ==> C:\Program Files (x86) Getting rid of spaces in sub directories is very difficult as we use tons of those for dif

Re: Suggester autocomplete for address information

2019-02-26 Thread Kehan Harman
I'd like to clarify that what I am looking for is the right field type for the address field that will suggest values as follows for the input: Input: "123 SM" Suggestions: - 123-127 SMITH STREET, KEMPSEY NSW 2440 - 123 SMYTHE STREET. RANDOM PLACE And in addition to this I want the se

AW: %solr_logs_dir% does not like spaces

2019-02-26 Thread paul.dodd
Perhaps the instances of %SOLR_LOGS_DIR% in the solr.cmd files should be quoted, i.e. "%SOLR_LOGS_DIR%"? Sent from Mail for Windows 10 From: Arturas Mazeika Sent: Tuesday, 26 February 2019 15:10 To: solr-user@l

Re: High Availability with two nodes

2019-02-26 Thread Walter Underwood
Solr Cloud automatically chooses a leader and a follower. I am not a fan of cold standby hosts, because you don’t really know whether they work. You have two hosts, so keep them both hot, put a load balancer in front of them, and send all the traffic to both of them all the time. If one fails, y

AW: High Availability with two nodes

2019-02-26 Thread Andreas Mock
Hi Walter, but I thought I need at least 3 ZooKeeper nodes? Is this not valid? I only have two servers. So, how can I have a two-server SolrCloud installation? Am I missing something? Best regards Andreas > -Original Message- > From: Walter Underwood > Sent: Tuesday, 26 Febr

Re: High Availability with two nodes

2019-02-26 Thread Walter Underwood
Yes, you need three Zookeeper nodes. You cannot have an HA Solr Cloud installation with only two hosts. The Zookeeper hosts do not need to be large. A master/slave configuration might be fine, but we need to know more before recommending that. How many documents? How big are they? How fresh doe
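The three-node requirement follows from ZooKeeper needing a strict majority of its ensemble to stay available. A small Python sketch of that arithmetic (the function names are illustrative, not part of any ZooKeeper or Solr API):

```python
def quorum_size(ensemble_size: int) -> int:
    """Smallest strict majority of an ensemble of the given size."""
    if ensemble_size < 1:
        raise ValueError("ensemble must have at least one node")
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size: int) -> int:
    """How many nodes can fail while a majority is still reachable."""
    return ensemble_size - quorum_size(ensemble_size)


for n in (1, 2, 3, 5):
    print(f"{n} node(s): quorum={quorum_size(n)}, "
          f"tolerated failures={tolerated_failures(n)}")
```

With two nodes the quorum is two, so losing either node stops the ensemble; a third node is the smallest size that survives one failure.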

Re: %solr_logs_dir% does not like spaces

2019-02-26 Thread Erick Erickson
If you can munge the solr.cmd file and it works for you, _please_ submit a JIRA and a patch! Most of the Solr devs develop on *nix boxes, so this kind of thing creeps in and we need to fix it. Best, Erick > On Feb 26, 2019, at 6:38 AM, paul.d...@ub.unibe.ch wrote: > > Perhaps the instances of

Re: SOLR Tokenizer “solr.SimplePatternSplitTokenizerFactory” splits at unexpected characters

2019-02-26 Thread Shawn Heisey
On 2/26/2019 12:18 AM, Stephan Damson wrote: If we take the example input "operative", the analyzer shows that during indexing, the input gets split into the tokens "ope", "a" and "ive", that is the tokenizer splits at the characters "r" and "t", and not at the expected whitespace characters (C

Re: questions regrading stored fields role in query time

2019-02-26 Thread Shawn Heisey
On 2/26/2019 1:34 AM, Saurabh Sharma wrote: Now we want to do partial updates. I went through the documentation and found that all the fields should be stored or docValues for partial updates. I have a few questions regarding this. 1) In case I am fetching only 1 field while making a query. What
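For readers unfamiliar with the partial (atomic) updates discussed in this thread, here is a minimal Python sketch of the JSON body such an update uses. The document ID and the field names `price` and `views` are hypothetical, and only a subset of Solr's update modifiers is accepted here; sending the payload to a collection's update handler is left out.

```python
import json

# Subset of Solr's atomic-update modifiers handled by this sketch.
ALLOWED_OPS = {"set", "add", "remove", "inc"}


def atomic_update(doc_id, changes, id_field="id"):
    """Build one atomic-update document.

    `changes` maps a field name to an (operation, value) pair, for example
    {"price": ("set", 99)}. Fields that are not mentioned are left untouched
    by Solr, which is why they must be stored or docValues.
    """
    doc = {id_field: doc_id}
    for field, (op, value) in changes.items():
        if op not in ALLOWED_OPS:
            raise ValueError(f"unsupported modifier: {op}")
        doc[field] = {op: value}
    return doc


# Hypothetical document ID and field names, for illustration only.
update_doc = atomic_update("doc-1", {"price": ("set", 99), "views": ("inc", 1)})
payload = json.dumps([update_doc])
print(payload)
```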

Re: High Availability with two nodes

2019-02-26 Thread Shawn Heisey
On 2/26/2019 2:39 AM, Andreas Mock wrote: currently we are looking at Apache Solr as a solution for searching. One important component is high availability. I dug around and found out that HA is built in via SolrCloud, which means I have to install ZooKeeper in a production environment which need

Re: questions regrading stored fields role in query time

2019-02-26 Thread Erick Erickson
It Depends (tm). See: SOLR-12598 for details. The short form is that as of Solr 7.5, Solr attempts to do the most efficient thing possible when fetching fields to return to the client. 1> if all requested fields are docValues, return from docValues. 2> if _any_ field is stored, return from the

Giving SolrJ credentials for Zookeeper

2019-02-26 Thread Snead, Ryan [USA]
I am following along with the example found in Zookeeper Access Control of the Apache Solr 7.5 Reference Guide. I have gotten to the point where I can use the zkcli.sh control script to access my secured Zookeeper environment. I can also connect using Zookeeper's zkCli.sh and then authenticate u

Re: SolrCloud fails to restart after rebooting

2019-02-26 Thread abhishek_itengg
Shawn, As you mentioned, it was indeed a problem with my network. Port 2888 was blocked, restricting ZooKeeper's communication with its peers. Thanks, Abhi -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

StreamingSolrClients intermittent Error SolrCloud setup

2019-02-26 Thread abhishek_itengg
Hi, I am using a SolrCloud setup with 3 Solr nodes. Intermittently we see StreamingSolrClients errors in the individual Solr node logs. These errors do resolve automatically, but they come back every now and then. We have 3 ZooKeepers and I verified that they have always maintained quorum. Is there a

Python Client for Solr Cloud - Leader aware

2019-02-26 Thread Ganesh Sethuraman
We are using Solr Cloud 7.2.1. Is there a leader-aware Python client (like SolrJ for Java) that can send updates to the leader and is highly available? I see the PySolr project (https://pypi.org/project/pysolr/), but I am not able to find any documentation on whether it supports leader-aware updates. Regards G
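One way to approximate leader awareness from Python is to read the shard leaders out of a Collections API CLUSTERSTATUS response. The sketch below only parses an already-fetched response; the sample data, host names, and collection name are made up for illustration, and it does not compute which shard a given document hashes to, so it is not a replacement for SolrJ's routing.

```python
def shard_leaders(cluster_status: dict, collection: str) -> dict:
    """Map each shard name to the core URL of its active leader replica."""
    shards = cluster_status["cluster"]["collections"][collection]["shards"]
    leaders = {}
    for shard_name, shard in shards.items():
        for replica in shard["replicas"].values():
            # CLUSTERSTATUS reports the leader flag as the string "true".
            if replica.get("leader") == "true" and replica.get("state") == "active":
                leaders[shard_name] = f'{replica["base_url"]}/{replica["core"]}'
    return leaders


# Hypothetical, trimmed-down CLUSTERSTATUS response for illustration.
SAMPLE_STATUS = {
    "cluster": {
        "collections": {
            "products": {
                "shards": {
                    "shard1": {
                        "replicas": {
                            "core_node1": {
                                "core": "products_shard1_replica_n1",
                                "base_url": "http://solr-a:8983/solr",
                                "state": "active",
                                "leader": "true",
                            },
                            "core_node2": {
                                "core": "products_shard1_replica_n2",
                                "base_url": "http://solr-b:8983/solr",
                                "state": "active",
                            },
                        }
                    }
                }
            }
        }
    }
}

leaders = shard_leaders(SAMPLE_STATUS, "products")
print(leaders)
```

Since any SolrCloud node forwards an update to the correct leader, sending to a non-leader still works; knowing the leader only saves the extra hop.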

Re: SolrCloud exclusive features

2019-02-26 Thread Arnold Bronley
Here is what I have found in my own little research. Please correct me if I am wrong. Also, please feel free to add more features. - Collections API - ConfigSets API - Zookeeper CLI - Streaming expressions - Parallel SQL interface - Authorization plugins - Blob store API On

Re: Overseer could not get tags

2019-02-26 Thread dshih
We are seeing the same issue running 7.4.0. Increasing the request and response header size did not resolve the issue. Should we open a JIRA ticket if one does not already exist?

Re: Overseer could not get tags

2019-02-26 Thread dshih
Opened SOLR-13274

Cannot get MBean info via JConsole

2019-02-26 Thread Yasufumi Mizoguchi
Hi, I want to access MBean information via JConsole with Solr 6.2. Right now I can get the information via MBeanRequestHandler, but not via JConsole from the same host that Solr runs on. So, how can I do it via JConsole? Any information about this would be greatly appreciated. Thanks, Yasufumi.

Re: MLT and facetting

2019-02-26 Thread Zheng Lin Edwin Yeo
Hi Martin, I also get the same problem in Solr 7.7 if I turn on faceting in /mlt requestHandler. Found this issue in the JIRA: https://issues.apache.org/jira/browse/SOLR-7883 Seems like it is a bug in Solr and it has not been resolved yet. Regards, Edwin On Tue, 26 Feb 2019 at 21:03, Martin Fra

RE: MLT and facetting

2019-02-26 Thread Martin Frank Hansen (MHQ)
Hi Edwin, Thanks for your response. Are you sure it is a bug? Or is it not meant to work together? After doing some thinking I do see a problem faceting a MLT-result. MLT-results have a clear ordering of the documents which will be hard to maintain with facets. How will faceting MLT-results de