Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-23 Thread Jan Høydahl
Don't need a git checkout to pull a text file :) https://github.com/apache/lucene-solr/blob/branch_7x/solr/bin/solr.cmd https://github.com/apache/lucene-solr/blob/branch_7x/solr/server/scripts/cloud-scripts/zkcli.bat

How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread zhenyuan wei
Hi all, I care about query performance, but I do not know how to find out why a query is so slow. *How to trace one query?* The debug/debugQuery info is not enough to find out why a query is slow. Thanks a lot~

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Jan Høydahl
Hi, With debugQuery you see the timings. What component spends the most time? With shards.info=true you see what shard is the slowest, if your index is sharded. With echoParams=all you get the full list of query parameters in use, perhaps you spot something? If you start Solr with -v option then
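The three diagnostic parameters named above can be sent together in one request. The following is only a minimal sketch: the host, the collection name `mycollection`, and the helper `build_debug_url` are assumptions made for illustration and do not come from the thread.

```python
from urllib.parse import urlencode


def build_debug_url(base_url, query):
    """Build a Solr query URL with the diagnostic parameters switched on."""
    params = {
        "q": query,
        "debugQuery": "true",   # adds the debug section, including timings
        "shards.info": "true",  # adds per-shard details for a sharded index
        "echoParams": "all",    # echoes every query parameter in use
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host and collection, for illustration only.
url = build_debug_url("http://localhost:8983/solr/mycollection/select", "*:*")
print(url)
```

Comparing the timing entries per component in the response is then a reasonable first step before looking at anything else.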

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread zhenyuan wei
Thank you very much for your answer. @Jan Høydahl My query is simple, just a wildcard for the last 2 characters in this query (I have more queries to optimize) curl " http://emr-worker-1:8983/solr/collection005/query?q=v10_s:YY*&rows=10&&fl=id&echoParams=all " { "responseHeader":{ "zkConnected"

Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-23 Thread Shawn Heisey
On 8/23/2018 2:42 AM, Jan Høydahl wrote: Don't need a git checkout to pull a text file :) https://github.com/apache/lucene-solr/blob/branch_7x/solr/bin/solr.cmd https://github.com/apache/lucene-solr/blob/branch_7x/solr/serv

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread zhenyuan wei
I have 4 Solr servers, each allocated 6GB. My dataset on HDFS is 787GB, 2 billion documents in total; each document is 300 bytes. The following is my cache-related configuration. 20 200 zhenyuan wei wrote on Thursday, August 23, 2018 at 5:41 PM: > Thank you very much for your answer. @Jan Høydahl > My query is simple, just wil

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Shawn Heisey
On 8/23/2018 3:41 AM, zhenyuan wei wrote: Thank you very much for your answer. @Jan Høydahl My query is simple, just a wildcard for the last 2 characters in this query (I have more queries to optimize) curl " http://emr-worker-1:8983/solr/collection005/query?q=v10_s:YY*&rows=10&&fl=id&echoParams=a

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Shawn Heisey
On 8/23/2018 4:03 AM, Shawn Heisey wrote: Configuring caches cannot speed up the first time a query runs.  That speeds up later runs.  To speed up the first time will require two things: 1) Ensuring that there is enough memory in the system for the operating system to effectively cache the in

Re: Want to start contributing.

2018-08-23 Thread Charlie Hull
On 20/08/2018 18:45, Rohan Chhabra wrote: Hi all, I am an absolute beginner (dummy) in the field of contributing open source. But I am interested in contributing to open source. How do i start? Solr is a java based search engine based on Lucene. I am good at Java and therefore chose this to star

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread zhenyuan wei
Thanks for your detailed answer, @Shawn. Yes, I run the query in SolrCloud mode, and my collection has 20 shards; each shard is 30~50GB. There are 4 Solr servers, and each Solr JVM uses 6GB; there are 4 HDFS datanodes too, and each datanode JVM uses 2.5GB. Linux server hosts are 4 nodes too, each node is 16 core/32GB RAM/1600G

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Shawn Heisey
On 8/23/2018 5:19 AM, zhenyuan wei wrote: Thanks for your detailed answer, @Shawn. Yes, I run the query in SolrCloud mode, and my collection has 20 shards; each shard is 30~50GB. There are 4 Solr servers, and each Solr JVM uses 6GB; there are 4 HDFS datanodes too, and each datanode JVM uses 2.5GB. Linux server hosts are 4

Re: How to trace one query?the debug/debugQuery info are not enough to find out why a query is slow

2018-08-23 Thread Jan Høydahl
Shawn, the block cache seems to be off-heap according to https://lucene.apache.org/solr/guide/7_4/running-solr-on-hdfs.html So you have 800G across 4 nodes, that gives 500M docs and 200G index data per solr node and 40G per sh
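The per-node and per-shard figures follow from simple division. A worked sketch, assuming the rounded 800G total used in the message, plus the 2 billion documents, 4 nodes, and 20 shards reported earlier in the thread (the variable names are illustrative only):

```python
# Figures reported in the thread; 800 is the rounded form of the 787GB dataset.
total_index_gb = 800
total_docs = 2_000_000_000
solr_nodes = 4
shards = 20

docs_per_node = total_docs // solr_nodes          # documents held by each Solr node
index_gb_per_node = total_index_gb / solr_nodes   # index data served by each node
index_gb_per_shard = total_index_gb / shards      # average size of one shard

print(docs_per_node, index_gb_per_node, index_gb_per_shard)
```

Each node has 32GB of RAM according to the thread, so the index data per node is several times larger than what the operating system could ever keep cached.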

SOLR zookeeper connection timeout during startup is hardcoded to 10000ms

2018-08-23 Thread Danny Shih
Hi, During startup in cloud mode, the SOLR zookeeper connection timeout appears to be hardcoded to 1000ms: https://github.com/apache/lucene-solr/blob/5eab1c3c688a0d8db650c657567f197fb3dcf181/solr/solrj/src/java/org/apache/solr/client/solrj/impl/ZkClientClusterStateProvider.java#L45 And it is not

Re: SOLR zookeeper connection timeout during startup is hardcoded to 10000ms

2018-08-23 Thread Erick Erickson
That's actually 10,000 ms, a typo in your message? Do you have a situation where that setting is causing you trouble? Because 10 seconds for communications with ZK is quite a long time, I'm curious what the circumstances are that you're seeing. Best, Erick On Wed, Aug 22, 2018 at 3:51 PM, Danny

Question on query time boosting

2018-08-23 Thread Pratik Patel
Hello All, I am trying to understand how exactly query time boosting works in solr. Primarily, I want to understand if absolute boost values matter or is it just the relative difference between various boost values which decides scoring. Let's take following two queries for example. // case1: q p

need help with a complicated join query

2018-08-23 Thread Steve Pruitt
At least it is complicated to me. :>) We are investigating how to return a list of documents whose identifier is contained in a multi-value field in another document. The index consists of essentially two different documents sharing some common fields. To make it simple, I will refer to them

Not possible to use NOT queries with Solr Export Handler?

2018-08-23 Thread Antelmo Aguilar
Hello, I asked this question in the IRC channel, but had to leave so was not able to wait for a response. So sending it through here instead with the hopes that someone can give me some insight on the issue I am experiencing. So in our Solr setup, we use the Solr Export request handler. Our use

Permission Denied when trying to connect to Solr running on a different server

2018-08-23 Thread cyndefromva
I have a Ruby on Rails application that uses Solr and the sunspot rails gem for search. For development I just run Solr locally and that's been working fine. But I'm trying to set up a stand-alone Solr server for production. So I installed it on its own server and created the core for my site. I up

Still not seeing Solr listening on 8983 after 30 seconds!

2018-08-23 Thread Abhijit Pawar
Hello All, Recently, with no change in the configuration or code, we are facing a slowdown of approx 3 minutes while restarting the SOLR instance. Earlier it used to come up in a few seconds; however, now it takes so long. *Error Message Displayed:* Waiting up to 30 seconds to see Solr running on port 8

Re: Permission Denied when trying to connect to Solr running on a different server

2018-08-23 Thread Shawn Heisey
On 8/23/2018 1:36 PM, cyndefromva wrote: But when I try to access search through my web application I'm getting Errno::EACCES Permission denied -- connect(2) for port 8983. This sounds like an error message from your rails app.  You may need to ask whoever created the Solr client that you are

Re: Still not seeing Solr listening on 8983 after 30 seconds!

2018-08-23 Thread Shawn Heisey
On 8/23/2018 2:02 PM, Abhijit Pawar wrote: Recently, with no change in the configuration or code, we are facing a slowdown of approx 3 minutes while restarting the SOLR instance. Earlier it used to come up in a few seconds; however, now it takes so long. *Error Message Displayed:* Waiting up to 30 seco

Re: Not possible to use NOT queries with Solr Export Handler?

2018-08-23 Thread Shawn Heisey
On 8/23/2018 1:12 PM, Antelmo Aguilar wrote: I asked this question in the IRC channel, but had to leave so was not able to wait for a response. So sending it through here instead with the hopes that someone can give me some insight on the issue I am experiencing. So in our Solr setup, we use th

Re: Permission Denied when trying to connect to Solr running on a different server

2018-08-23 Thread cyndefromva
I'm using the sunspot gem. It can't be Rails because I was able to index from the app server and I can search from the Rails console. It's just when I'm trying to access from the web application. And yes, my logs are in /var/solr/logs and there was nothing new there. It did write something when I

Re: Still not seeing Solr listening on 8983 after 30 seconds!

2018-08-23 Thread Abhijit Pawar
Hello, Here are the log files: solr.log: https://drive.google.com/open?id=1gvgUuPx5ItbBU7wvPXd9clGJqKWQdWSJ solr-8983-console.log: https://drive.google.com/open?id=1062seYIoRsLL5dcCU9OHbxx7hoH6armX Version of SOLR server is 5.4.1 For Heap Size not sure if this is useful: Num Docs:25837 Max Doc:25

Re: Question on query time boosting

2018-08-23 Thread Kydryavtsev Andrey
Hi, Pratik. I believe that your observations are correct. The score for each individual query (in your example it's a wildcard query like 'concept_name:(*semantic*)^200') is calculated by a complex formula (one possible implementation, with a good explanation, is described here https://lucene.a

Re: Solr unable to start up after setting up SSL in Solr 7.4.0

2018-08-23 Thread Zheng Lin Edwin Yeo
Thanks for the advice. Regards, Edwin On Thu, 23 Aug 2018 at 17:43, Shawn Heisey wrote: > On 8/23/2018 2:42 AM, Jan Høydahl wrote: > > Don't need a git checkout to pull a text file :) > > https://github.com/apache/lucene-solr/blob/branch_7x/solr/bin/solr.cmd < > https://github.com/apache/lucene

Re: Local development and SolrCloud

2018-08-23 Thread John Blythe
Thanks everyone. I think we forgot that cloud doesn’t have to be clustered. Avoiding that local overhead makes it a much easier pill to swallow as far as local performance goes (vs. having all the extra containers running in Docker). Will see what we can spin up and ask questions if/as they arise!

Re: need help with a complicated join query

2018-08-23 Thread damienk
I'm thinking something like this: q={!join v=id:doca_1 from=members to=id} On Fri, 24 Aug 2018 at 03:03, Steve Pruitt wrote: > At least it is complicated to me. :>) > > We are investigating how to find return a list documents whose identifier > is contained in a multi-value field in another doc
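The suggested join query contains braces, spaces, and colons, so it needs URL encoding when sent over HTTP. A minimal sketch, assuming a hypothetical host and collection name; only the query string itself is taken from the suggestion above:

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Query string exactly as suggested in the reply.
join_query = "{!join v=id:doca_1 from=members to=id}"

# Hypothetical host and collection, for illustration only.
base_url = "http://localhost:8983/solr/mycollection/select"
url = base_url + "?" + urlencode({"q": join_query})

# Decode the URL again to confirm the query survives encoding unchanged.
decoded = parse_qs(urlsplit(url).query)["q"][0]
print(url)
print(decoded)
```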

Re: Index Upgrader tool

2018-08-23 Thread damienk
Shawn, Is it possible to run optimize on the live collection? For example, /solr/collection/update?commit=true&optimize=true On Wed, 22 Aug 2018 at 06:50, Shawn Heisey wrote: > On 8/21/2018 2:29 AM, Artjoms Laivins wrote: > > We are running Solr cloud with 3 nodes v. 6.6.2 > > We started with ve