Re: Fw: TolerantUpdateProcessorFactory not functioning

2020-06-10 Thread Hup Chen
There was another error, which I think should be an indexing error. The listprice below is a pdouble field; the update process didn't ignore the error when it was sent wrong data. Response: { "responseHeader":{ "status":400, "QTime":133551}, "error":{ "metadata":[ "error-

Re: How to determine why solr stops running?

2020-06-10 Thread Shawn Heisey
On 6/10/2020 12:13 PM, Ryan W wrote: People keep suggesting I check the logs for errors. What do those errors look like? Does anyone have examples of the text of a Solr oom error? Or the text of any other errors I should be looking for the next time solr fails? Are there phrases I should grep

Re: How to determine why solr stops running?

2020-06-10 Thread Hup Chen
I would check "dmesg" first, to look for any hardware error messages. Then use some system admin tools to monitor that server, for instance top, vmstat, lsof, iostat ... or simply install a free monitoring tool on the system, like monit, monitorix, or nagios. Good luck!

Re: [EXTERNAL] - SolR OOM error due to query injection

2020-06-10 Thread Isabelle Giguere
Hi Guilherme; The only thing I can think of right now is the number of non-alphanumeric characters. In the first 'q' in your examples, after resolving the character escapes, 1/3 of the characters are non-alphanumeric (* / = , etc). Maybe filter out queries that contain too many non-alphanumeric ch
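
The filtering idea in this message can be sketched as follows. This is only an illustration, not code from the thread: the function names and the 0.3 threshold are assumptions chosen for the example, and whitespace is left out of the count so that ordinary multi-word queries are not penalized.

```python
from urllib.parse import unquote


def non_alnum_ratio(raw_query: str) -> float:
    """Share of non-alphanumeric characters after resolving URL escapes."""
    decoded = unquote(raw_query)
    # Whitespace is ignored entirely (an assumption made for this sketch).
    chars = [ch for ch in decoded if not ch.isspace()]
    if not chars:
        return 0.0
    non_alnum = sum(1 for ch in chars if not ch.isalnum())
    return non_alnum / len(chars)


def looks_suspicious(raw_query: str, threshold: float = 0.3) -> bool:
    """Flag a query whose non-alphanumeric share exceeds the threshold.

    The threshold is hypothetical; it would need tuning on real traffic.
    """
    return non_alnum_ratio(raw_query) > threshold
```

A check like this would run in the application before the query reaches Solr, so that flagged requests can be rejected or logged instead of executed.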

Use case of UTILIZENODE API

2020-06-10 Thread ChienHuaWang
I am exploring the UTILIZENODE API to move replicas; its behavior depends on the preferences & autoscaling policies defined. But I am wondering what the priority is for its decision. Let's say I define maximize freedisk & minimize heapUsage, and also set-cluster-policy as in the example in the doc. {"replica": "<2",

SolR OOM error due to query injection

2020-06-10 Thread Guilherme Viteri
Hi, Environment: SolR 6.6.2, with org.apache.solr.solr-core:6.1.0. This setup has been running for at least 4 years without an OutOfMemory error. (It is never too late for an OOM…) This week, our search tool was attacked with ‘SQL injection’-like requests, and that led to an OOM. These requests

Re: [EXTERNAL] - Re: HTTP 401 when searching on alias in secured Solr

2020-06-10 Thread Isabelle Giguere
Hi Jan; Thank you for your reply. This is security.json as seen in Zookeeper. Credentials are admin / admin { "authentication":{ "blockUnknown":false, "realm":"MTM Solr", "forwardCredentials":true, "class":"solr.BasicAuthPlugin", "credentials":{"admin":"0rTOgObKYwzSyPoYuj2

Re: HTTP 401 when searching on alias in secured Solr

2020-06-10 Thread Jan Høydahl
Please share your security.json file. Jan Høydahl > On 10 Jun 2020, at 21:53, Isabelle Giguere wrote: > > Hi; > > I'm using Solr 8.5.0. I have uploaded security.json to Zookeeper. I can log > in to the Solr Admin UI. I can create collections and aliases, and I can index > documents in Solr.

HTTP 401 when searching on alias in secured Solr

2020-06-10 Thread Isabelle Giguere
Hi; I'm using Solr 8.5.0. I have uploaded security.json to Zookeeper. I can log in to the Solr Admin UI. I can create collections and aliases, and I can index documents in Solr. Collections : test1, test2 Alias: test (combines test1, test2) Indexed document "solr-word.pdf" in collection test1

Re: How to determine why solr stops running?

2020-06-10 Thread Ryan W
Hi all, People keep suggesting I check the logs for errors. What do those errors look like? Does anyone have examples of the text of a Solr oom error? Or the text of any other errors I should be looking for the next time solr fails? Are there phrases I should grep for in the logs? Should I be

RE: Timeout issue while doing update operations from clients (using SolrJ)

2020-06-10 Thread Kommu, Vinodh K.
We are getting the following socket timeout exception during this error. Any idea on this? ERROR (updateExecutor-3-thread-1392-processing-n:hostname:1100_solr x:TestCollection_shard6_replica_n10 c:TestCollection s:shard6 r:core_node13) [c:TestCollection s:shard6 r:core_node13 x:TestCollection_shard

Re: using solr to extract keywords from a long text?

2020-06-10 Thread Mikhail Khludnev
Hello, David. From the code, I notice that MoreLikeThisHandler consumes the request body when there's no ?q= and analyzes it to do what you are asking for. I see that the ref guide obscures this feature. On Wed, Jun 10, 2020 at 4:37 PM David Zimmermann wrote: > Dear solr community > > I’m suppose
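
A rough sketch of what such a request might look like is given here. It only builds the HTTP request and does not send it. The `/mlt` handler path, the collection name `mycollection`, and the field name `body_text` are hypothetical and depend on how the handler is registered in solrconfig.xml; `mlt.fl` and `mlt.interestingTerms` are MoreLikeThis parameters.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_mlt_request(handler_url: str, text: str, field: str) -> Request:
    """Build a POST that carries the text in the body and no q= parameter."""
    params = {
        "mlt.fl": field,                     # field(s) used for term extraction
        "mlt.interestingTerms": "details",   # ask for the terms with their scores
        "rows": 0,                           # similar documents are not needed here
        "wt": "json",
    }
    url = f"{handler_url}?{urlencode(params)}"
    return Request(
        url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )


# Hypothetical endpoint and field name, for illustration only.
req = build_mlt_request(
    "http://localhost:8983/solr/mycollection/mlt",
    "some long text to extract keywords from",
    "body_text",
)
```

Passing `req` to `urllib.request.urlopen` would send it; whether the response contains the expected terms depends on the Solr version and handler configuration, so this should be verified against a real instance.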

using solr to extract keywords from a long text?

2020-06-10 Thread David Zimmermann
Dear solr community I’m supposed to extract keywords from long texts. I do have a solr index with a lot of documents from the same domain as my texts. So, I was wondering if I can use solr to extract those keywords. Ideally I would want to use the TF-IDF based “importantTerms” from the “more lik

Re: Getting rid of zookeeper

2020-06-10 Thread matthew sporleder
FWIW -- zookeeper is pretty set-and-forget in my experience with settings like autopurge.snapRetainCount, autopurge.purgeInterval, and rotating the zookeeper.out stdout file. It is a big hassle to set up the individual myid files and keep them in sync with the server.$id=hostname in zoo.cfg but, ag

Timeout issue while doing update operations from clients (using SolrJ)

2020-06-10 Thread Kommu, Vinodh K.
Hi, We need some help fixing an intermittent timeout issue, please. Recently we came across this timeout issue during QA performance testing, when a streaming expression query that runs on a larger set of data (~60-80 million) from a client using SolrJ was timing out in exactly 2 minutes. Later this i

RE: Atomic updates with nested documents

2020-06-10 Thread Kaminski, Adi
Sure, np. We used the same workaround for a long period, but eventually it did impact our application performance very much, and partial atomic updates to the parent doc improved this significantly (20-30x compared with whole docs). Regards, Adi -----Original Message----- From: Ludger Steens Sent: Wednesday, June 1

AW: Atomic updates with nested documents

2020-06-10 Thread Ludger Steens
Hi Adi, thank you for your reply! Although I have to admit that this is not the response that I was hoping for 😊. Upgrading to Solr 8 is currently not possible for us because we found multiple issues when doing so (see http://mail-archives.apache.org/mod_mbox/lucene-solr-user/202005.mbox/%3Ce7d

Re: Getting rid of zookeeper

2020-06-10 Thread Jan Høydahl
Curator is just on the client (solr) side, to make it easier to integrate with Zookeeper, right? If you study Elastic, they had terrible cluster stability a few years ago since everything was too «dynamic» and «zero config». That led to the system outsmarting itself when facing real-life networ