Re: Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-04 Thread dinesh naik
Thanks John, Erick and Furkan. I have already defined the ping request handler in solrconfig.xml as below: /select _root_:abc My question is regarding the custom query being used. Here I am querying for field _root_ which is available in all of my cluster and defined as a string field. The re

Re: Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-04 Thread Furkan KAMACI
Hi, You can change invariants i.e. *qt* and *q* of a *PingRequestHandler*: /search some test query Check documentation for more info: https://lucene.apache.org/solr/7_6_0//solr-core/org/apache/solr/handler/PingRequestHandler.html Kind Regards, Furkan KAMACI On Sat, Aug 3,

RE: SolrCloud recommended I/O RAID level

2019-08-04 Thread Kaminski, Adi
Erick and Shawn - thanks for the detailed explanation. We will try the number per your guidance that best represents both the ingestion flow (document indexing rate) and the searching flow with good QPS. We just need to start from some number in order to size the lab accordingly, since it takes quite

Re: SolrCloud recommended I/O RAID level

2019-08-04 Thread Erick Erickson
Shards are really not about maximizing QPS, they’re more about fitting all your data on your machines. Yes, there’s a tradeoff where if you have more shards, each is working on a smaller data set so potentially will work faster, but there’s a corresponding offset of having to do the bookkeeping.

Re: SolrCloud recommended I/O RAID level

2019-08-04 Thread Shawn Heisey
On 8/4/2019 8:53 AM, Kaminski, Adi wrote: Erick - thanks a lot for answering and sharing the below article, it's very helpful! I have another follow-up question - assuming we have 400 vCPUs across our SolrCloud cluster nodes, will it be better to have 400 shards with replication factor 2 or

RE: SolrCloud recommended I/O RAID level

2019-08-04 Thread Kaminski, Adi
Erick - thanks a lot for answering and sharing the below article, it's very helpful! I have another follow-up question - assuming we have 400 vCPUs across our SolrCloud cluster nodes, will it be better to have 400 shards with replication factor 2 or 200 shards with replication factor 4? What
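A quick arithmetic check on the two layouts in the question above: the total number of cores (replicas) is shards times replication factor, so both options put the same number of cores on the cluster. The sketch below is only bookkeeping, and `total_cores` is a hypothetical helper name, not a Solr API. It says nothing about which layout indexes or searches faster.

```python
def total_cores(shards: int, replication_factor: int) -> int:
    """Total replicas (cores) the cluster must host for one collection."""
    return shards * replication_factor


VCPUS = 400  # figure taken from the question

for shards, rf in [(400, 2), (200, 4)]:
    cores = total_cores(shards, rf)
    print(f"{shards} shards x RF {rf} = {cores} cores, "
          f"{cores / VCPUS:.1f} cores per vCPU")
```

Both layouts come out at 800 cores, i.e. 2 per vCPU, so the choice between them has to rest on other factors such as per-shard data size and the bookkeeping overhead of fanning a query out to more shards.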

Re: get the position of matched word in the response

2019-08-04 Thread Erick Erickson
I think you’re missing a nuance. It’s always a little confusing when people use quotes when talking about searching because in Solr double quotes are a very specific form of a query, i.e. a phrase query which means words must appear within some distance of each other (i.e. the ‘slop’). In Solr,

Re: get the position of matched word in the response

2019-08-04 Thread eli chen
thx of course they search for phrases. and if they searched "hello monkey" and solr found "hello my monkey", i want to get the position of "hello" and "monkey" (the words they actually typed in the search). and btw thx you all but i found https://github.com/dbmdz/solr-ocrhighlighting which i think

Re: get the position of matched word in the response

2019-08-04 Thread Alexandre Rafalovitch
What happens if they search for "hello monkey" and match against "hello my monkeys"? What should it return? Why does your database not contain "hello" instead of 199? I am asking because if your clients are truly searching for just one word, then Solr may be overkill for you. Perhaps you are lo

Re: get the position of matched word in the response

2019-08-04 Thread Erick Erickson
One approach: Payloads. You can store, with each word, an arbitrary amount of data. Of course the index is bigger… Most of the examples use a single float, which could be all you need. You can store an arbitrary binary blob and encode/decode it however you want. Conceivably you could store the co

Re: get the position of matched word in the response

2019-08-04 Thread eli chen
every content field is actually a book's content. so let's say someone searches for the word "hello" and i find this word in the book "the story jungle" at position 199 (counted by word, not char). now i can look at my database and check the OCR of this word in this book (and show highlight on the picture an
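To make "position counted by word, not char" concrete, here is a minimal sketch in plain Python. It assumes whitespace tokenization and case-insensitive exact matching, which is not how a Solr analyzer chain necessarily tokenizes, and `word_positions` is a hypothetical helper rather than anything Solr provides.

```python
from collections import defaultdict


def word_positions(content, terms):
    """Return {term: [word offsets]} for each searched term found in content.

    Offsets are zero-based and counted in words, not characters.
    """
    wanted = {t.lower() for t in terms}
    positions = defaultdict(list)
    for index, word in enumerate(content.lower().split()):
        if word in wanted:
            positions[word].append(index)
    return dict(positions)


print(word_positions("hello my monkey", ["hello", "monkey"]))
# {'hello': [0], 'monkey': [2]}
```

Offsets like these could then be used as keys into an external table of OCR coordinates, provided the table was built with the same tokenization.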

Re: get the position of matched word in the response

2019-08-04 Thread Erick Erickson
Eli: What problem are you trying to solve? There’s no really convenient way to do this that I know of, although it could be done, probably with some Lucene-level code. This may be an XY problem, where you're asking how to do X (find the position of the matched word) because you think it’ll help

get the position of matched word in the response

2019-08-04 Thread eli chen
hi i'm new to solr so please be patient. how can i get the position of the matched word in the results? and no, i'm not talking about highlighting the words. i'm talking about getting the position of the word in the content. i have a field content which i query with q=content:"some_word". the content field is n