Re: EXT: Re: Solr Query Performance benchmarking

2017-05-05 Thread Suresh Pendap
Thanks everyone for taking time to respond to my email. I think you are correct in that the query results might be coming from main memory as I only had around 7k queries. However it is still not clear to me, given that everything was being served from main memory, why is that I am not able to push

Re: Automatic conversion to Range Query

2017-05-05 Thread Aman Deep Singh
I'm using a custom request handler with deftype as edismax My query is - xiomi Mi 5 -white [64GB/ 3GB] On 06-May-2017 12:48 AM, "Erick Erickson" wrote: OK, what _request handler_ are you using? what is the original query? On Fri, May 5, 2017 at 11:42 AM, Aman Deep Singh wrote: > Hi Erick, > I

Re: Automatic conversion to Range Query

2017-05-05 Thread Erick Erickson
OK, what _request handler_ are you using? what is the original query? On Fri, May 5, 2017 at 11:42 AM, Aman Deep Singh wrote: > Hi Erick, > I don't want to do the range query , > That is why I'm using the pattern replace filter to remove all the non > alphanumeric to space so that this type of si

Re: Automatic conversion to Range Query

2017-05-05 Thread Aman Deep Singh
Hi Erick, I don't want to do the range query , That is why I'm using the pattern replace filter to remove all the non alphanumeric to space so that this type of situation don't arrive,Since end user can query anything, also in the query I haven't mention any range related keyword (TO). If my query

Re: Automatic conversion to Range Query

2017-05-05 Thread Erick Erickson
I'm going to go a little sideways and claim this is an "XY" problem, the range bits are a side-issue. The problem is that you're trying to do ranges on textual data that are really numbers. So even if there's a way to fix the range issue you're talking about, it still won't do what you expect. Con

Re: Couldn't decorate docValues for field message in logs

2017-05-05 Thread Erick Erickson
Hmm, not sure about the annoying message... The bit about searching docValues=true, indexed=false is a bit trappy. It works, it even works OK on small data sets. But it's conceptually a "table scan" and as soon as your corpus reaches any decent size it'll perform terribly. Best, Erick On Fri, Ma

Regarding rule based replica placement in Solr

2017-05-05 Thread Suresh Pendap
Hi, I read the documentation about this feature over here. I did not get a very clear understanding of how this feature works and how I can configure it in Solr. This is what I would like to achieve and am wonderin

Re: Joining more than 2 collections

2017-05-05 Thread Zheng Lin Edwin Yeo
Thanks for the explanation. Yes, all my join keys are the same, so I think both should be ok too. All my 3 collections have a lot of records, but for my last collection, I'm only extracting a few of the fields (about 5) to be shown. So does this considered that I have three very large joins? Re

Re: Query based on output from another collection

2017-05-05 Thread Zheng Lin Edwin Yeo
Meaning most likely gatherNodes will not be able to satisfy the requirements? Is there any thing in Solr that allow us to stored output into things like memory variable, and use that variable to pass it as an input for the next part of the query? Regards, Edwin On 5 May 2017 at 23:39, Joel Berns

Solrj Client Problem - Bad Return type

2017-05-05 Thread Jacob Dunn
Hi. I'm working on a Java application that takes some object metadata and inputs it using Solrj. I'm getting a bad return type error message. Here's my current code: [cid:image004.png@01D2C58B.8A6708E0] And here's the error I'm receiving: [cid:image005.png@01D2C58D.1DCB77A0] I'm setting up th

Re: Query based on output from another collection

2017-05-05 Thread Joel Bernstein
Yeah gatherNodes just supports one key traversals. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, May 5, 2017 at 10:54 AM, Zheng Lin Edwin Yeo wrote: > Hi Joel, > > I may have several fields which requires to pass, so not sure if this will > works? > Eg: I may have field1a, field1b, field

Re: Joining more than 2 collections

2017-05-05 Thread Joel Bernstein
*:* queries will work fine for the innerJoin, which is a merge join that never runs out of memory. The hashJoin read the entire "hashed" query into memory though, so there are limitations. So if you have three very large joins that require *:* then the hashJoin approach will be problematic. In tha

Re: Solr parallel streaming

2017-05-05 Thread Joel Bernstein
Worker nodes can be part of any Solr collection. They do not have to have hold any data. If you have a collection with 3 shards, each with 3 replicas, than you have 9 potential workers in that collection. Currently the rule is if you specify N workers the worker collection must have atleast that m

Re: Joining more than 2 collections

2017-05-05 Thread Zheng Lin Edwin Yeo
I found that using *:* will return the entire resultset, and cause the result from the join query to blow up. Like if from the query, there are 2 results in collection1, and 3 results in collection2, I found that there could be 6 results that will be returned in the join query (using hashJoin or i

Solr parallel streaming

2017-05-05 Thread Mikhail Ibraheem
Hi, I am confused about the difference between solr nodes and worker nodes, are these the same? Can I have more than one worker nodes in one solr node? If we have one solr node, can we use parallel streams? What it means to have worker collection that has no data? Please advise. Thank

Re: Search substring in field

2017-05-05 Thread Emir Arnautovic
Hi, I would start from https://cwiki.apache.org/confluence/display/solr/Understanding+Analyzers%2C+Tokenizers%2C+and+Filters And this https://cwiki.apache.org/confluence/display/solr/Solr+Field+Types And after that https://cwiki.apache.org/confluence/display/solr/Query+Syntax+and+Parsing P

Re: Query based on output from another collection

2017-05-05 Thread Zheng Lin Edwin Yeo
Hi Joel, I may have several fields which requires to pass, so not sure if this will works? Eg: I may have field1a, field1b, field1c from collection1 which requires to be pass to field2a, field2b and field2c from collection2 respectively. And yes, my /export is working now. Regards, Edwin On 5

Search substring in field

2017-05-05 Thread jnobre
Hello, I am making an application that makes queries to solr. But I still can not figure out what the querys syntax is. I have a field that is a string. I want to do 3 operations. 1 - Search for a subtring, an exact expression in the string. Same as "" google search operator. Example: se

Re: Query based on output from another collection

2017-05-05 Thread Joel Bernstein
But /export has to be working to support the gatherNodes() expression. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, May 5, 2017 at 7:40 AM, Joel Bernstein wrote: > This may work for you: > https://cwiki.apache.org/confluence/display/solr/Graph+Traversal > > Joel Bernstein > http://joels

Re: Query based on output from another collection

2017-05-05 Thread Joel Bernstein
This may work for you: https://cwiki.apache.org/confluence/display/solr/Graph+Traversal Joel Bernstein http://joelsolr.blogspot.com/ On Fri, May 5, 2017 at 5:39 AM, Zheng Lin Edwin Yeo wrote: > Hi, > > I'm using Solr 6.4.2. > > Is it possible to do a query based on output from another collectio

Re: in-place atomic updates for numeric docValue field

2017-05-05 Thread Rick Leir
Dan, If you don't mind using a float format DocValues field, you could store NAN (Not A Number). But any sorting operations would be slower, or size comparisons. Floats might be entirely inappropriate, but I thought it is worth a mention. Cheers -- Rick On 2017-05-04 10:55 AM, Dan . wrote: H

Couldn't decorate docValues for field message in logs

2017-05-05 Thread Yago Riveiro
Hi, I have a field type in my schema configured as: The goal of this field type is allow fields to be faceted and displaying data is necessary. The field be searchable is not a requisite. While I'm indexing data I have this annoying warning in logs: Couldn't decorate docValues for field: [fie

Re: Export endpoint broken in solr 6.5.1?

2017-05-05 Thread Yago Riveiro
Joel, Thank for the advice, indeed the /export handler was referenced in the config. The streaming expression is working. - Best regards /Yago -- View this message in context: http://lucene.472066.n3.nabble.com/Export-endpoint-broken-in-solr-6-5-1-tp4333416p4333504.html Sent from the Sol

Query based on output from another collection

2017-05-05 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr 6.4.2. Is it possible to do a query based on output from another collection in Solr Streaming, so that the entire query can be passed at one go? For example, I have a field called reference_s in collection1. In collection2, I will need to pass what is in the reference_s from c

Re: in-place atomic updates for numeric docValue field

2017-05-05 Thread Emir Arnautovic
Hi Dan, In-place updates are working because index size does not change. Atomic (or any other updates) are flagging existing doc as deleted and writing it again, so even if it removes some fields, such updates are making index larger until segment with deleted doc is merged. In-place updates

How can I group on two different fields

2017-05-05 Thread donjose
How can I group on two different fields for eg:) gorup.field = assetid,categoryid -- View this message in context: http://lucene.472066.n3.nabble.com/How-can-I-group-on-two-different-fields-tp4333490.html Sent from the Solr - User mailing list archive at Nabble.com.

Search inside grouping list

2017-05-05 Thread donjose
Please help me to find a solution to search inside grouped result. Grouped Result - One assetid (32) with 2 different color blue & red === grouped: { assetid: { matches: 619, ngroups: 32, groups: [ { groupValue: 32, doclist: { numFound: 2, start: 0, docs: [