Re: Way for DataImportHandler to use bind variables

2018-05-03 Thread Mikhail Khludnev
DIH does string replacement https://github.com/apache/lucene-solr/blob/8b9c2a3185d824a9aaae5c993b872205358729dd/solr/contrib/dataimporthandler/src/java/org/apache/solr/handler/dataimport/SqlEntityProcessor.java#L73 Hard refactoring is required to make it use preparedStatement. However there shou

Re: Autocomplete returning shingles

2018-05-03 Thread Federico Méndez
Can you just add the SingleFilter to your field? https://lucene.apache.org/solr/guide/6_6/filter-descriptions.html#FilterDescriptions-ShingleFilter On Wed, May 2, 2018 at 2:04 PM, O. Klein wrote: > I need to use autocomplete with edismax (ngrams,edgegrams) to return > shingled > sugges

Re: User queries end up in filterCache if facetting is enabled

2018-05-03 Thread Mikhail Khludnev
Enum facets, facet refinements and https://lucene.apache.org/solr/guide/6_6/query-settings-in-solrconfig.html comes to my mind. On Wed, May 2, 2018 at 11:58 PM, Markus Jelsma wrote: > Hello, > > Anyone here to reproduce this oddity? It shows up in all our collections > once we enable the stats p

Re: User queries end up in filterCache if facetting is enabled

2018-05-03 Thread Mikhail Khludnev
I mean https://lucene.apache.org/solr/guide/6_6/query-settings-in-solrconfig.html#QuerySettingsinSolrConfig-useFilterForSortedQuery On Thu, May 3, 2018 at 10:42 AM, Mikhail Khludnev wrote: > Enum facets, facet refinements and https://lucene.apache.org/ > solr/guide/6_6/query-settings-in-solrcon

RE: User queries end up in filterCache if facetting is enabled

2018-05-03 Thread Markus Jelsma
Thanks Mikhail, But i thought about that setting too, but i do sort by score, as does Solr /select handler by default. The enum method accounts for all the values for a facet field, but not the user queries i see ending up in the cache. Any other suggestions to shed light on this oddity? Thank

Re: Autocomplete returning shingles

2018-05-03 Thread Alessandro Benedetti
So, your problem is you want to return shingle suggestions from a field in input but apply multiple filter queries to the documents you want to fetch suggestions from. Are you building an auxiliary index for that ? You need to design it accordingly. If you want to to map each suggestion to a singl

Re: Regarding LTR feature

2018-05-03 Thread Alessandro Benedetti
Mmmm, first of all, you know that each Solr feature is calculated per document right ? So you want to calculate the payload score for the document you are re-ranking, based on the query ( your External Feature Information) and normalize across the different documents? I would go with this feature

Re: Shard size variation

2018-05-03 Thread Michael Joyner
We generally try not to change defaults when possible, sounds like there will be new default settings for the segment sizes and merging policy? Am I right in thinking that expungeDeletes will (in theory) be a 7.4 forwards option? On 05/02/2018 01:29 PM, Erick Erickson wrote: You can always

RE: User queries end up in filterCache if facetting is enabled

2018-05-03 Thread Markus Jelsma
By the way, the queries end up in the filterCache regardless of the value set in useFilterForSortedQuery. Thanks, Markus -Original message- > From:Markus Jelsma > Sent: Thursday 3rd May 2018 12:05 > To: solr-user@lucene.apache.org; solr-user > Subject: RE: User queries end up in filter

Solr question about deleting core permanently

2018-05-03 Thread Alexey Ponomarenko
Hi, I have a question https://stackoverflow.com/questions/50150507/how-can-i-delete-all-fields-after-corecollection-was-deleted can you help me? This is regarding deleting solr core permanently.

Re: Shard size variation

2018-05-03 Thread Erick Erickson
"We generally try not to change defaults when possible, sounds like there will be new default settings for the segment sizes and merging policy?" usually wise. No, there won't be any change in the default settings. What _will_ change is the behavior of a forceMerge (aka optimize) and expungeDele

solrj (admin) requests

2018-05-03 Thread Arturas Mazeika
Hi Solr Team, Short question: How can I systematically explore the solrj functionality/API? Long question: I am discovering solrj functionality and I am pretty much impressed what solrj can do. What I am less impressed is my knowledge how to find what I am looking for. On the positive side, one

Re: SolrCloud replicaition

2018-05-03 Thread Erick Erickson
Shalin's right, I was hurried in my response and forgot that the min_rf just _allows_ the client to figure out that the update didn't get updated on enough replicas and the client has to "do the right thing" with that information, thanks Shalin! Right, your scenario is correct. When the follower g

Re: solrj (admin) requests

2018-05-03 Thread Erick Erickson
Yeah, that can be a pain. Unfortunately there's no official "programming guide" for instance. What there is, however, is an extensive suite of unit tests in /Users/Erick/apache/solrJiras/master/solr/solrj/src/test/org/apache/solr/client/solrj. >From there it's often a hunt though. Best, Erick On

Re: SolrCloud replicaition

2018-05-03 Thread Greenhorn Techie
Perfect! Thanks Erick and Shalin!! On 3 May 2018 at 16:13:06, Erick Erickson (erickerick...@gmail.com) wrote: Shalin's right, I was hurried in my response and forgot that the min_rf just _allows_ the client to figure out that the update didn't get updated on enough replicas and the client has to

inconsistent results

2018-05-03 Thread Satya Marivada
Hi there, We have a solr (6.3.0) index which is being re-indexed every night, it takes about 6-7 hours for the indexing to complete. During the time of re-indexing, the index becomes flaky and would serve inconsistent count of documents 70,000 at times and 80,000 at times. After the indexing is co

Re: Learning to Rank (LTR) with grouping

2018-05-03 Thread Diego Ceccarelli
Thanks ilayaraja, I updated the PR today integrating your and Alan's comments. Now it works also in distributed mode. Please let me know what do you think :) Cheers Diego On Wed, May 2, 2018, 17:46 ilayaraja wrote: > Figured out that offset is used as part of the grouping patch which I > appli

Re: inconsistent results

2018-05-03 Thread Erick Erickson
The short for is that different replicas in a shard have different commit point if you go by wall-clock time. So during heavy indexing, you can happen to catch the different counts. That really shouldn't happen, though, unless you're clearing the index first on the assumption that you're replacing

Re: inconsistent results

2018-05-03 Thread Satya Marivada
Yes, we are doing clean and full import. Is it not supposed to serve old(existing) index till the new index is built and then do a cleanup, replace old index after new index is built? Would a full import without clean not give this problem? Thanks Erick, this would be useful. On Thu, May 3, 2018

Re: inconsistent results

2018-05-03 Thread Shawn Heisey
On 5/3/2018 12:55 PM, Satya Marivada wrote: > We have a solr (6.3.0) index which is being re-indexed every night, it > takes about 6-7 hours for the indexing to complete. During the time of > re-indexing, the index becomes flaky and would serve inconsistent count of > documents 70,000 at times and

Re: solrj (admin) requests

2018-05-03 Thread Shawn Heisey
On 5/3/2018 9:07 AM, Arturas Mazeika wrote: > Short question: > > How can I systematically explore the solrj functionality/API? As Erick said, there is not an extensive programming guide.  The javadocs for SolrJ classes are pretty decent, but figuring out precisely what the response objects actual

Re: Regarding LTR feature

2018-05-03 Thread prateek . agarwal
Thanks again Alessandro I tried with the feature and the Minmax normalizer you told.But then there is a slight problem with the params in normalization. I don't really know the range(Min, Max) of values the payload_score outputs and they are different for different queries. I even tried lookin

Re: Regarding LTR feature

2018-05-03 Thread prateek . agarwal
Thanks again Alessandro I tried with the feature and the Minmax normalizer you told.But then there is a slight problem with the params in normalization. I don't really know the range(Min, Max) of values the payload_score outputs and they are different for different queries. I even tried lookin

the number of docs in each group depends on rows

2018-05-03 Thread fatduo
Hi, We used Solr Cloud 7.1.0(3 nodes, 3 shards with 2 replicas). When we used group query, we found that the number of docs in each group depends on the rows number(group number). difference: when the rows bigger then 5, the retur