Re: Strategies for effective prefix queries?

2014-07-16 Thread Alexandre Rafalovitch
I guess you did not bother clicking through the link then, because that's exactly the filter I was using. :-) I am glad you found it this way. You can also find the full list of filters and tokenizers at: http://www.solr-start.com/info/analyzers/ Regards, Alex. Personal: http://www.outerthough

Re: Strategies for effective prefix queries?

2014-07-16 Thread Hayden Muhl
Thank you Jorge. I didn't know about that filter. It's just what I was looking for. - Hayden On Wed, Jul 16, 2014 at 4:35 PM, Jorge Luis Betancourt Gonzalez < jlbetanco...@uci.cu> wrote: > Perhaps what you’re trying to do could be addressed by using the > EdgeNGramFilterFactory filter? For quer

Re: Script Transformer Help

2014-07-16 Thread pavan patharde
Thats a good idea Alexandre. I will try it and update the results.. Thanks. Pavan .P.Patharde Phone:9844626450 On Thu, Jul 17, 2014 at 10:08 AM, Alexandre Rafalovitch wrote: > Have you tried putting the transformer on the inner entity definition? > It's like a nested loop and you just put it i

Re: problem with replication/solrcloud - getting 'missing required field' during update intermittently (SOLR-6251)

2014-07-16 Thread Shalin Shekhar Mangar
Phew, thanks for tracking it down. On Thu, Jul 17, 2014 at 7:50 AM, Nathan Neulinger wrote: > FYI. We finally tracked down the problem at least 99.9% sure at this > point, and it was staring me in the face the whole time - just never > noticed: > > [{"id":"4b2c4d09-31e2-4fe2-b767-3868efbdcd

Re: Script Transformer Help

2014-07-16 Thread Alexandre Rafalovitch
Have you tried putting the transformer on the inner entity definition? It's like a nested loop and you just put it in the outer loop. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources: http://www.solr-start.com/ and @solrstart Solr popularizers community: https

Script Transformer Help

2014-07-16 Thread pavan patharde
Hi All, I have data-config.xml as below:Script Transformer is omitted. I am able to access NAME and BSIN in the function f1. I

Inconsistant result's on solr cloud 4.8

2014-07-16 Thread Cool Techi
Hi, We are using solr cloud with solr version 4.8, we have 2 shard/2 replica servers in Solr Cloud. During two consecutive request to the solr cloud, the "total results" number varies. 1) As per my understanding this can happen when the leader and the replica have inconsistant number of results

Re: problem with replication/solrcloud - getting 'missing required field' during update intermittently (SOLR-6251)

2014-07-16 Thread Nathan Neulinger
FYI. We finally tracked down the problem at least 99.9% sure at this point, and it was staring me in the face the whole time - just never noticed: [{"id":"4b2c4d09-31e2-4fe2-b767-3868efbdcda1","channel": {"add": "preet"},"channel": {"add": "adam"}}] Look at the JSON... It's trying to add t

Re: Strategies for effective prefix queries?

2014-07-16 Thread Jorge Luis Betancourt Gonzalez
Perhaps what you’re trying to do could be addressed by using the EdgeNGramFilterFactory filter? For query suggestions I’m using a very similar approach, this is an extract of the configuration I’m using: Basically this allows you to get partial matches from any part of the string, let’s s

Re: Strategies for effective prefix queries?

2014-07-16 Thread Alexandre Rafalovitch
Your first and last email seem to be contradicting. You said initially you wanted to search for "solr-u" and match that. Now you are saying you want to search "bo sm" and match that. Either way, I do have very similar scenario working in the project I sent you a link to. I am breaking on full-stop

Re: Memory leak for debugQuery?

2014-07-16 Thread Tomás Fernández Löbbe
Also, is this trunk? Solr 4.x? Single shard, right? On Wed, Jul 16, 2014 at 2:24 PM, Erik Hatcher wrote: > Tom - > > You could maybe isolate it a little further by seeing using the “debug" > parameter with values of timing|query|results > > Erik > > On May 15, 2014, at 5:50 PM, Tom Burt

Re: Upper or Lower Case

2014-07-16 Thread Ahmet Arslan
Hi, you need to put lowercase filter before kstem filter. Ahmet  On Wednesday, July 16, 2014 11:55 PM, "EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)" wrote: Hi , If I search 'Transmission Flush' it get the good match results, but when I use 'transmission flush' I get differen

Re: Memory leak for debugQuery?

2014-07-16 Thread Erik Hatcher
Tom - You could maybe isolate it a little further by seeing using the “debug" parameter with values of timing|query|results Erik On May 15, 2014, at 5:50 PM, Tom Burton-West wrote: > Hello all, > > I'm trying to get relevance scoring information for each of 1,000 docs > returned for

Upper or Lower Case

2014-07-16 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Hi , If I search 'Transmission Flush' it get the good match results, but when I use 'transmission flush' I get different order of results, I search for the Name column in the schema and it has below config for the field type. Any clue what is wrong or is there any Conf changes need to get the s

Re: Updating Oracle

2014-07-16 Thread Shawn Heisey
On 7/16/2014 1:45 PM, Jason Bourne wrote: > I am new to Solr so I just want to know is something is possible. I might > need some help coding later on after taking the tutorials. > > I am taking over a program that uses html and java script to dislay metadata > from solr. They now would like to u

Shard Replicas not getting replicated data from leader

2014-07-16 Thread Marc Campeau
Hi, I have setup 4 Solr (4.9.0) Nodes into a single shard for a given collection, meaning I should have 4 replicated nodes. I have 3 Zookeepers in ensemble managing the configs for this collection. I have a load balancer in front of the 4 nodes to split traffic between them. I start this collecti

Updating Oracle

2014-07-16 Thread Jason Bourne
Hi, I am new to Solr so I just want to know is something is possible. I might need some help coding later on after taking the tutorials. I am taking over a program that uses html and java script to dislay metadata from solr. They now would like to update one field. The solr db gets refeshed we

Re: questions on Solr WordBreakSolrSpellChecker and WordDelimiterFilterFactory

2014-07-16 Thread Diego Fernandez
Which tokenizer are you using? StandardTokenizer will split "x-box" into "x" and "box", same as "x box". If there's not too many of these, you could also use the PatternReplaceCharFilterFactory to map "x box" and "x-box" to "xbox" before the tokenizer. Diego Fernandez - 爱国 Software Engineer U

Re: Using hundreds of dynamic fields

2014-07-16 Thread Jack Krupansky
I guess I'm just a big fan of simpler and cleaner data models! Especially if I were to have to look at somebody's data model and try to make sense out of it, such as how to keep all the fields straight for constructing queries. But atomic update and the need to read and rewrite all the fields i

Re: Strategies for effective prefix queries?

2014-07-16 Thread Hayden Muhl
A copy field does not address my problem, and this has nothing to do with stored fields. This is a query parsing problem, not an indexing problem. Here's the use case. If someone has a username like "bob-smith", I would like it to match prefixes of "bo" and "sm". I tokenize the username into the

Re: clearing fieldValueCache in solr 4.6

2014-07-16 Thread IJ
One thing you could do is: 1. If you current index is called A1, then you can create a new index called A2 with the correct schema.xml / solrconfig.xml 2. Index your 18,000 documents into A2 afresh 3. Then delete A1 (the bad index) 4. Then quickly create an Alias with the name of A1 pointng to A2 -

Re: Solr irregularly having QTime > 50000ms, stracing solr cures the problem

2014-07-16 Thread IJ
I know u mentioned you have a single machine at play - but do you have multiple nodes on the machine that talk to one another ?? Does your problem recur when the load on the system is low ? Also faced a similar problem wherein the "5 second delay" (described in detail on my other post) kept happe

clearing fieldValueCache in solr 4.6

2014-07-16 Thread Matthew LeMay
Hello. We're just starting to use solr in production. We've indexed 18,000 documents or so. We've just implemented faceted search results. We mistakenly stored integer ids in what was meant to be a string field. So, our facet results are showing numbers instead of the textual values. Aft

Re: Using hundreds of dynamic fields

2014-07-16 Thread Andy Crossen
Thanks, Jack and Jared, for your input on this. I'm looking into whether parent-child relationships via block or query time join will meet my requirements. Jack, I noticed in a bunch of other posts around the web that you've suggested to use dynamic fields in moderation. Is this suggestion based

solr-4.9.0 : [OverseerExitThread] but has failed to stop it. This is very likely to create a memory leak

2014-07-16 Thread Vijayakumar Ramdoss
Hi, When I am starting the SolrCloud (4.9) version top of the Tomcat, its throwing the below error message, I am using the JAVA runtime for memory leak exception . Summary of error message, [OverseerExitThread] but has failed to stop it. This is very likely to create a memory leak Det

Re: Slow inserts when using Solr Cloud

2014-07-16 Thread Timothy Potter
Hi Ian, What's the CPU doing on the leader? Have you tried attaching a profiler to the leader while running and then seeing if there are any hotspots showing. Not sure if this is related but we recently fixed an issue in the area of leader forwarding to replica that used too many CPU cycles ineffi

Strange Scoring Results

2014-07-16 Thread Michael Carlson
Hey All - I’m a Solr newbie in need of some help. I’m using Apache Nutch to crawl a site and populate a Solr core, which we then use to query search results. I’ve got it all up and running, but the Solr scoring results I get don’t seem to make any sense. Let’s take the following query as an e

RE: questions on Solr WordBreakSolrSpellChecker and WordDelimiterFilterFactory

2014-07-16 Thread Dyer, James
Jia, I agree that for the spellcheckers to work, you need instead of . But the "x-box" => "xbox" example ought to be solved by analyzing using WordDelimiterFilterFactory and "catenateWords=1" at query-time. Did you re-index after changing your analysis chain (you need to)? Perhaps you can

Mixing ordinary and nested documents

2014-07-16 Thread Bjørn Axelsen
Hi Solr users I would appreciate your inputs on how to handle a *mix *of *simple *and *nested *documents in the most easy and flexible way. I need to handle: - simple documens: webpages, short articles etc. (approx. 90% of the content) - nested documents: books containing chapters etc.

RE: Solr score manager

2014-07-16 Thread Doug Turnbull
Shay this presentation I gave at apachecon and dc solr exchange might be useful to you: http://www.slideshare.net/mobile/o19s/hacking-lucene-for-custom-search-results Sent from my Windows Phone From: Shay Sofer Sent: ‎7/‎16/‎2014 6:03 AM To: solr-user@lucene.apache.org Subject: Solr score manager

Re: TrieDateField, precisionStep impact on sorting performance

2014-07-16 Thread Kuehn, Dennis
Thanks for clarifying! Dennis On 7/16/14 3:19 PM, "Yonik Seeley" wrote: >On Wed, Jul 16, 2014 at 5:51 AM, Kuehn, Dennis > wrote: >> I'd like to sort on a TrieDateField which currently has a precisionStep >>value of 6. >> From what I got so far, the precisionStep value only affects range >>que

Fwd: Solr score manager

2014-07-16 Thread Alexandre Rafalovitch
-- Forwarded message -- From: Shay Sofer Date: Wed, Jul 16, 2014 at 6:55 PM That’s my question :-) How should I manage this scoring system. I guess that I need to add new field (my_score) and update him as I want. -Original Message- From: Alexandre Rafalovitch [mailto

Re: Slow inserts when using Solr Cloud

2014-07-16 Thread ian
That's useful to know, thanks very much. I'll look into using CloudSolrServer, although I'm using solrnet at present. That would reduce some of the overhead - but not the extra 200ms I'm getting for forwarding to the replica when the replica is switched on. It does seem a very high overhead.

Re: TrieDateField, precisionStep impact on sorting performance

2014-07-16 Thread Yonik Seeley
On Wed, Jul 16, 2014 at 5:51 AM, Kuehn, Dennis wrote: > I'd like to sort on a TrieDateField which currently has a precisionStep value > of 6. > From what I got so far, the precisionStep value only affects range query > performance and index size. > > However, the documentation for TrieDateField

Re: questions on Solr WordBreakSolrSpellChecker and WordDelimiterFilterFactory

2014-07-16 Thread Ahmet Arslan
Hi Jia, What happens when you use    instead of    Ahmet On Wednesday, July 16, 2014 3:07 AM, "j...@ece.ubc.ca" wrote: Hello everyone :) I have a product called "xbox" indexed, and when the user search for either "x-box" or "x box" i want the "xbox" product to be returned.  I'm new to S

Re: Solr score manager

2014-07-16 Thread Alexandre Rafalovitch
How are you storing this information in your documents? Regards, Alex On 16/07/2014 5:03 pm, "Shay Sofer" wrote: > Hi All, > > I need a specific score mechanism. > > I would like to sort my results based on customize scoring field. > scoring for example - > > > > 1. If this is a new ob

Solr score manager

2014-07-16 Thread Shay Sofer
Hi All, I need a specific score mechanism. I would like to sort my results based on customize scoring field. scoring for example - 1. If this is a new object - 100 2. Edited - 80 3. Recent search - 50 4. Opened - 40 and some more actions... And then when execute a n

TrieDateField, precisionStep impact on sorting performance

2014-07-16 Thread Kuehn, Dennis
Hello, I'd like to sort on a TrieDateField which currently has a precisionStep value of 6. >From what I got so far, the precisionStep value only affects range query >performance and index size. However, the documentation for TrieDateField says: 'precisionStep="0" enables efficient date sorting

weird drastic query latency during performance testing and DIH import delay after performance testing

2014-07-16 Thread YouPeng Yang
Hi I build my SolrCloud using Solr 4.6.0 (java version:1.7.0_45). In my cloud,I have a collection with 30 shard,and each shard has one replica. each core of the shard contains nearly 50 million docs that is 15GB in size,so does the replica. Before applying my cloud in the real world,I do a pe