Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Jack Krupansky
Your bad experience seems to have occurred because you chose to use all default values for the WDF attributes. In particular, the generateWordParts and generateNumberParts attributes default to "1" (true), resulting in the discrete "abc", "123", and "xyz" tokens, and the catenateAll attribute d

Postingshighlighter with prefix queries

2014-05-17 Thread Puneet Pawaia
Hi all Postingshighlighter in Solr 4.7 is supposed to be able to highlight prefix queries. However you are supposed to subclass it and override getAnalyzer to that used at index time. Any examples to show how this is done when using Solr? Regards Puneet

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Ahmet Arslan
Hi Roman, I was referring to this https://issues.apache.org/jira/browse/LUCENE-2878 ticket. Ahmet On Saturday, May 17, 2014 5:50 PM, Roman Chyla wrote: Hi, What will replace spans, if spans are nuked ? Roman On 17 May 2014 09:15, "Ahmet Arslan" wrote: > Hi, > > > Payloads are used to stor

Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Alexandre Rafalovitch
My understanding was that the lower-case and other things happen on per-field basis and is a step after the dismax formula is applied. In this case, however, this seems to be happening before: DisjunctionMaxQuerywdText:abc123xyz wdText:abc) wdText:123 wdText:xyz) Hence to question to someone w

RE: Question regarding the lastest version of HeliosSearch

2014-05-17 Thread Jean-Sebastien Vachon
Thanks for the information Yonik. > -Original Message- > From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik > Seeley > Sent: May-16-14 8:52 PM > To: solr-user@lucene.apache.org > Subject: Re: Question regarding the lastest version of HeliosSearch > > On Thu, May 15, 20

Re: deep paging without sorting / keep IRs open

2014-05-17 Thread Yonik Seeley
On Sat, May 17, 2014 at 10:30 AM, Yonik Seeley wrote: > I think searcher leases would fit the bill here? > https://issues.apache.org/jira/browse/SOLR-2809 > > Not yet implemented though... FYI, I just put up a simple LeaseManager implementation on that issue. -Yonik http://heliosearch.org - face

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Jack Krupansky
I hate to say this, but if you have to ask, then it is highly likely that the feature is inappropriate for you. It may in fact be true that Payload is precisely the feature you need, but Solr support for this Lucene feature is rather limited, so you may have to do a lot of work on your own to

Re: deep paging without sorting / keep IRs open

2014-05-17 Thread Yonik Seeley
On Wed, May 14, 2014 at 8:34 AM, Tommaso Teofili wrote: > Basically I need the ability to keep running searches against a specified > commit point / index reader / state of the Lucene / Solr index. I think searcher leases would fit the bill here? https://issues.apache.org/jira/browse/SOLR-2809 N

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Roman Chyla
Hi, What will replace spans, if spans are nuked ? Roman On 17 May 2014 09:15, "Ahmet Arslan" wrote: > Hi, > > > Payloads are used to store arbitrary data along with terms. You can > influence score with these arbitrary data. > See : > http://sujitpal.blogspot.com.tr/2013/07/porting-payloads-to-so

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Ahmet Arslan
Hi, I forget to include Grant's write-up: http://searchhub.org/2009/08/05/getting-started-with-payloads/ On Saturday, May 17, 2014 3:53 PM, Ahmet Arslan wrote: Hi, Payloads are used to store arbitrary data along with terms. You can influence score with these arbitrary data. See : http://

Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Michael Sokolov
Alex - the query parsers generally accept an analyzer, which they must apply after they perform their own tokenization. Consider: how would a capitalized query term match lower-cased terms in the index without query analysis? -Mike On 5/17/2014 4:05 AM, Alexandre Rafalovitch wrote: Hello,

Re: Solr performance: multiValued filed vs separate fields

2014-05-17 Thread Yonik Seeley
On Thu, May 15, 2014 at 10:29 AM, danny teichthal wrote: > I wonder about performance difference of 2 indexing options: 1- multivalued > field 2- separate fields > > The case is as follows: Each document has 100 “properties”: prop1..prop100. > The values are strings and there is no relation betwee

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Ahmet Arslan
Hi, Payloads are used to store arbitrary data along with terms. You can influence score with these arbitrary data. See : http://sujitpal.blogspot.com.tr/2013/07/porting-payloads-to-solr4.html But remember that there is an ongoing work to nuke Spans. Ahmet On Saturday, May 17, 2014 8:24 AM,

Re: retreive all the fields in join

2014-05-17 Thread Ahmet Arslan
Hi Karanti, I was thinking the same. DocTransformer looks a good candidate for such implementation. What do you think? May be we can implement this and contribute back? On Saturday, May 17, 2014 9:13 AM, Kranti Parisa wrote: Aman, The option you have got is: - write custom components l

Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Alexandre Rafalovitch
Hello, I am getting weird results that seem to come from eDisMax using analyzer chain to break the input text. I have WordDelimiterFilterFactory in my chain, which does a lot of interesting things I did not expect query parser to be involved in. Specifically, the string "abc123XYZ" gets split int

Re: Solr Commiter

2014-05-17 Thread Shawn Heisey
On 5/15/2014 6:10 AM, Mukundaraman valakumaresan wrote: > How to become a solr committer? Any suggestions? For me, this question has personal relevance. In 2010, I began to integrate Solr into our environment. I joined the mailing list, asked questions, stumbled around quite a lot. Eventually I

Re: Solrj problem

2014-05-17 Thread Shawn Heisey
On 5/7/2014 7:41 AM, blach wrote: > according to this : https://issues.apache.org/jira/browse/SOLR-5590 > > I understand that solrj is still depends on the old httpclient shipped with > android tools, and this is my problem too. KARL has made an patch, could you > please explain what that patch fo