How to get the query content in DefaultSimilarity class?

2015-04-15 Thread Xi Shen
my search server. The binary content can be transformed into "text" like document for inverted index. But I still need to whole "query document" to compute the similarity score. Any suggestions? Thanks, [image: --] Xi Shen [image: http://]about.me/davidshen <http://about.me/d

Re: How to get/set customized Solr data source properties?

2013-05-01 Thread Xi Shen
Hi Hoss, I reviewed the code from other DataSouce classes as well, that's how I learned it should work. And this is my actual code. I create this DataSource for testing my ideas. I am blocked at the very beginning...sucks :( On Saturday, April 27, 2013, Chris Hostetter wrote: > : > : I am worki

Re: How to let Solr load libs from within my JAR?

2013-04-24 Thread Xi Shen
> Appinions > 18 East 41st Street, 2nd Floor > New York, NY 10017-6271 > > www.appinions.com > > Where Influence Isn’t a Game > > > On Tue, Apr 23, 2013 at 9:37 PM, Xi Shen wrote: > > Hi, > > > > I developed a

How to let Solr load libs from within my JAR?

2013-04-23 Thread Xi Shen
Hi, I developed a data import handler, it has some dependent libraries. I deployed them in a parallel folder with my JAR and included the path in solrconfig.xml. It works fine. But I am thinking maybe I can pack those JAR libs within my JAR, but I got NoClassDefFoundError exception when executing

How to get/set customized Solr data source properties?

2013-04-23 Thread Xi Shen
Hi, I am working on a DataSource implementation. I want to get some customized properties when the *DataSource.init* method is called. I tried to add the attributes in the *dataSource* element, but I cannot read it from the *initProps* object. Either I added the attribute in a wrong element, or I

Which token filter can combine 2 terms into 1?

2012-12-20 Thread Xi Shen
Hi, I am looking for a token filter that can combine 2 terms into 1? E.g. the input has been tokenized by white space: t1 t2 t2a t3 I want a filter that output: t1 t2t2a t3 I know it is a very special case, and I am thinking about develop a filter of my own. But I cannot figure out which API

Re: duplicated URL sent from Nutch to solr index

2012-12-03 Thread Xi Shen
Then the "URL" must be the same. On Mon, Dec 3, 2012 at 2:34 PM, Joe Zhang wrote: > Sorry I didn't make it perfectly clear. The "id" field is URL. > > On Sun, Dec 2, 2012 at 11:33 PM, Joe Zhang wrote: > > > Thanks! > > > > > > On

Re: behavior of solr.KeepWordFilterFactory

2012-12-03 Thread Xi Shen
splitOnCaseChange="1"/> > > > > > protected="protwords.txt"/> > > > > > > > > > > > > On Sun, Dec 2, 2012 at 1

Re: behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Xi Shen
By the definition on https://lucene.apache.org/solr/api-3_6_1/org/apache/solr/analysis/KeepWordFilter.html, I am pretty sure it is the correct behavior of this filter :) I guess you are trying to this filter to index some special words in Chinese? On Mon, Dec 3, 2012 at 1:54 PM, Joe Zhang wrote

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Xi Shen
If the value for "id" field is the same, the old entry will be update; if it is new, a new entry will be created & indexed. This is my experience. :) On Mon, Dec 3, 2012 at 1:45 PM, Joe Zhang wrote: > Dear list, > > I just want to confirm an expected behavior of solr: > > Assuming we have " id