Re: Replacing FAST functionality at sesam.no - ShingleFilter+exact matching

2008-09-09 Thread Mck
> Looks to me like MultiPhraseQuery is getting in the way. Shingles that begin at the same word are given the same position by ShingleFilter, and Solr's FieldQParserPlugin creates a MultiPhraseQuery when it encounters tokens in a query with the same position. I think what you want is to
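
The position behaviour described in the quote above can be illustrated with a small model. This is only a sketch in plain Python, not Lucene or Solr code: `shingles_with_positions` and `group_by_position` are hypothetical helper names, and the position numbering simply assumes that every shingle takes the position of its first word, as the quoted message says.

```python
from collections import defaultdict


def shingles_with_positions(text, max_size=2):
    """Return (position, shingle) pairs; a shingle's position is that of its first word."""
    tokens = text.split()
    out = []
    for pos in range(len(tokens)):
        # Emit every shingle of 1..max_size words that starts at this word.
        for size in range(1, min(max_size, len(tokens) - pos) + 1):
            out.append((pos, " ".join(tokens[pos:pos + size])))
    return out


def group_by_position(positioned):
    """Group terms that share a position, in position order."""
    groups = defaultdict(list)
    for pos, term in positioned:
        groups[pos].append(term)
    return [groups[p] for p in sorted(groups)]


groups = group_by_position(shingles_with_positions("abcd efgh ijkl"))
for alternatives in groups:
    print(alternatives)
```

Each inner list holds terms that share one position. A query parser that sees several terms at one position treats them as alternatives, which is how a multi-term phrase query arises instead of a single exact-match term.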

RE: Re: Replacing FAST functionality at sesam.no - ShingleFilter+exact matching

2008-09-09 Thread Steven A Rowe
Hi mck, On 09/09/2008 at 12:58 PM, Mck wrote: > > *ShortVersion* > > is there a way to make the ShingleFilter perform exact matching via > > inserting ^ $ begin/end markers? > > Reading through the mailing list i see how exact matching can > be done, a la STFW to myself... > > So the ShortVersi

Re: Replacing FAST functionality at sesam.no

2008-09-08 Thread Otis Gospodnetic
it of glue to use it in Solr. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Mck <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, September 8, 2008 4:43:50 AM > Subject: Re: Replacing FAST functionalit

Re: Replacing FAST functionality at sesam.no

2008-09-08 Thread Mck
> I'm not very familiar with shingles but it seems to me that you should have ShingleFilter at index time and make the query as a phrase query? Then the entry "abcd efgh ijkl" would be indexed as (abcd "abcd efgh" "abcd efgh ijkl" efgh "efgh ijkl" ijkl) and a subsequent query "abcd" would retu
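
The enumeration of index terms given above can be reproduced with a short sketch. This is plain Python rather than the actual ShingleFilter; the function name `shingles` is hypothetical, and the `max_size` default of 99 mirrors the maxShingleSize="99" setting mentioned elsewhere in the thread.

```python
def shingles(text, max_size=99):
    """List every run of 1..max_size consecutive words, grouped by starting word."""
    tokens = text.split()
    out = []
    for start in range(len(tokens)):
        # Longest shingle starting here is capped by max_size and by the end of the text.
        for size in range(1, min(max_size, len(tokens) - start) + 1):
            out.append(" ".join(tokens[start:start + size]))
    return out


terms = shingles("abcd efgh ijkl")
print(terms)
# ['abcd', 'abcd efgh', 'abcd efgh ijkl', 'efgh', 'efgh ijkl', 'ijkl']
```

With these terms indexed, a query for "abcd" matches this entry because "abcd" is one of its shingles, which is the effect the message goes on to describe.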

Re: Replacing FAST functionality at sesam.no

2008-09-08 Thread Shalin Shekhar Mangar
I'm not very familiar with shingles but it seems to me that you should have ShingleFilter at index time and make the query as a phrase query? On Mon, Sep 8, 2008 at 1:00 PM, Mck <[EMAIL PROTECTED]> wrote: > > So then i change type="string" to type="shingleString" along with > > > [snip] > > >

Re: Replacing FAST functionality at sesam.no

2008-09-08 Thread Mck
> So then i change type="string" to type="shingleString" along with > > [snip] > > > > > > > outputUnigramIfNoNgram="true" maxShingleSize="99" /> > > Debugging ShingleFilter I see that without quotes the shingles StringBuffer array consists of just the current token

Re: Replacing FAST functionality at sesam.no

2008-09-06 Thread Mck
> but Mick Semb Wever will be taking over this job for the next two weeks. back from holidays and taking over where Glenn-Erik left. i'm very new to Solr so please bear with me, i'll run through our setup from scratch. Our test list has 9 entries: "abcd efgh ijkl", "abcd efgh", "efgh ijkl", "a

Re: Replacing FAST functionality at sesam.no

2008-08-28 Thread Glenn-Erik
>In order to do this, you can't use the ShingleFilter during indexing >since a document like "one two three" and a query like "one two four" >will match since they have the shingle "one two" in common. Hello Svein, nice to meet you in this place =) I have been trying with and without and also
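
The point quoted above, that shingling on both the index and the query side lets "one two three" match "one two four", can be checked with a small sketch. It is plain Python, not Solr; `shingle_set` is a hypothetical helper, and it assumes unigrams are emitted alongside the longer shingles.

```python
def shingle_set(text):
    """Return the set of all word n-grams (including unigrams) in text."""
    tokens = text.split()
    return {
        " ".join(tokens[i:j])
        for i in range(len(tokens))
        for j in range(i + 1, len(tokens) + 1)
    }


doc_terms = shingle_set("one two three")
query_terms = shingle_set("one two four")

# Any shared term is enough for the document to match the query.
common = doc_terms & query_terms
print(sorted(common))
# ['one', 'one two', 'two']
```

Because the two sets overlap, a match is reported even though the full phrases differ, which is why the quoted advice is to avoid ShingleFilter at index time when only whole-entry matches are wanted.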

Re: Replacing FAST functionality at sesam.no

2008-08-28 Thread Glenn-Erik
> The screenshot didn't make it (some attachments gets stripped) I have put the screenshots here: http://www.glennerik.com/solr/solrshingle1.gif and here: http://www.glennerik.com/solr/solrshingle2.gif I also put the schema.xml here: http://www.glennerik.com/solr/schema.xml > This sounds very

Re: Replacing FAST functionality at sesam.no

2008-08-27 Thread Svein Parnas
On 27. aug. 2008, at 19.44, Glenn-Erik Sandbakken wrote: At sesam.no we want to replace a FAST (fast.no) Query Matching Server with a Solr index. The index we are trying to replace is not a regular index, but specially configured to perform phrase (and sub-phrase) matches against severa

Re: Replacing FAST functionality at sesam.no

2008-08-27 Thread Otis Gospodnetic
The screenshot didn't make it (some attachments get stripped) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Glenn-Erik Sandbakken <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Wednesday, August 27, 2008 1:44:53 PM > S