The screenshot didn't make it.... (some attachments gets stripped)

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Glenn-Erik Sandbakken <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Wednesday, August 27, 2008 1:44:53 PM
> Subject: Replacing FAST functionality at sesam.no
> 
> At sesam.no we want to replace a FAST (fast.no) Query Matching Server
> with a Solr index.
> 
> The index we are trying to replace is not a regular index, but specially
> configured to perform phrases (and sub-phrases) matches against several
> large lists (like an index with only a 'title' field).
> 
> I'm not sure of a correct, or logical, name for the behavior we are
> after, but it is like a combination between Shingles and exact matching.
> 
> Some examples should explain it well.
> 
> Lets say we have the following list:
> > one two three
> > one two
> > two three
> > one
> > two
> > three
> > three two
> > two one
> > one three
> > three one
> 
> For the query "one two three", we need hits against, and only against:
> > one two three
> > one two
> > two three
> > one
> > two
> > three
> 
> For the query "one two", we need hits against, and only against:
> > one two
> > one
> > two
> 
> For the query "one three four" (or "four one three"), we need hits
> against, and only against:
> > one three
> > one
> > three
> 
> For the query "one two sesam three", we need hits against, and only
> against:
> > one two
> > one
> > two
> > three
> 
> We have been testing out solr with the ShingleFilter for this, but
> without luck.
> I am unsure whether the reason is misconfiguration in schema.xml or that
> the ShingleFilter actually don't support this type of behavior.
> Attached our current schema.xml
> (it is different from when I made this post to the solr-dev mailinglist,
> the shingle "fieldType" is of class "solr.StrField")
> Attached is screenshots of the solr/admin/analysis.jsp against this
> configuration.
> 
> I'd like to know if the SchingleFilter is at all able to do what we
> want.
> If it is: How can I configure schema.xml?
> If not: does there exist any other solutions that we can incorporate
> into solr which will give us this behavior?
> 
> If there is no existing solution to this, we will probably end up
> writing our own methods for it, extending the ShingleFilter, gladly
> contributing to the solr project =)
> 
> Thanks for a great product,
> Glenn-Erik

Reply via email to