Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: On 1/3/07 9:33 PM, "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: >> We tried several APIs and decided that the best was an array of >> String with the odd elements containing the strings t

Re: Better highlighting fragmenter

2007-01-03 Thread Walter Underwood
On 1/3/07 9:33 PM, "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: >> We tried several APIs and decided that the best was an array of >> String with the odd elements containing the strings that needed >> highlighting. > > Good idea... the only th

Re: Better highlighting fragmenter

2007-01-03 Thread Yonik Seeley
On 1/3/07, Mike Klaas <[EMAIL PROTECTED]> wrote: That is _much_ better than the current system. It wouldn't be hard to add start/end offsets to the fragments too, as Chris suggested so long ago. Or leave room for other info such as weights, or what term matched, etc. Quite a few aspects of t

Re: Better highlighting fragmenter

2007-01-03 Thread Yonik Seeley
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: We tried several APIs and decided that the best was an array of String with the odd elements containing the strings that needed highlighting. Good idea... the only thing I could think of was an array of start/end offsets into the string, wh

Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: I've implemented this before, once in Python and once in C, so I'd be glad to take a look at it. I'm not sure I have time to do a lot of implementation, but I'd sure be glad to help. Cool. I'll post the current fragmenter as a JIRA issue

Re: Better highlighting fragmenter

2007-01-03 Thread Walter Underwood
On 1/3/07 5:13 PM, "Mike Klaas" <[EMAIL PROTECTED]> wrote: > Generally, we should strive for a high-quality out-of-the-box > highlighting in Solr. That might involve making things like better > fragmenters and a few other tricks(*) the default setup, and providing > a "quick & dirty" setting for

Re: Better highlighting fragmenter

2007-01-03 Thread Chris Hostetter
: implementing Solr's highlighting. What I'd like to do is improve the : end-user's experience with highlighting in Solr. If as a result a : high-quality component for lucene Highlighter is fleshed out, that can : always be contributed to Lucene later. : : Generally, we should strive for a high-

Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: : I've written an unpolished custom fragmenter for highlighting which is : more expensive than the BasicFragmenter that ships with lucene, but : generates more natural candidate fragments (it will tend to produce : beginning/ends of sentences

Re: Better highlighting fragmenter

2007-01-03 Thread Chris Hostetter
: I've written an unpolished custom fragmenter for highlighting which is : more expensive than the BasicFragmenter that ships with lucene, but : generates more natural candidate fragments (it will tend to produce : beginning/ends of sentences). : : Would there be interest in the community in relea

Re: Better highlighting fragmenter

2007-01-03 Thread Erik Hatcher
On Jan 3, 2007, at 6:36 PM, Mike Klaas wrote: I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships with lucene, but generates more natural candidate fragments (it will tend to produce beginning/ends of sentences). Would there

Re: Better highlighting fragmenter

2007-01-03 Thread Michael Imbeault
I for one would be interested in such a fragmenter, as the default one is lacking and doesnt produce acceptable results for most applications. Michael Mike Klaas wrote: I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships wit

Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships with lucene, but generates more natural candidate fragments (it will tend to produce beginning/ends of sentences). Would there be interest in the community in releasing it and