At Netflix, we load the completion lexicon with movie titles, person
names, and a few aliases. Even then, we find a few misspellings in
our metadata (is it "NWA" or "N.W.A."?). Extracting terms from
documents will find a lot of misspellings.
You really do not want to rely on random users to correc
I'd add you probably don't want just the query logs, people may search
for things that aren't in the index, too. Your call as to whether
that is useful or not. Also, have a look at the TermsComponent, as it
will tell you the doc freq for terms.
On Dec 19, 2008, at 10:08 AM, roberto wrote
Erick,
Thanks this sounds good, i'll try.
Mike,
Could you give more details about query logs?
Thanks
On Fri, Dec 19, 2008 at 12:02 AM, Mike Klaas wrote:
>
> On 18-Dec-08, at 10:53 AM, roberto wrote:
>
> Erick,
>>
>> Thanks for the answer, let me clarify the thing, we would like to have a
>>
On 18-Dec-08, at 10:53 AM, roberto wrote:
Erick,
Thanks for the answer, let me clarify the thing, we would like to
have a
combobox with the terms to guide the user in the search i mean, if a
have
thousands of documents and want to tell them how many documents in
the base
have the partic
t; >
> > > > All terms from all docs? Really?
> > > >
> > > > At any rate, see http://wiki.apache.org/solr/TermsComponent May
> need
> > a
> > > > mod to not require any field, but for now you can enter all fields
> > (which
> mod to not require any field, but for now you can enter all fields
> (which
> > > you can get from LukeRequestHandler)
> > >
> > > -Grant
> > >
> > >
> > >
> > > On Dec 17, 2008, at 2:17 PM, roberto wrote:
> >
Really?
> >
> > At any rate, see http://wiki.apache.org/solr/TermsComponent May need a
> > mod to not require any field, but for now you can enter all fields (which
> > you can get from LukeRequestHandler)
> >
> > -Grant
> >
> >
> >
> > On
r all fields (which
> you can get from LukeRequestHandler)
>
> -Grant
>
>
>
> On Dec 17, 2008, at 2:17 PM, roberto wrote:
>
> Hello,
>>
>> I need to get all terms from all documents to be placed in my interface
>> almost like the facets, how can i do it?
get all terms from all documents to be placed in my
interface
almost like the facets, how can i do it?
thanks
--
"Without love, we are birds with broken wings."
Morrie
--
Grant Ingersoll
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerfor
Hello,
I need to get all terms from all documents to be placed in my interface
almost like the facets, how can i do it?
thanks
--
"Without love, we are birds with broken wings."
Morrie
10 matches
Mail list logo