Re: Get All terms from all documents

2008-12-19 Thread Walter Underwood
At Netflix, we load the completion lexicon with movie titles, person names, and a few aliases. Even then, we find a few misspellings in our metadata (is it "NWA" or "N.W.A."?). Extracting terms from documents will find a lot of misspellings. You really do not want to rely on random users to correc
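A minimal sketch of the curated-lexicon idea described above, not Netflix's actual implementation: completions come from a hand-maintained list of names plus aliases rather than from terms extracted from documents. The function names `build_lexicon` and `complete`, and the sample entries, are hypothetical.

```python
import bisect

def build_lexicon(names, aliases=None):
    """Return a sorted list of (lowercased key, display name) pairs.

    `aliases` maps an alternate spelling to the canonical display name.
    """
    pairs = {(name.lower(), name) for name in names}
    for alias, canonical in (aliases or {}).items():
        pairs.add((alias.lower(), canonical))
    return sorted(pairs)

def complete(lexicon, prefix, limit=10):
    """Return up to `limit` distinct display names whose key starts with `prefix`."""
    key = prefix.lower()
    # A 1-tuple sorts before any 2-tuple with the same first element,
    # so this finds the first entry whose key is >= the prefix.
    start = bisect.bisect_left(lexicon, (key,))
    matches = []
    for entry_key, display in lexicon[start:]:
        if not entry_key.startswith(key):
            break
        if display not in matches:
            matches.append(display)
        if len(matches) == limit:
            break
    return matches

# Hypothetical sample data: one canonical name with an alias spelling.
lexicon = build_lexicon(["N.W.A.", "North by Northwest"], {"NWA": "N.W.A."})
```

Here `complete(lexicon, "nw")` resolves the alias to the canonical "N.W.A.", so a variant spelling in the query still leads to a single curated entry.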

Re: Get All terms from all documents

2008-12-19 Thread Grant Ingersoll
I'd add that you probably don't want just the query logs; people may search for things that aren't in the index, too. Your call as to whether that is useful or not. Also, have a look at the TermsComponent, as it will tell you the doc freq for terms. On Dec 19, 2008, at 10:08 AM, roberto wrote
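As a rough illustration of the TermsComponent suggestion, the sketch below only builds a request URL. It assumes the component is registered under a `/terms` request handler; the base URL and the field name `title` are made up for the example. The parameters `terms`, `terms.fl`, `terms.prefix`, and `terms.limit` are the ones described on the TermsComponent wiki page.

```python
from urllib.parse import urlencode

def terms_url(base_url, field, prefix="", limit=10):
    """Build a TermsComponent request URL for one field.

    Assumes a request handler named /terms exists in solrconfig.xml.
    """
    params = {
        "terms": "true",       # enable the component
        "terms.fl": field,     # field whose terms should be listed
        "terms.limit": limit,  # maximum number of terms to return
        "wt": "json",          # ask for a JSON response
    }
    if prefix:
        params["terms.prefix"] = prefix  # restrict to terms with this prefix
    return f"{base_url}/terms?{urlencode(params)}"

# Hypothetical host and field name.
url = terms_url("http://localhost:8983/solr", "title", prefix="no", limit=5)
```

Fetching that URL should return each matching term together with its document frequency, which is the number the thread is after.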

Re: Get All terms from all documents

2008-12-19 Thread roberto
Erick, Thanks, this sounds good, I'll try. Mike, Could you give more details about query logs? Thanks On Fri, Dec 19, 2008 at 12:02 AM, Mike Klaas wrote: > On 18-Dec-08, at 10:53 AM, roberto wrote: >> Erick, Thanks for the answer, let me clarify the thing, we would like to have a

Re: Get All terms from all documents

2008-12-18 Thread Mike Klaas
On 18-Dec-08, at 10:53 AM, roberto wrote: Erick, Thanks for the answer, let me clarify the thing, we would like to have a combobox with the terms to guide the user in the search. I mean, if I have thousands of documents and want to tell them how many documents in the base have the partic
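One way the combobox described above might be filled, sketched under assumptions: the term counts arrive as a JSON response in which each field maps to a flat list alternating term and document count. That layout, the field name `title`, and the sample payload are all illustrative and should be checked against what your Solr version actually returns.

```python
import json

def term_counts(response_text, field):
    """Turn a flat [term, count, term, count, ...] list into (term, count) pairs."""
    flat = json.loads(response_text)["terms"][field]
    return list(zip(flat[0::2], flat[1::2]))

def combobox_labels(pairs):
    """Format each pair as 'term (count)' for display next to the search box."""
    return [f"{term} ({count})" for term, count in pairs]

# Made-up sample payload, not real index data.
sample = '{"terms": {"title": ["matrix", 12, "memento", 3]}}'
labels = combobox_labels(term_counts(sample, "title"))
```

With the sample payload, `labels` holds one entry per term with its document count in parentheses, ready to be used as the combobox options.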

Re: Get All terms from all documents

2008-12-18 Thread Erick Erickson
> All terms from all docs? Really? At any rate, see http://wiki.apache.org/solr/TermsComponent May need a mod to not require any field, but for now you can enter all fields (which

Re: Get All terms from all documents

2008-12-18 Thread roberto
> mod to not require any field, but for now you can enter all fields (which you can get from LukeRequestHandler) -Grant On Dec 17, 2008, at 2:17 PM, roberto wrote:

Re: Get All terms from all documents

2008-12-18 Thread Erick Erickson
Really? At any rate, see http://wiki.apache.org/solr/TermsComponent May need a mod to not require any field, but for now you can enter all fields (which you can get from LukeRequestHandler) -Grant On

Re: Get All terms from all documents

2008-12-17 Thread roberto
r all fields (which you can get from LukeRequestHandler) -Grant On Dec 17, 2008, at 2:17 PM, roberto wrote: > Hello, I need to get all terms from all documents to be placed in my interface almost like the facets, how can I do it?

Re: Get All terms from all documents

2008-12-17 Thread Grant Ingersoll
get all terms from all documents to be placed in my interface almost like the facets, how can i do it? thanks -- "Without love, we are birds with broken wings." Morrie -- Grant Ingersoll Lucene Helpful Hints: http://wiki.apache.org/lucene-java/BasicsOfPerfor

Get All terms from all documents

2008-12-17 Thread roberto
Hello, I need to get all terms from all documents to be placed in my interface almost like the facets, how can I do it? thanks -- "Without love, we are birds with broken wings." Morrie