Re: Ideas for a relevance score that could be considered stable across multiple searches with the same query structure?

2007-05-30 Thread Daniel Einspanjer
On 4/11/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: : Not really. The explain scores aren't normalized and I also couldn't : find a way to get the explain data as anything other than a whitespace : formatted text blob from Solr. Keep in mind that they need confidence the defualt way Solr du

Re: Ideas for a relevance score that could be considered stable across multiple searches with the same query structure?

2007-05-05 Thread Daniel Einspanjer
On 4/11/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: A custom Similaity class with simplified tf, idf, and queryNorm functions might also help you get scores from the Explain method that are more easily manageable since you'll have predictible query structures hard coded into your application

EmbeddedSolr class from Wiki

2007-04-27 Thread Daniel Einspanjer
The example EmbeddedSolr class on the wiki makes use of getUpdateHandler which was added after 1.1 (so it seems to be available only on trunk). I'd really like to move to an embedded Solr sooner rather than later. My questions are: - Would it be easy/possible to work around the lack of getUp

Re: embedding solr

2007-04-10 Thread Daniel Einspanjer
That is good to hear. I guess I was overly worried when I saw your mention of having trouble getting the field values with the correct types. I will be taking a look at this later this week. Thank you very much for your prompt response. On 4/10/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: Ther

Ideas for a relevance score that could be considered stable across multiple searches with the same query structure?

2007-04-10 Thread Daniel Einspanjer
I did a bit of research on the list for prior discussions of normalized scores and such. Please forgive me if I overlooked something relevant, but I didn't see anything exactly what I'm looking for. I am building a replacement for our current text matching engine that takes a list of documents f

Re: embedding solr

2007-04-10 Thread Daniel Einspanjer
Ryan, Do you have any of this code you could share? I am currently using Solr to perform thousands of queries in a batch, and eliminating the HTTP overhead is something I'd love to do if it isn't complicated. We need several of the extra features Solr provides, which is why we are trying to use

How can you perform a fuzzy search on a phrase without it turning into a word distance search?

2007-03-29 Thread Daniel Einspanjer
I'd like to be able to search for multi-word titles in a fuzzy manner where small typos could be compensated for, but when you make a query term like: title:"The increadable machine"~ that will perform a word distance search instead of a fuzzy search. Is it possible to do this without manually sp

Re: Best approach for indexing and querying against a multivalue name field like directors or actors?

2007-03-28 Thread Daniel Einspanjer
parate instances of the multivalue field. I hope this makes understanding my question better rather than worse. :) Thanks, Daniel On 3/28/07, Daniel Einspanjer <[EMAIL PROTECTED]> wrote: but I need to make sure that "Jane Doe" wouldn't match a document with two directors, "Jane Smith" and "John Doe" at all.

Best approach for indexing and querying against a multivalue name field like directors or actors?

2007-03-28 Thread Daniel Einspanjer
I'm rather new to Solr and somewhat rusty on what little I learned on Lucene a few years back. I've got some documents I want to index that have multiple name fields such as directors or actors. I'm wanting to index them such that querying for "Jane Doe" would have a higher score for "Jane M. Doe