Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-20 Thread Walter Underwood
Setting mm to 100% means that any misspelled word in a query means zero results. That is not a good experience. Usually, 10% of queries contain a misspelling. Set mm to 1. The F-measure is not a good choice for this because recall is not very important in e-commerce. Use precision-oriented mea

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-20 Thread Vincenzo D'Amore
Thanks for all the info, I really appreciate your help. I'm working on the configuration and following your suggestions. We already had a golden set of query-results pairs (~1000) used to tune and check how my application (and Solr configuration) performs. But I've to entirely double check if this

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Walter Underwood
That page from Stanford is not about e-commerce search. Westlaw is professional librarian search. I agree with Emir’s advice. Start with edismax. Use a small value for the tie-breaker. It is one of the least important configuration values. I use the default from the sample configs: 0.1

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Erick Erickson
synonyms and relations between > search terms. > > /JZ > > -Original Message- > From: Charlie Hull [mailto:char...@flax.co.uk] > Sent: Tuesday, October 17, 2017 10:10 AM > To: solr-user@lucene.apache.org > Subject: Re: E-Commerce Search: tf-idf, tie-break and

RE: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Junte Zhang
between search terms. /JZ -Original Message- From: Charlie Hull [mailto:char...@flax.co.uk] Sent: Tuesday, October 17, 2017 10:10 AM To: solr-user@lucene.apache.org Subject: Re: E-Commerce Search: tf-idf, tie-break and boolean model For our e-commerce customers we've been recomme

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Charlie Hull
For our e-commerce customers we've been recommending a test-based relevance tuning strategy: here's a series of blogs written for us by someone who ran search for the world's largest electronic component distributor: http://www.flax.co.uk/blog/2016/03/18/get-started-improving-site-search-releva

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-16 Thread alessandro.benedetti
I was having the discussion with a colleague of mine recently, about E-commerce search. Of course there are tons of things you can do to improve relevancy: Custom similarity - edismax tuning - basic user events processing - machine learning integrations - semantic search ect ect more you do, bette

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-16 Thread Emir Arnautović
Hi Vincenzo, Unless you have really specific ranking requirements, I would not suggest you to start with you proprietary similarity implementation. In most cases edismax will be good enough to cover your requirements. It is not easy task to tune edismax since it has a log knobs that you can use.