Hello all,
One quick question, trying to find out what scenario would work best. We have huge free text dataset containing product titles, descriptions. Unfortunately, we don't have the data categorized so we rely on 'search relevancy + synonyms' heavily to categorize. Here is what I am trying to do : Someone clicks on 'Comforters & Pillows' , we would want the results to be filtered where title has keyword 'Comforter' or 'Pillows' but we have been getting results with word 'comfort' in the title. I assume it is because of stemming. What is the right way to handle this? I am thinking to create another unstemmed field as 'title_unstemmed' which stores the data unstemmed. So basically, with dismax - I could boost score on unstemmed field. I can think of other scenarios where stemming would be needed so stemmed field would still match. Does that sound like something that will work? Any suggestions please? Much appreciated