Re: Text Similarity

2018-07-15 Thread Aroop Ganguly
Thanks for your answer Rahul. I think I have explained similarity with the example, assuming the natural order. I would assume this is a common action for people who use solr and do search based systems. I am basically looking for any design patterns that people use to achieve the results as exp

Re: Text Similarity

2018-07-15 Thread Rahul Singh
How do you define similarity? There are various different methods that work for different methods. In solr depending on which index time analyzer / tokenizer you are using, it will treat one company name as similar in one scenario and not in another. This seems like a case of data deduplication