Thanks for your answer Rahul. I think I have explained similarity with the
example, assuming the natural order.
I would assume this is a common action for people who use solr and do search
based systems.
I am basically looking for any design patterns that people use to achieve the
results as exp
How do you define similarity? There are various different methods that work for
different methods. In solr depending on which index time analyzer / tokenizer
you are using, it will treat one company name as similar in one scenario and
not in another.
This seems like a case of data deduplication