Hi, 
My application needs to handle synonyms for courses. The most natural way to
achieve this would be having the field "course" to be multivalued. 

Now, say I add documents  like:

<document>
  <field name="professor">John Dane</field>
  <field name="course">Algorithms</field>
  <field name="course">Theory</field>
  <field name="course">Computability, Complexity and Algorithms</field>
</document>

<document>
  <field name="professor">Mary Arriaga</field>
  <field name="course">Algorithms for Pattern Matching</field>
</document>

Now, if I query for "Algorithms", I get a higher score for document 2 than
document 1. 

1) I have noticed that this is because length norm factor of lucene scoring
considers all values of the mutivalued field, which is reducing the overall
score of document 1. How can I avoid this? 

2) Is there a alternate way to achieve what I want here? I can think of
changing the schema of my index by making the field "course" as
single-valued and creating separate documents for each synonym for a course.
But wont that explode the index size. 

Any kind of help is appreciated. Thanks in advance.

- Abhishek
-- 
View this message in context: 
http://www.nabble.com/Querying-multivalued-field---can-scoring-formula-consider-only-matched-values--tp19865873p19865873.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to