Hello, I'm hoping someone might be able to help me out with this as I do not have very much solr experience. Basically, I am wondering if it is possible to not match on terms that have a document frequency above a certain threshold. For my situation, a stop word list will be unrealistic to maintain, so I was wondering if there may be an alternative solution using term document frequency to identify common terms.
What would actually be ideal is if I could somehow use the CommonTermsQuery. The problem I ran across when looking at this option was that the CommonTermsQuery seems to only work for queries on one field at a time (unless I'm mistaken). However, I have a query of the structure q=(field1:(blah) AND (field2:(blah) OR field3:(blah))) OR field1:(blah) OR (field2:(blah) AND field3:(blah)). If there are any ideas on how to use the CommonTermsQuery with this query structure, that would be great. If it's possible to extract the document frequency for terms in my query before the query is run, allowing me to remove the high frequency terms from the query first, that could also be a valid solution. I'm using solrj as well, so a solution that works with solrj would be appreciated. Thanks, Steve