RE: Do not match on high frequency terms

2015-08-03 Thread Swedish, Steve
certain threshold in a field. I can't find an easy way to do this, do you know if this is possible? Thanks, Steve -Original Message- From: Mikhail Khludnev [mailto:mkhlud...@griddynamics.com] Sent: Saturday, August 1, 2015 6:35 AM To: solr-user Subject: Re: Do not match on

Re: Do not match on high frequency terms

2015-08-01 Thread Mikhail Khludnev
It seems like you need to develop custom query or query parser. Regarding SolrJ: you can try to call http://wiki.apache.org/solr/TermsComponent https://cwiki.apache.org/confluence/display/solr/The+Terms+Component I'm not sure how exactly call TermsComponent in SolrJ, I just found https://lucene.apa

Re: Do not match on high frequency terms

2015-07-31 Thread Ahmet Arslan
Hi Steve, More like this (mlt) has mlt.mindf parameter, you can create a custom solr plugin inspired from it. At lucene level, you can access df with : org.apache.lucene.search.IndexSearcher#termStatistics Ahmet On Friday, July 31, 2015 11:31 PM, "Swedish, Steve" wrote: Hello, I'm hoping som

Do not match on high frequency terms

2015-07-31 Thread Swedish, Steve
Hello, I'm hoping someone might be able to help me out with this as I do not have very much solr experience. Basically, I am wondering if it is possible to not match on terms that have a document frequency above a certain threshold. For my situation, a stop word list will be unrealistic to main