Hi Tanya, I think can have a stop filter applied to the query for your field type.
<analyser type=query> ... <filter class="solr.StopFilterFactory" ignoreCase="true" words="myquerystopwords.txt" /> You should be aable to use the length filter for the second part of your question. <filter class="solr.LengthFilterFactory" min="3"/> Cheers, Peter. -----Original Message----- From: Tanya Bompi [mailto:tanya.bo...@gmail.com] Sent: 13 December 2018 19:54 To: solr-user@lucene.apache.org Subject: terms not to match in a search query Hi, If there are certain terms in the query like "pvt", "ltd" which I wouldn't want to be matched against the index, is there a way to specify the list of words that I could set in the configuration and not make it part of the query. Say, is it possible to add the terms to stopwords.txt or any other file that could be treated as a blacklist which at querying time will be taken of. Also, is there a configuration setting to be able to set a min length of the words that should be used in the matching when retrieving the documents? Basically any words after tokenization of length < 3 to be ignored. Kindly let me know. Thanks, Tanya _________________________________________________________________________________________ Caution: This email originated from outwith the organisation. Do not click on links or open attachments unless you recognise the sender and know the content is safe _________________________________________________________________________________________ ________________________________ This message is confidential and may contain privileged information. You should not disclose its contents to any other person. If you are not the intended recipient, please notify the sender named above immediately. It is expressly declared that this e-mail does not constitute nor form part of a contract or unilateral obligation. Opinions, conclusions and other information in this message that do not relate to the official business of findmypast shall be understood as neither given nor endorsed by it. ________________________________ __________________________________________________________________________ This email has been checked for virus and other malicious content prior to leaving our network. __________________________________________________________________________