Hi Everyone, I have a document that contains data like this "Bachelor's degree is easier to get" in the 'body' field and I am making a query on this field searching for word 'Bachelor's degree' like this - query?fq=body:"bachelor%27s% 20degree"&fl=body_frequency:termfreq(body,"bachelor%27s%20degree"),body and I am getting zero results in response even when I have documents that contains words like 'Bachelor's degree'.
I checked in the admin panel tab , there i can see the WordDelimiterFilterFactory applied on the word 'Bachelor's Degree' and converting it to 'Bachelor degree'. So both in the Field Value (Query) and Field Value (Index) the WordDelimiterFilterFactory is converting the word 'Bachelor's Degree' to 'Bachelor's Degree' , SO why I am getting zero results when quering. I have attached the screenshots of my analysis page. I have attached a code file 'code.txt' where you can see the code for the field 'body'. Please tell me what Am I doing wrong. Thanks -- *CONFIDENTIALITY NOTICE: This e-mail transmission, and any documents, files or previous e-mail messages attached to it, are confidential and may be privileged. If you are not the intended recipient, or a person responsible for delivering it to the intended recipient, you are hereby notified that any review, disclosure, copying, distribution, retransmission, dissemination or other use of any of the information contained in, or attached to, this transmission is STRICTLY PROHIBITED. If you have received this transmission in error, please immediately notify the sender. Please destroy the original transmission and its attachments without reading or saving in any manner. Thank you.* *algoscale technologies private limited*
<field name="body" required="true" type="text_general_shingled_stemmed" indexed="true" stored="true" termVectors="true" omitNorms="true" termOffsets="false"/> <fieldType name="text_general_shingled_stemmed" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.WordDelimiterFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" /> <filter class="solr.SnowballPorterFilterFactory" language="English" protected="protwords.txt"/> <filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="5" outputUnigrams="true"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.WordDelimiterFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" /> <filter class="solr.SnowballPorterFilterFactory" language="English" protected="protwords.txt"/> <filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="5" outputUnigrams="true"/> </analyzer> <analyzer type="tvrh"> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.WordDelimiterFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" /> <filter class="solr.SnowballPorterFilterFactory" language="English" protected="protwords.txt"/> <filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="5" outputUnigrams="true"/> </analyzer> </fieldType>