Re: Searching on special characters

Jack Krupansky Thu, 24 Oct 2013 06:40:37 -0700

Have two or three copies of the text, one field could be raw string andboosted heavily for exact match, a second could be text using the keywordtokenizer but with lowercase filter also heavily boosted, and the thirdfield general, tokenized text with a lower boost. You could also have a copythat uses the keyword tokenizer to maintain a single token but also appliesa regex filter to strip special characters and applies a lower case filterand give that an intermediate boost.


-- Jack Krupansky

-----Original Message-----From: johnmu...@aol.com

Sent: Thursday, October 24, 2013 9:20 AM
To: solr-user@lucene.apache.org
Subject: Searching on special characters

Hi,

How should I setup Solr so I can search and get hit on special characterssuch as: + - && || ! ( ) { } [ ] ^ " ~ * ? : \



My need is, if a user has text like so:


Doc-#1: "(Solr)"
Doc-#2: "Solr"

And they type "(solr)" I want a hit on "(solr)" only in document #1, withthe brackets matching. And if they type "solr", they will get a hit inDocument #2 only.

An additional nice-to-have is, if they type "solr", I want a hit in bothdocument #1 and #2.



Here is what my current schema.xml looks like:



     <analyzer>
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>

<filter class="solr.StopFilterFactory" ignoreCase="true"words="lang/stopwords_en.txt" enablePositionIncrements="true"/><filter class="solr.WordDelimiterFilterFactory"generateWordParts="1" generateNumberParts="1" catenateWords="1"catenateNumbers="1" catenateAll="1" splitOnCaseChange="0"splitOnNumerics="1" stemEnglishPossessive="1" preserveOriginal="1"/>

       <filter class="solr.LowerCaseFilterFactory"/>

<filter class="solr.KeywordMarkerFilterFactory"protected="protwords.txt"/>

       <filter class="solr.PorterStemFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
     </analyzer>



Currently, special characters are being stripped.



Any idea how I can configure Solr to do this?  I'm using Solr 3.6.



Thanks !!

-MJ

Re: Searching on special characters

Reply via email to