Webster, did you try escaping the special character (assuming you did not do what Shawn did by replacing - with some other text and your indexed tokens have -)?
On Thu, Jul 27, 2017 at 12:03 PM, Webster Homer <webster.ho...@sial.com> wrote: > Shawn, > Thank you for that. I didn't know about that feature of the WDF. It doesn't > help my situation but it's great to know about. > Googling solr wildcard searches I found this link > http://lucene.472066.n3.nabble.com/Wildcard-search- > not-working-with-search-term-having-special-characters-and- > digits-td4133385.html > > "The use of a wildcard in a query term with embedded special characters > will bypass > normal analysis - you need to enter the term exactly as it would be > analyzed > at index time for wildcard to work." > > To me this seems like a design flaw. The Solr fieldtypes seem like they > allow a developer to create types that should handle wildcards > intelligently. At the very least the Analyzer tool should show this > behavior, and not even try to analyze terms with wildcards. > > Actually the behavior would more correctly be stated as "You need to enter > the term exactly as the data is after it has been indexed" If the fieldtype > removes hyphens then you must enter the wildcard query without hyphens. > > On Thu, Jul 27, 2017 at 8:35 AM, Shawn Heisey <apa...@elyograg.org> wrote: > > > On 7/26/2017 12:33 PM, Webster Homer wrote: > > > checked the Pattern Replace it's OK. Can't use the preserve original > > > since it preserves the hyphens too, which I don't want. It would be > > > best if it didn't touch the * at all > > > > You can tell WDF to change the meaning of certain characters. Here's a > > WDF entry in one of my schemas: > > > > <filter class="solr.WordDelimiterFilterFactory" > > splitOnCaseChange="0" > > splitOnNumerics="0" > > stemEnglishPossessive="0" > > generateWordParts="1" > > generateNumberParts="1" > > catenateWords="0" > > catenateNumbers="0" > > catenateAll="0" > > preserveOriginal="1" > > types="wdftypes_mt.txt" > > /> > > > > This is the contents of wdftypes_mt.txt (between the --- lines): > > > > --- > > - => ALPHA > > _ => ALPHA > > --- > > > > I have defined the hyphen and underscore as alphabetic characters in > > this situation. This is in a fieldType that I use for a field that > > contains a typical mime-type, where I do not want to split on a hyphen > > or underscore. > > > > I am having a hard time finding documentation on the "types" parameter > > for WDF. I no longer remember where I found the information on how to > > format that file. I may have looked at the source code. > > > > Thanks, > > Shawn > > > > > > -- > > > This message and any attachment are confidential and may be privileged or > otherwise protected from disclosure. If you are not the intended recipient, > you must not copy this message or attachment or disclose the contents to > any other person. If you have received this transmission in error, please > notify the sender immediately and delete the message and any attachment > from your system. Merck KGaA, Darmstadt, Germany and any of its > subsidiaries do not accept liability for any omissions or errors in this > message which may arise as a result of E-Mail-transmission or for damages > resulting from any unauthorized changes of the content of this message and > any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its > subsidiaries do not guarantee that this message is free of viruses and does > not accept liability for any damages caused by any virus transmitted > therewith. > > Click http://www.emdgroup.com/disclaimer to access the German, French, > Spanish and Portuguese versions of this disclaimer. > -- Saurabh Sethi Principal Engineer I | Engineering