Thanks for all the infos.
That helps so far to understand the issue .

Cheers, Bernd

 Mit freundlichen Grüßen

 Bernd Schmidt
 SOFTWARE-ENTWICKLUNG 

 b.schm...@eggheads.de



 Von:   Shawn Heisey <apa...@elyograg.org> 
 An:   <solr-user@lucene.apache.org> 
 Gesendet:   07.12.2017 19:00 
 Betreff:   Re: Howto search for § character 

On 12/7/2017 9:37 AM, Bernd Schmidt wrote: 
> Indeed, I saw in the analysis tab of the solr admin that the § char will be 
> removed when using type text_general. 
> But in this use case we want to make a full text search like "_text_:§45" or 
> "_text_:§*" to find words starting with §. 
> We need a text field here, not a string field! 
> What is your recommended way to deal with it?  
> Is it possible to remove the word break behaviour for the  § char? 
> Or is the best way to encode all § chars when indexing and searching? 
 
This character is classified by Unicode as punctuation: 
 
http://www.fileformat.info/info/unicode/char/00a7/index.htm 
 
Almost any example field type for full-text search that you're likely to 
encounter is going to be designed to split on punctuation and remove it 
from the token stream.  That's one of the most common things that 
full-text search engines do. 
 
You're going to need to design a new analysis chain that *doesn't* do 
this, apply the fieldType containing that analysis to your field, 
restart/reload, and reindex. 
 
Designing analysis chains is an art form, and tends to be one of the 
hardest parts of setting up a production Solr install.  It took me at 
least a month of almost constant work to settle on the schema design for 
the indexes that I maintain.  All of the "solr.TextField" types in my 
schema are completely custom -- none of the analysis chains in Solr 
examples are in that schema. 
 
Thanks, 
Shawn 
 




 eggheads GmbH
 Herner Straße 370
44807 Bochum

Fon +49 234 89397-0
Fax +49 234 89397-28
 
 www.eggheads.de
 -----------------------------------------------


Kunden DER TOURISTIK, EMSA, FRIATEC, MAMMUT, SUTTERLÜTY, SCHÄFER SHOP, THOMAS 
COOK, TUI, WILO SE, WÜRTH, u.v.m.


Leistungen Standardsoftware für Product Information Management, Cross Media 
Publishing & Multi Channel Commerce, Prozessberatung


Innovationspreis 2017 eggheads ist Sieger beim Innovationspreis-IT 2017 in der 
Kategorie E-Commerce. Mehr

-----------------------------------------------

Webinar Vorstellung der neuen Funktionalität der eggheads Suite am 12.12.2017. 
Mehr

-----------------------------------------------

Reply via email to