Re: truncate string field type

2018-07-10 Thread Zahra Aminolroaya
suppose I want to search the "l(i|a)*on k(i|e)*ng". there is a space between two words. I want solr to retrieve the exact match that these two words or their other cases are adjacent. If I want to use text field type, each one of these words are considered as tokens, so solr may bring back other re

Re: truncate string field type

2018-07-10 Thread Alexandre Rafalovitch
Are you sure Solr is the right tool for you? Regexp searches is the really last resort approach in the domain. I suggest that maybe you rethink your actual business case (share it here) to benefiy from tokenization or look if other tools are better. As it is, you are using a drill to hammer nails

Re: truncate string field type

2018-07-09 Thread Zahra Aminolroaya
Thanks Alexandre and Erick. Erick I want to use my regular expression to search a field and Solr text field token the document, so the regular expression result will not be valid. I want Solr not to token my doc, although I will lose some terms using solr string. -- Sent from: http://lucene.4720

Re: truncate string field type

2018-07-08 Thread Erick Erickson
Why do you want to add such long strings to your index in the first place? There are almost useless for search, you want tokenized (text_general is a good place to start) if you want to search for words within the string. "The number of bytes limit" is 32K or so, right? What do you want to do with

Re: truncate string field type

2018-07-07 Thread Alexandre Rafalovitch
Did you look into UpdateRequestProcessors? There is a truncate one there. Regards, Alex On Sun, Jul 8, 2018, 12:44 AM Zahra Aminolroaya, wrote: > I want to truncate my string field type due to its number of bytes limit. I > wrote the following in my schema: > > > > > >pr

truncate string field type

2018-07-07 Thread Zahra Aminolroaya
I want to truncate my string field type due to its number of bytes limit. I wrote the following in my schema: However, I found that StrField (string) does not support specifying an analyzer. Besides, prefixLength in TruncateTokenFilterFactory could no