suppose I want to search the "l(i|a)*on k(i|e)*ng". there is a space between
two words. I want solr to retrieve the exact match that these two words or
their other cases are adjacent. If I want to use text field type, each one
of these words are considered as tokens, so solr may bring back other
re
Are you sure Solr is the right tool for you? Regexp searches is the really
last resort approach in the domain.
I suggest that maybe you rethink your actual business case (share it here)
to benefiy from tokenization or look if other tools are better.
As it is, you are using a drill to hammer nails
Thanks Alexandre and Erick. Erick I want to use my regular expression to
search a field and Solr text field token the document, so the regular
expression result will not be valid. I want Solr not to token my doc,
although I will lose some terms using solr string.
--
Sent from: http://lucene.4720
Why do you want to add such long strings to your index in the first
place? There are almost useless for search, you want tokenized
(text_general is a good place to start) if you want to search for
words within the string.
"The number of bytes limit" is 32K or so, right? What do you want to
do with
Did you look into UpdateRequestProcessors?
There is a truncate one there.
Regards,
Alex
On Sun, Jul 8, 2018, 12:44 AM Zahra Aminolroaya,
wrote:
> I want to truncate my string field type due to its number of bytes limit. I
> wrote the following in my schema:
>
>
>
>
>
>pr
I want to truncate my string field type due to its number of bytes limit. I
wrote the following in my schema:
However, I found that StrField (string) does not support specifying an
analyzer. Besides, prefixLength in TruncateTokenFilterFactory could no