hossman wrote:
>
> If you just want the full input string passed to the analyzer of each qf
> field, then you just need to quote the entire string (or escape every
> whitespace character in the string with a backslash) so that the entire
> input is considered one chunk -- but then you don't get to use +/-, mm is
> meaningless, real quote characters specified by your users are meaningless,
> etc...
>
But quoting the whole string turns it into a phrase query, so all of the
terms would have to be present in the doc for a match.

hossman wrote:
>
> Ok, so I'm assuming you mean you want to use the pattern tokenizer above
> at query time -- the thing you have to remember is that before the query
> time analysis is done, the query parser has to inspect the raw input and
> decide what is "markup" and what is "input" ... both dismax and the
> standard query parser consider un-escaped/un-quoted whitespace to be
> markup, so the text is divided up that way before your analyzer is ever
> used -- it has to be so that the dismax parser has discrete chunks to
> correlate in the DisjunctionMaxQueries.
>
Correct, so I need to escape the whitespace characters to indicate that they
are part of the "input" and not "markup". Will try that, thanks!
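
For reference, here is a minimal Python sketch of the escaping step on the
client side. Both function names are hypothetical. split_chunks is only a toy
stand-in for the whitespace splitting described above, not Solr's actual
parser code, and it ignores +/- and other operators. It is there just to show
that an escaped string stays in one chunk.

```python
import re
from typing import List


def escape_whitespace(user_input: str) -> str:
    """Prefix every whitespace character with a backslash.

    Only whitespace is handled here; other special characters (+, -, quotes)
    are left untouched, which is an assumption of this sketch.
    """
    return re.sub(r"\s", lambda m: "\\" + m.group(0), user_input)


def split_chunks(query: str) -> List[str]:
    """Toy model of the parser's first pass (NOT the real Solr code):
    split on whitespace unless it is backslash-escaped or inside quotes."""
    chunks: List[str] = []
    current: List[str] = []
    in_quotes = False
    i = 0
    while i < len(query):
        ch = query[i]
        if ch == "\\" and i + 1 < len(query):
            # Keep the escaped character together with its backslash.
            current.append(query[i:i + 2])
            i += 2
            continue
        if ch == '"':
            in_quotes = not in_quotes
            current.append(ch)
        elif ch.isspace() and not in_quotes:
            # Unescaped, unquoted whitespace is "markup": it ends the chunk.
            if current:
                chunks.append("".join(current))
                current = []
        else:
            current.append(ch)
        i += 1
    if current:
        chunks.append("".join(current))
    return chunks


if __name__ == "__main__":
    raw = "new york city"
    print(split_chunks(raw))                     # three separate chunks
    print(split_chunks(escape_whitespace(raw)))  # one chunk
```

The escaped string would then be sent as the q parameter, so each qf field's
analyzer should receive the whole input as a single chunk.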
--
View this message in context:
http://www.nabble.com/Dismax%3A-Impossible-to-search-for-a-_phrase_-in-tokenized-and-untokenized-fields-at-the-same-time-tp25832932p26015802.html
Sent from the Solr - User mailing list archive at Nabble.com.