Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

Alex Baranov Thu, 22 Oct 2009 12:28:07 -0700


hossman wrote:
> 
> If you just want the full input string passed to the analyzer of each qf 
> field, then you just need to quote the entire string (or escape every 
> shitespace charter in the string with a backslash) so that the entire 
> input is considered one chunk -- but then you don't get to use +/-, mm is 
> meaninless, real quote characters spcified by your users are meaninless, 
> etc...
> 

But this way this will mean "phrase query" and all terms should present in
the doc for the match.


hossman wrote:
> 
> Ok, so i'm asusming you mean you want to use the pattern tokenizer above 
> at query time -- the thing you have to remember is that before the query 
> time analysis is done, the query parser has to inspect teh raw input and 
> decide what is "markup" and what is "input" ... both dismax and the 
> standard query parser consider un-escaped/un-quated whitespace to be 
> markup, so the text is divided up that way before yout analyzier is ever 
> used -- it has to be so that the dismax parser has discrete chunks to 
> correlate in the DisjunctionMaxQueries.
> 

Correct, so I need to escape the whitespace characters in order to tell that
they are a part of "input" and not "markup". Will try that, thanks!
-- 
View this message in context: 
http://www.nabble.com/Dismax%3A-Impossible-to-search-for-a-_phrase_-in-tokenized-and-untokenized-fields-at-the-same-time-tp25832932p26015802.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

Reply via email to