On Fri, 26 Sep 2008 10:42:42 -0700 (PDT)
Chris Hostetter <[EMAIL PROTECTED]> wrote:

> :                             <tokenizer
> : class="solr.KeywordTokenizerFactory" /> <!-- The LowerCase TokenFilter does
> 
> : Now, when I search with ?q=the doors , all the terms in my q= aren't used
> : together to build the dismaxQuery , so I never get a match on the _exact
> fields:
> 
> The query parser (even the dismax queryparser) does it's white space 
> "chunking" before handing any input off to the analyzer for the 
> appropriate field, so with [[ ?q=the doors ]] "the" and "doors" are going 
> to get analyzed seperately ... which is why you see artist_exact:the^100.0 
> and artist_exact:doors^100.0 in your parsedquery -- *BUT* since you used 
> KeywordTOkenizer at index time, you'll never get a match for either of 
> those on any document (unles the artist is just "the" or "doors")

Hi Hoss :)
thanks for the feedback - I arrived @ the same conclusion . The biz requirement
is that these *_exact fields match exactly the original contents of the field.
Right now we are using Dismax, and changing this means rewriting a lot of the
queries , which isn't possible. 

That's why I was wondering how Dismax breaks it all apart. It makes sense...I
suppose what I'd like to have is a way to tell dismax which fields NOT to
tokenize the input for. For these fields, it would pass the full q instead of
each part of it. Does this make sense? would it be useful at all? 

> : I've tried with other queries that don't include stopwords (smashing
> pumpkins, : for example), and in all cases, if I don't use " ", only the LAST
> word is used : with my _exact fields ( tried with 1, 2 and 3 words, always
> the same against my : _exact fields..)
> 
> this "LAST word" part doesn't make sense to me ... you can see "the" 
> making it into your query on the *_exact fields in the first 
> DisjunctionMaxQuery, do you have toStrings for these other queries we 
> could see to understand what you mean?

I agree, it makes sense as you say...i must have missed the initial tokens. I
can't confirm atm, so I'll follow the common sense path :)

As usual, thanks for your time and insights :)

B
_________________________
{Beto|Norberto|Numard} Meijome

"Humans die and turn to dust, but writing makes us remembered"
  4000-year-old words of an Egyptian scribe

I speak for myself, not my employer. Contents may be hot. Slippery when wet.
Reading disclaimers makes you go blind. Writing them is worse. You have been
Warned.

Reply via email to