Hmmm, I don't see any screen shots. Several things: 1> If your stopword file has comments, I'm not sure what the effect would be. 2> Something's not right here, or I'm being fooled again. Your withresults xml has this line: <str name="parsedquery">+DisjunctionMaxQuery((meta_text:"ecol d ingenieur")~0.01) ()</str> and your noresults has this line: <str name="parsedquery">+DisjunctionMaxQuery((meta_text:"academi charpenti")~0.01) DisjunctionMaxQuery((meta_text:"academi charpenti"~100)~0.01)</str>
the empty () in the first one often means you're NOT going to your configured dismax parser in solrconfig.xml. Yet that doesn't square with your custom qt, so I'm puzzled. Could we see your raw query string on the way in? It's almost as if you defined qt in one and defType in the other, which are not equivalent. 3> It may take 12 hours to index, but you could experiment with a smaller subset. You say you know that the noresults one should return documents, what proof do you have? If there's a single document that you know should match this, just index it and a few others and you should be able to make many runs until you get to the bottom of this... And obviously your stemming is happening on the query, are you sure it's happening at index time too? Best Erick On Mon, Jan 24, 2011 at 1:51 PM, Jerome Renard <jerome.ren...@gmail.com>wrote: > Hi Em, Erick > > thanks for your feedback. > > Em : yes Here is the stopwords.txt I use : > - > http://svn.apache.org/repos/asf/lucene/dev/trunk/modules/analysis/common/src/resources/org/apache/lucene/analysis/snowball/french_stop.txt > > On Mon, Jan 24, 2011 at 6:58 PM, Erick Erickson > <erickerick...@gmail.com>wrote: > >> Try submitting your query from the admin page with &debugQuery=on and see >> if that helps. The output is pretty dense, so feel free to cut-paste the >> results for >> help. >> >> Your stemmers have English as the language, which could also be >> "interesting". >> >> > Yes, I noticed that this will be fixed. > > >> As Em says, the analysis page may help here, but I'd start by taking out >> WordDelimiterFilterFactory, SnowballPorterFilterFactory and >> StopFilterFactory >> and build back up if you really need them. Although, again, the analysis >> page >> that's accessible from the admin page may help greatly (check "debug" in >> both >> index and query). >> >> > You will find attached two xml files one with no results (noresult.xml.gz) > and one with > a lot of results (withresults.xml.gz). You will also find attached two > screenshots showing > there is a highlighted section in the "Index analyzer" section when > analysing text. > > >> Oh, and you MUST re-index after changing your schema to have a true test. >> >> > Yes, the problem is that reindexing takes around 12 hours which makes it > really hard > for testing :/ > > > Thanks in advance for your feedback. > > Best Regards, > > -- > Jérôme >