Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Alexandre Rafalovitch
Actually just doing a debug-enabled query with real keywords would show you what happens, as it will list the analyzed keywords and against which fields they run. So, if your stopword is present in debug, it got through the chain. If it is not - it has not. But I am glad that at least your puzzle

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
That explains why I was getting back the results. Thanks! I was doing that query only to test whether stopwords are being indexed or not but apparently the query I had would not serve the purpose. I should rather have a document field with just the stop word and search against it without using wil

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Alexandre Rafalovitch
StopFilterFactory (and WordDelimiterFilterFactory and maybe others) are NOT multiterm aware. Using wildcards triggers the edge-case third type of analyzer chain that is automatically constructed unless you specify it explicitly. You can see the full list of analyzers and whether they are multiter

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
Asterisks were not for formatting, I was trying to use a wildcard operator. Here is another example query and "parsed_query toString" entry for that. Query : http://localhost:8081/solr/collection1/select?debugQuery=on&indent=on&q=Description_note:*their*&wt=json "parsedquery_toString":"Descriptio

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Erick Erickson
That's not what I'm looking for. Way down near the end there should be an entry like "parsed_query toString" This line is pretty suspicious: 82, "params":{ "q":"Description_note:* and *" Are you really searching for asterisks (I'd originally interpreted that as bolding which sometimes happens). P

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
Hi Eric, Thanks for the reply! Following is the relevant part of response header with debugQuery on. { "responseHeader":{ "status":0, "QTime":282, "params":{ "q":"Description_note:* and *", "indent":"on", "wt":"json", "debugQuery":"on", "_":"1487773835305"}}, "response":{"numFound":81771,"start":

Re: How to figure out whether stopwords are being indexed or not

2017-02-21 Thread Erick Erickson
Attach &debug=query to your query and look at the parsed query that's returned. That'll tell you what was searched at least. You can also use the TermsComponent to examine terms in a field directly. Best, Erick On Tue, Feb 21, 2017 at 2:52 PM, Pratik Patel wrote: > I have a field type in schema

How to figure out whether stopwords are being indexed or not

2017-02-21 Thread Pratik Patel
I have a field type in schema which has been applied stopwords list. I have verified that path of stopwords file is correct and it is being loaded fine in solr admin UI. When I analyse these fields using "Analysis" tab of the solr admin UI, I can see that stopwords are being filtered out. However,