Hi Teague, The StopFilterFactory (which I think you're using) by default uses lang/stopwords_en.txt (which wouldn't be empty if you check). What you're looking at is the stopword.txt. You could either empty that file out or change the field type for your field.
On Mon, Jul 14, 2014 at 12:53 PM, Teague James <teag...@insystechinc.com> wrote: > Hello all, > > I am working with Solr 4.9.0 and am searching for phrases that contain words > like "of" or "to" that Solr seems to be ignoring at index time. Here's what > I tried: > > curl http://localhost/solr/update?commit=true -H "Content-Type: text/xml" > --data-binary '<add><doc><field name="id">100</field><field > name="content">blah blah blah knowledge of science blah blah > blah</field></doc></add>' > > Then, using a broswer: > > http://localhost/solr/collection1/select?q="knowledge+of+science"&fq=id:100 > > I get zero hits. Search for "knowledge" or "science" and I'll get hits. > "knowledge of" or "of science" and I get zero hits. I don't want to use > proximity if I can avoid it, as this may introduce too many undesireable > results. Stopwords.txt is blank, yet clearly Solr is ignoring "of" and "to" > and possibly more words that I have not discovered through testing yet. Is > there some other configuration file that contains these small words? Is > there any way to force Solr to pay attention to them and not drop them from > the phrase? Any advice is appreciated! Thanks! > > -Teague > > -- Anshum Gupta http://www.anshumgupta.net