They apparently moved it .. it's here now:
http://doc.rero.ch/lm.php?url=1000,43,4,20091218142456-GY/Dolamic_Ljiljana_-_When_Stopword_Lists_Make_the_Difference_20091218.pdf
--------------------------------------------------
From: "Glen Newton" <glen.new...@gmail.com>
Sent: Wednesday, March 17, 2010 11:13 AM
To: <solr-user@lucene.apache.org>
Subject: Re: Stopwords
That discussion cites a paper via a URL:
http://doc.rero.ch/lm.php?url#16;00,43,4,20091218142456-GY/Dolamic_Ljiljana__When_Stopword_Lists_Make_the_Difference_20091218.pdf
Unfortunately when I go to this URL I get:
"L'accès à ce document est limité."
But I tracked down the paper. Here is its reference (which may require
a subscription: sorry):
US: http://dx.doi.org/10.1002/asi.21186
AU: Ljiljana Dolamic
AU: Jacques Savoy
TI: When stopword lists make the difference
SO: Journal of the American Society for Information Science and Technology
VL: 61
NO: 1
PG: 200-203
YR: 2010
CP: © 2009 ASIS&T
ON: 1532-2890
PN: 1532-2882
AD: Computer Science Department, University of Neuchâtel, 2009
Neuchâtel, Switzerland
DOI: 10.1002/asi.21186
-Glen
On 17 March 2010 06:02, Ahmet Arslan <iori...@yahoo.com> wrote:
I was reading "Scaling Lucen and Solr"
(http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr/)
and I came across the section StopWords.
In there it mentioned that its not recommended to remove
stop words at index
time. Why is this the case? Don't all the extraneous
stopwords bloat the
index and lead to less relevant results? Can someone please
explain this to
me. Thanks
There were a discussion about stopwords (remove them, not to remove them,
or index them with CommonGramsFilterFactory) and good references in this
thread.
http://search-lucene.com/m/QvJtF1mIPP22/When+Stopword+Lists+Make+the+Difference
--
-