On Thu, Apr 1, 2010 at 7:06 AM, MitchK <mitc...@web.de> wrote:

>
> It seems like that the ressource-loader has got a little bug. The first
> line
> of a file you want to load with the getLine()-method of RessourceLoader [1]
> has to be outcommented by "#". If not, the first line seems to be ignored
> or
> something like that.
>
>
Some applications (such as Windows Notepad), insert a UTF-8 Byte Order Mark
(BOM) as the first character of the file. So, perhaps the first word in your
stopwords list contains a UTF-8 BOM and thats why you are seeing this
behavior.

If you look at the file with "more" and the first character appears to be
, then you can confirm thats the problem.
-- 
Robert Muir
rcm...@gmail.com

Reply via email to