Re: Strange Behavior When Using CSVRequestHandler

2010-01-07 Thread Erick Erickson
It puzzles me too. I don't know the internals of that code well enough to speculate, but once you're into undefined behavior, I have great faith in *many* inexplicable things happening. Erick On Thu, Jan 7, 2010 at 9:45 AM, danben wrote: > > Erick - thanks very much, all of this makes sense

Re: Strange Behavior When Using CSVRequestHandler

2010-01-07 Thread danben
Erick - thanks very much, all of this makes sense. But the one thing I still find puzzling is the fact that re-adding the file a second, third, fourth etc time causes numDocs to increase, and ALWAYS by the same amount (141,645). Any ideas as to what could cause that? Dan Erick Erickson wrote:

Re: Strange Behavior When Using CSVRequestHandler

2010-01-06 Thread Erick Erickson
I think the root of your problem is that unique fields should NOT be multivalued. See http://wiki.apache.org/solr/FieldOptionsByUseCase?highlight=(unique)|(key) In this case, since you're tokenizing, your "query" field is