On Wed, 12-Dec-2007 at 11:35AM +0100, Peter Dalgaard wrote:

|> Philippe Grosjean wrote:
|> > The problem is often a misspecification of the comment.char argument. 
|> > For read.table(), it defaults to '#'. This means that everywhere you 
|> > have a '#' char in your Excel sheet, the rest of the line is ignored. 
|> > This results in a different number of items per line.
|> >
|> > You should better use read.csv() which provides better default arguments 
|> > for your particular problem.
|> > Best,
|> >
|> >   
|> Or read.delim/read.delim2, which should be even better at TAB-separated
|> files.
|> 
|> In general, be very suspicious of read.table() with such files, not only
|> because of the '#' but also because it expects columns separated by
|> _arbitrary_ amounts of whitespace. I.e., n TABs  counts as one, so empty
|> fields are skipped over.

I don't recall that happening with TABs, but a problem can arise when
the last (rightmost) column has more than a few empty cells.
Occasionally, I've had to resort to adding a dummy column on the
right, but as Peter suggests, read.delim is usually less involved.


-- 
~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.   
   ___    Patrick Connolly   
 {~._.~}                         Great minds discuss ideas    
 _( Y )_                        Middle minds discuss events 
(:_~*~_:)                        Small minds discuss people  
 (_)-(_)                                   ..... Anon
          
~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to