On 17/12/2009, at 11:14 AM, whitaker m. (mw1006) wrote:
I have a much larger dataset than in my original email (attached -
price dependent upon weight, Clarity (different levels IF-SI2),
colour(levels D-L) and Cut (ideal-fair), and tried the regression
command:
diamond.lm<-lm(price~weight+IF+VVS1+VVS2+VS1+VS2+SI1+SI2+I1+I2+D+E
+F+G+H+I+J+K+L+ideal+excellent+very.good+good+fair,
data="Diamonds2.txt")
Error in eval(predvars, data, env) : invalid 'envir' argument
Which lead to the error message below the command. I have tried
searching for this, and assumed this was down to having categrocial
variables within the data, is this a correct assumption or am i
doing something else wrong? Apologies if this is a bit of a basic
question!
(a) You don't want the quote marks around the data argument. That is
the source
of the "invalid 'envir' argument" error.
(b) You are not using the power of R. ***Don't*** create your own
dummy variables;
let lm() do it for you. Learn something about how R works, for
crying out loud.
Essentially you should be doing something like
diamond.lm <- lm(price ~ weight + Clarity + colour + Cut, data =
Diamond.txt)
where price, weight, Clarity, colour, and Cut are columns of the data
frame
Diamond.txt. The columns price and weight should be numeric vectors;
Clarity,
colour, and Cut should be ***factors***.
It is slightly worrying that you refer to ``Diamond.txt''. That
``.txt'' suffix
would lead me to believe that ``Diamond.txt'' is a (text) file
containing your
data. If that is the case, this won't work. The ``data'' argument
to lm() must
be an ***R object***. You have to read the data file into an R
object before trying
to use the data in a call to lm(). Something like
Diamond <- read.table("Diamond.txt") # Note that you ***do*** want
to quote the file name.
Then
diamond.lm <- lm(price ~ weight + Clarity + colour + Cut, data =
Diamond)
should do what you want. The dummy variable encoding used will be
determined
by the (first) value of options()$contrasts, which by default i
contr.treatment.
Read up on factors and contrasts.
cheers,
Rolf Turner
######################################################################
Attention:\ This e-mail message is privileged and confid...{{dropped:9}}
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.