Can someone help me with this variable/data reading issue?
I read a CSV file and create an additional variable (called y).
The first set of commands below produces different sample statistics
for hw11$y and y.
In the second set of commands I use the variable name yy instead, and
the sample statistics for hw11$yy and yy are identical.
Using y <- yy fixed it, but I am not sure why I would need to do that.
The stray "y" appears to have come from a variable called "y" in
another data frame (unrelated to the current run).
Help!
> setwd("z:/homework")
> sink ("z:/homework/hw11.our", append=T, split=T)
> hw11 <- read.csv("ij10b.csv",header=T)
> hw11$y <- hw11$e3
> attach(hw11)
The following object(s) are masked _by_ '.GlobalEnv':
y
> (n <- dim(hw11)[1])
[1] 13765
> summary(hw11$y)
     Min.  1st Qu.   Median     Mean  3rd Qu.     Max.
   0.0000   0.4500   1.0000   1.6726   2.0000 140.0000
> length(hw11$y)
[1] 13765
> summary(y)
     Min.  1st Qu.   Median     Mean  3rd Qu.     Max.
  0.00000  0.00000  0.00000  0.24958  0.00000  1.00000
> length(y)
[1] 601
>
> setwd("z:/homework")
> sink ("z:/homework/hw11.our", append=T, split=T)
> hw11 <- read.csv("ij10b.csv",header=T)
> hw11$yy <- hw11$e3
> attach(hw11)
> hw11$yy <- hw11$e3
> summary(hw11$yy)
     Min.  1st Qu.   Median     Mean  3rd Qu.     Max.
   0.0000   0.4500   1.0000   1.6726   2.0000 140.0000
> length(hw11$yy)
[1] 13765
> summary(yy)
     Min.  1st Qu.   Median     Mean  3rd Qu.     Max.
   0.0000   0.4500   1.0000   1.6726   2.0000 140.0000
> length(yy)
[1] 13765
>
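The "masked _by_ '.GlobalEnv'" message in the first run suggests one possible explanation: a y left over in the workspace is found before the attached hw11$y. Below is a minimal sketch of that situation, not a confirmed diagnosis. The small data frame and the leftover y are made-up stand-ins (assumptions), not the contents of ij10b.csv.

```r
# Hypothetical stand-in for hw11; the real data come from ij10b.csv.
hw11 <- data.frame(e3 = c(0, 0.45, 1, 2, 140))
hw11$y <- hw11$e3

# Assumed leftover from an earlier run: an unrelated y in the workspace.
y <- c(0, 0, 1)

attach(hw11)                 # R reports that y is masked by .GlobalEnv
search()                     # ".GlobalEnv" is listed before "hw11"
n_masked <- length(y)        # 3: the workspace copy is found first
n_column <- length(hw11$y)   # 5: the data frame column

detach(hw11)
rm(y)                        # drop the stale workspace object

# with() evaluates the expression inside hw11, so no attach() is needed.
n_with <- with(hw11, length(y))   # 5
```

If this is what is happening, either removing the old y with rm(y) before attach(), or avoiding attach() and using hw11$y or with(hw11, ...), should make the two summaries agree.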
--
Steven T. Yen, Professor of Agricultural Economics
The University of Tennessee
http://web.utk.edu/~syen/