Hello,

I am working on a Coursera assignment and have combined 332 files into a
single data frame called "dat". The dataframe has 4 columns,

1. Date
2. Sulfate (numerical values)
3. Nitrate  (numerical  values)
4. ID # (numerical values).

Our assignment is to write a function pollutantmean <- function(directory,
pollutant, ID). whereby we can calculate the mean by inputting the
pollutant name and ID #.

I have reached the stage of subsetting the date e.g. by ID # 1-10, however
when I do so and then calculate the mean of this subset I receive the NaN
message (even though I have instructed R to disregard the "NA"'s).


*Beginning Code: *
getwd()
read.csv(specdata)
specdata <- ("C:/Users/rober/specdata")
list.files(specdata)
files_full <- list.files(specdata, full.names=TRUE)
files_full
dat <- data.frame()
for (i in 1:332){
  dat <- rbind(dat,read.csv(files_full[i]))
}
str(dat)
mean(dat$sulfate, na.rm=TRUE)

*Code which generated the NaN message. *
dat1_10 <- dat[which(dat[,ID] ==1:10),]
mean(dat1_10$sulfate, na.rm=TRUE)

Am I making a mistake in subsetting the rows with ID's 1:10? Any advice
would be appreciated.

Thank you,

Robert

Robert J. Piliero

Cell: (617) 283 1020
38 Linnaean St. #6
Cambridge, MA, 02138
USA

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to