On 07/09/15 10:22, Olu Ola via R-help wrote:
Hello, I am currently working with a dataframe which has some missing
values represented by "NA". whenever, I add two columns in which at
least one of the pair of an observation is "NA", the sum returns
zero. That is for the same observation, if
dataframe$A = 20 dataframe$B = NA
dataframe$A + dataframe$B returns zero.
No it does not. It returns NA. As it should.
I do not want to delete the observations with the NA's. How do I go
about carrying out the necessary operations without deleting the
observations with the NA's.
Your question seems to demonstrate a substantial amount of confusion.
Amongst other things you probably want to deal with vectors (or perhaps
matrices) rather than data frames.
To sum a numeric vector, ignoring missing values, you can use the sum()
function, setting the argument "na.rm" to TRUE. E.g.
v <- c(1,NA,2,NA,3,NA,4,NA)
sum(v,na.rm=TRUE) # Gives 10.
Ignore other advice that you were given, to replace NAs in your data
frame (???) by zeroes. That is very dangerous, misleading and
confusing. "Missing" and "zero" are *VERY* different concepts.
cheers,
Rolf Turner
--
Technical Editor ANZJS
Department of Statistics
University of Auckland
Phone: +64-9-373-7599 ext. 88276
______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.