On 3/3/08 8:21 PM, Ericka Lundström wrote:
 > I'm trying to emigrate from SPSS to R, thou I have some problems whit
 > getting R to distinguish between the different kind of missing.
...
 > Is there a smart way in R to differentiate between missing and valid
 > and at the same time treat both the categories within missing and
 > valid as answers (like SPSS did above)


The Hmisc package has some support for special missing values, for 
instance when reading in SAS datasets using sas.get. I don't believe 
spss.get offers the same facility, though.

You can define special missing values for a variable manually, which 
might seem a bit involved, but this could easily be automated. For your 
example, try:

special <- dataFrame$TWO %in% c("?","X")
attr(dataFrame$TWO, "special.miss") <-
     list(codes=as.character(dataFrame$TWO[special]),
     obs=(1:length(dataFrame$TWO))[special])
class(dataFrame$TWO) <- c("factor", "special.miss")
is.na(dataFrame$TWO) <- special

# Then describe gives new percentages

describe(dataFrame$TWO)
dataFrame$TWO
       n missing       ?       X  unique
       3       4       2       2       2

No (2, 67%), yes (1, 33%)

HTH,
James
-- 
James Reilly
Department of Statistics, University of Auckland
Private Bag 92019, Auckland, New Zealand

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to