Hello,

I am pretty new to R, I have always used SAS and SAS products. My
target variable is binary ('Y' and 'N') and i have about 14 predictor
variables. My goal is to compare different variable selection methods
like Forward, Backward, All possible subsests. I am using
misclassification rate to pick the winner method.

This is what i have as of now,

Reg <- glm (Graduation ~., DFtrain,family=binomial(link="logit"))
                step <- extractAIC(Reg, direction="forward")
                pred <- predict(Reg, DFtest,type="response")
                mis <- mean({pred > 0.5} != {DFtest[,"Graduation"] == "Y"})
This program actually works but I needed to check to make sure am
doing this right. Also, I am getting the same misclassification rates
for all different methods.

I also tried to use

Reg <- leaps(Graduation ~., DFtrain)
                pred <- predict(Reg, DFtest,type="response")
                mis <- mean({pred > 0.5} != {DFtest[,"Graduation"] == "Y"})
                #print(summary(mis))
which doesnt work

and

Reg <- regsubsets(Graduation ~., DFtrain)
                pred <- predict(Reg, DFtest,type="response")
                mis <- mean({pred > 0.5} != {DFtest[,"Graduation"] == "Y"})
                #print(summary(mis))

The Regsubsets will work but the 'predict' function does not work with
it. Is there any other way to do predictions when using regsubsets

Any help is appreciated.

Thanks,

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to