(11/03/27 22:49), KH wrote:
(11/03/25 22:40), Nick Sabbe wrote:
2. Which model, I mean lasso or elastic net, should be selected? and
why? Both models chose the same variables but different coefficient values.
You may want to read 'the elements of statistical learning' to find some
info on the advantages of ridge/lasso/elnet compared. Lasso should work fine
in this relatively low-dimensional setting, although it depends on the
correlation structure of your covariates.
I should have used vif from car package for logistic model.
library(car)
test3 <- glm(y ~ x1+x2+x3+x4+x5+x6+x7+x8+x9+x10+x11+x12+x13+x14+x15,
family="binomial", data=MyData)
vif(test3)
x1 x2 x3 x4 x5 x6 x7 x8 x9
x10 x11 x12 x13 x14 x15
1.339349 1.477299 1.292232 1.309631 1.375251
1.192694 1.763012 2.358474 1.755591 1.281404
1.229909 1.353517 1.304637 1.486188 1.428996
Anyway, multicollinearity is unlikely to be a problem.
KH
I also checked correlation structure of my covariates.
test<- lm(y ~ x15std)
library(DAAG)
vif(test)
x15std1 x15std2 x15std3 x15std4 x15std5 x15std6 x15std7 x15std8
x15std9 x15std10 x15std11 x15std12 x15std13 x15std14
1.2299 1.2880 1.1011 1.1559 1.3033 1.0774 1.5369 1.9604
1.4664 1.1754 1.1396 1.2683 1.1685 1.1667
x15std15
1.5534
Variance inflation are less than 5 suggesting that multicollinearity is
unlikely to be a problem.
Therefore, Lasso model should be selected?
Thanks a lot in advance,
KH
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.