Hi all,

 

I have a vector of proportions (post_op_prw) such that

 

 >summary(amb$post_op_prw)

 

   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 

 0.0000  0.0000  0.0000  0.3985  0.9134  0.9962  1.0000 

 

> summary(cut2(amb$post_op_prw,0.0001))

 

[0.0000,0.0001) [0.0001,0.9962]            NA's 

           1904            1672                                      1

 

I want to use post_op_prw as a predictor variable in an OLS model.  I
decided to fit it using a restricted cubic spline.  But, I'm seeing
behavior I don't understand.  See below:

 

> rcspline.eval(amb$post_op_prw,nk = 3, knots.only = T)

[1] 0.0000000 0.6147927 0.9092937 0.9667178

Warning message:

In rcspline.eval(amb$post_op_prw, nk = 3, knots.only = T) :

  could not obtain 3 knots with default algorithm.

 Used alternate algorithm to obtain 4 knots

> rcspline.eval(amb$post_op_prw,nk = 4, knots.only = T)

[1] 0.0000000 0.8476793 0.9783558

> rcspline.eval(amb$post_op_prw,nk = 5, knots.only = T)

[1] 0.0000000 0.9012711 0.9783558

 

Why are the 4 and 5 knot spline requests returning a spline with 3
knots?  I get the best model results using rcs(amb$post_op_prw,3).   I'm
kind of new to using splines.  Does the fact that observations are
clustered at the ends make the spline fit questionable?  

 

Thanks,

 

Rachel Hayes


        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to