Re: [R] Prediction intervals (i.e. not CI of the fit) for monotonic loess curve using bootstrapping

Jan Stanstrup Wed, 13 Aug 2014 08:08:21 -0700

Thanks to all of you for your suggestions and comments. I reallyappreciate it.


Some comments to Dennis' comments:

1) I am not concerned about predicting outside the original range. Thatwould be nonsense anyway considering the physical phenomenon I ammodeling. I am, however, concerned that the bootstrapping leads toextremely wide CIs at the extremes of the range when there are few datapoints. But I guess there is not much I can do about that as long as Irely on bootstrapping?

2) I have made a function that does the interpolation to the requestednew x's from the original modeling data to get the residual variance andthe model variance. Then it interpolates the combined SDs back the thenew x values. See below.

3) I understand that. For this project it is not that important that thefinal prediction intervals are super accurate. But I need to hit theballpark. I am only trying to do something that doesn't crosslyunderestimate the prediction error and doesn't make statisticians loosetheir lunch a first glance.I also cannot avoid that my data contains erroneous values and I willneed to build many models unsupervised. But the fit should be goodenough that I plan to eliminate values outside some multiple of theprediction interval and then re-calculate. And if the model is not goodin any range I will throw it out completely.

Based on the formula of my last message I have made a function that atleast gives less optimistic intervals than what I could get with theother methods I have tried. The function and example data can be foundherehttps://github.com/stanstrup/retpred_shiny/blob/master/retdb_admin/make_predictions_CI_tests.Rin case anymore has any comments, suggestions or expletives to myimplementation.



----------------------
Jan Stanstrup
Postdoc

Metabolomics
Food Quality and Nutrition
Fondazione Edmund Mach



On 08/12/2014 05:40 PM, Bert Gunter wrote:

PI's of what? -- future individual values or mean values?

I assume quantreg provides quantiles for the latter, not the former.
(See ?predict.lm for a terse explanation of the difference). Both are
obtainable from bootstrapping but the details depend on what you are
prepared to assume. Consult references or your local statistician for
help if needed.

-- Bert

Bert Gunter
Genentech Nonclinical Biostatistics
(650) 467-7374

"Data is not information. Information is not knowledge. And knowledge
is certainly not wisdom."
Clifford Stoll




On Tue, Aug 12, 2014 at 8:20 AM, David Winsemius <dwinsem...@comcast.net> wrote:

On Aug 12, 2014, at 12:23 AM, Jan Stanstrup wrote:

Hi,

I am trying to find a way to estimate prediction intervals (PI) for a monotonic 
loess curve using bootstrapping.

At the moment my approach is to use the boot function from the boot package to 
bootstrap my loess model, which consist of loess + monoproc from the monoproc 
package (to force the fit to be monotonic which gives me much improved results 
with my particular data). The output from the monoproc package is simply the 
fitted y values at each x-value.
I then use boot.ci (again from the boot package) to get confidence intervals. The problem 
is that this gives me confidence intervals (CI) for the "fit" (is there a 
proper way to specify this?) and not a prediction interval. The interval is thus way too 
optimistic to give me an idea of the confidence interval of a predicted value.

For linear models predict.lm can give PI instead of CI by setting interval = 
"prediction". Further discussion of that here:
http://stats.stackexchange.com/questions/82603/understanding-the-confidence-band-from-a-polynomial-regression
http://stats.stackexchange.com/questions/44860/how-to-prediction-intervals-for-linear-regression-via-bootstrapping.

However I don't see a way to do that for boot.ci. Does there exist a way to get 
PIs after bootstrapping? If some sample code is required I am more than happy 
to supply it but I thought the question was general enough to be understandable 
without it.

Why not use the quantreg package to estimate the quantiles of interest to you? 
That way you would not be depending on Normal theory assumptions which you 
apparently don't trust. I've used it with the `cobs` function from the package 
of the same name to implement the monotonic constraint. I think there is a 
worked example in the quantreg package, but since I bought Koenker's book, I 
may be remembering from there.
--

David Winsemius
Alameda, CA, USA

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Prediction intervals (i.e. not CI of the fit) for monotonic loess curve using bootstrapping

Reply via email to