Noah Silverman wrote:
Frank,
That makes sense.
I just had a look at the actual algorithm calculating the Brier score.
One thing that confuses me is how the score is calculated.
If I understand the code correctly, it is just: sum((p - y)^2)/n
If I have an example with a label of 1 and a probability prediction of
.4, it is (.4 - 1)^2
(I know it is the average of these values across all the examples.)
Yes, and I seem to remember the original score is 1 minus that.
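For concreteness, a minimal R sketch of that calculation; p and y below are made-up example vectors, not anything from the thread:

p <- c(0.4, 0.9, 0.2, 0.7)   # hypothetical predicted probabilities
y <- c(1,   1,   0,   1)     # hypothetical observed 0/1 labels
brier <- mean((p - y)^2)     # same as sum((p - y)^2)/n; the 0.4 vs. 1 case contributes (0.4 - 1)^2 = 0.36
brier
1 - brier                    # the "1 minus" variant mentioned above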
Wouldn't it make more sense to stratify the probabilities and then check
the accuracy of each level?
The stratification will bring a great deal of noise into the problem.
Better: loess calibration curves or decomposition of the Brier score
into discrimination and calibration components (which is not in the
software).
Frank
i.e., for predicted probabilities of .10 to .20, the data was actually
labeled true 18% of the time (mean(label) = 0.18 within that bin).
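A rough R sketch of both ideas follows, on simulated, purely hypothetical data: the binned check described above, plus a lowess fit standing in for a loess calibration curve.

set.seed(1)
p <- runif(500)                              # hypothetical predicted probabilities
y <- rbinom(500, 1, p)                       # simulated 0/1 labels, well calibrated by construction

# (1) Stratified/binned check: observed proportion of 1's per probability bin
bins <- cut(p, breaks = seq(0, 1, by = 0.1), include.lowest = TRUE)
cbind(predicted = tapply(p, bins, mean),
      observed  = tapply(y, bins, mean),
      n         = as.vector(table(bins)))

# (2) Smooth calibration curve, typically less noisy than coarse bins
plot(lowess(p, y, iter = 0), type = "l",
     xlab = "Predicted probability", ylab = "Observed proportion")
abline(0, 1, lty = 2)                        # line of perfect calibration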
On 8/19/09 11:51 AM, Frank E Harrell Jr wrote:
Noah Silverman wrote:
Thanks for the suggestion.
You explained that the Brier score combines both accuracy and discrimination
ability. If I understand you right, that is in relation to binary
classification.
I'm not concerned with binary classification, but the accuracy of the
probability predictions.
Is there some kind of score that measures just the accuracy?
Thanks!
-N
The Brier score has nothing to do with classification. It is a
probability accuracy score.
Frank
On 8/19/09 10:42 AM, Frank E Harrell Jr wrote:
Noah Silverman wrote:
Hello,
I'm working on a model to predict probabilities.
I don't really care about binary prediction accuracy.
I do really care about the accuracy of my probability predictions.
Frank was nice enough to point me to the val.prob function from the
Design library. It looks very promising for my needs.
I've put together some tests and run the val.prob analysis. It
produces some very informative graphs along with a bunch of
performance measures.
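For reference, a sketch of that kind of call on simulated data; the Design package was later superseded by rms, val.prob() exists in both, and all object names here are hypothetical:

library(Design)                  # or library(rms) on newer installations
set.seed(2)
phat <- runif(300)               # hypothetical predicted probabilities
y    <- rbinom(300, 1, phat)     # hypothetical observed 0/1 outcomes
stats <- val.prob(phat, y)       # draws the calibration plot and returns a named vector of measures
round(stats, 3)
stats["Brier"]                   # a single index (if named "Brier" in the output) to rank models by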
Unfortunately, I'm not sure which measure, if any, is the "best"
one. I'm comparing hundreds of different models/parameter
combinations/etc. So ideally I'd like a single value or two as the
"performance measure" for each one. That way I can pick the
"best" model from all my experiments.
As mentioned above, I'm mainly interested in the accuracy of my
probability predictions.
Does anyone have an opinion about which measure I should look at??
(I see Dxy, C, R2, D, U, Brier, Emax, Eavg, etc.)
Thanks!!
-N
It all depends on the goal, i.e., the relative value you place on
absolute accuracy vs. discrimination ability. The Brier score
combines both and, other than interpretability, has many advantages.
Frank
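One standard way to see how it combines both is Murphy's decomposition, which splits the Brier score into a reliability (calibration) term and a resolution (discrimination) term plus the outcome uncertainty. A hand-rolled sketch on hypothetical data; this is not a val.prob feature, and the identity is only exact when predictions are constant within each bin:

brier_decomp <- function(p, y, breaks = seq(0, 1, by = 0.1)) {
  bins <- cut(p, breaks = breaks, include.lowest = TRUE)
  n    <- length(y)
  nk   <- tapply(y, bins, length)        # observations per bin
  fk   <- tapply(p, bins, mean)          # mean forecast per bin
  ok   <- tapply(y, bins, mean)          # observed frequency per bin
  obar <- mean(y)
  keep <- !is.na(nk)                     # drop empty bins
  rel  <- sum(nk[keep] * (fk[keep] - ok[keep])^2) / n   # reliability: calibration
  res  <- sum(nk[keep] * (ok[keep] - obar)^2) / n       # resolution: discrimination
  unc  <- obar * (1 - obar)                             # outcome uncertainty
  c(Brier = mean((p - y)^2), reliability = rel, resolution = res,
    uncertainty = unc,
    binned.sum  = rel - res + unc)       # approximately equals Brier with continuous predictions
}

set.seed(3)
p <- runif(1000)                         # hypothetical predicted probabilities
y <- rbinom(1000, 1, p)                  # hypothetical 0/1 outcomes
round(brier_decomp(p, y), 4)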
--
Frank E Harrell Jr
Professor and Chair, Department of Biostatistics
Vanderbilt University School of Medicine