I've completed an experiment and want to summarize the results.

There are two things I'd like to create.

1) A simple count of things from the data.frame with predictions:
    1a) Number of predictions with probability greater than x
    1b) Number of predictions with probability greater than x that are really true

    In SQL, this would be:
        "Select count(predictions) from data.frame where probability > x"
        "Select count(predictions) from data.frame where probability > x and label = 'T'"

How can I do this one in R?
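
To make the question concrete, here is a minimal sketch of the two counts in base R. The data frame `preds`, its column names `probability` and `label`, the example values, and the threshold are all made-up assumptions for illustration, not the real experiment data.

```r
# Hypothetical example data; column names and values are assumptions.
preds <- data.frame(
  probability = c(0.95, 0.85, 0.72, 0.91, 0.40),
  label       = c("T", "F", "T", "T", "F")
)
x <- 0.8  # threshold, chosen arbitrarily for the example

# 1a) number of predictions with probability greater than x
n_above <- sum(preds$probability > x)

# 1b) number of those predictions that are really true
n_above_true <- sum(preds$probability > x & preds$label == "T")

print(c(n_above = n_above, n_above_true = n_above_true))
```

Summing a logical vector counts its TRUE entries, which plays the role of count(...) with a where clause.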


2) I'd like to create what we call "binning": a simple table of probability ranges showing how accurate our model is within each range. The idea is to see how "true" our probabilities are.
For example:

range      number of items   mean(probability)   true_accuracy
100-90%    20                .924                .90
90-80%     50                .825                .84
80-70%     214               .75                 .71
etc...

It would be really great if I could also graph this!
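
As a rough sketch of the kind of thing I have in mind, the binning could probably be done in base R with cut() and tapply(), and graphed with plot(). Everything below is an assumption for illustration: the data frame `preds`, the columns `probability` and `label`, the simulated data, and the choice of ten equal-width bins.

```r
# Simulated data standing in for the real predictions (assumption).
set.seed(1)
n <- 1000
preds <- data.frame(probability = runif(n))
preds$label <- ifelse(runif(n) < preds$probability, "T", "F")

# Assign each prediction to one of ten equal-width probability bins.
preds$bin <- cut(preds$probability,
                 breaks = seq(0, 1, by = 0.1),
                 include.lowest = TRUE)

# One row per bin: item count, mean predicted probability,
# and the observed fraction of true labels.
binned <- data.frame(
  range            = levels(preds$bin),
  n                = as.vector(table(preds$bin)),
  mean_probability = as.vector(tapply(preds$probability, preds$bin, mean)),
  true_accuracy    = as.vector(tapply(preds$label == "T", preds$bin, mean))
)
print(binned)

# Graph: observed accuracy against mean predicted probability per bin.
# Points near the dashed diagonal mean the probabilities are "true".
plot(binned$mean_probability, binned$true_accuracy,
     xlim = c(0, 1), ylim = c(0, 1),
     xlab = "mean(probability)", ylab = "true_accuracy")
abline(0, 1, lty = 2)
```

A bin that contains no predictions would show NA in the last two columns.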

Is there a package or some other way to do this in R?

Thanks!

-N

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
