On 10/12/2007 1:16 PM, D. R. Evans wrote:
> The standard chisq.test() and fisher.test() functions, when applied to
> two distributions (to determine whether the same underlying
> distribution applies to both) requires one to pre-bin the
> distributions.
> 
> Is there a library function (either built-in or in a package) that
> acts more like the ks.test() function, in that one can simply pass the
> two distributions and have it do the necessary binning as well as the
> actual statistical test?
> 
> (Yes, you can accuse me of laziness: I just don't fancy trying to
> figure out a routine that would make sure that there more than 5
> samples in each of the expected bins before applying the chi-squared
> test. It seems too much like re-inventing an elementary wheel that
> must have been invented by someone else.)

If you have a quantile function q() for the distribution, a sample size 
of N, and want expected counts of 5 in each bin, just calculate the 
cutpoints as

nbins <- floor(N/5)
cutpoints <- c(-Inf, q( (1:(nbins-1)/nbins)), Inf)

Duncan Murdoch

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to