Hi R-help,
    I am interested in comparing two vectors of data
observations to see if they come from the same distrubution (and have
settled on the Kolmogorov-Smirnov test to do this)..

I'd prefer to use all my data points, but computationally speaking,
this is proving to be troublesome due to the size of my vectors (the
larger of the two is about 90 million observations).  I suppose I
could take a smaller sample of points from that large vector to use as
input in my ks-test, but I want to see if I can avoid doing that, in
favor of including all of the data..

Code:
> result <- ks.test(rep(1:940,100000),rep(1:940,800))
Error: cannot allocate vector of size 358.6 Mb

Any ideas?

OS: Windows 7 64-bit, R ver. 2.10.1, Memory: 4 gb

Best,
Jonathan



Best,
Jonathan

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to