Re: [R] Measure Difference Between Two Distributions

Lorenzo Isella Sat, 25 Sep 2010 06:55:04 -0700

On 09/25/2010 03:23 PM, Rainer M Krug wrote:


Evaluate, for me, does not necessary mean "test if they are
significantly different", but rather to quantify the difference. If that
is what you are looking for, you could look at the "Earth Movers
Distance", where a package is available at R-forge
(https://r-forge.r-project.org/projects/earthmovdist/) which I co-wrote
and used before.

Cheers,

Rainer

Thanks Rainer. I had a quick look at wikipedia and the package youmention, and it seems what I am looking for.Just a question about normalization of the distance calculated by thealgorithm.Let us say that I have 4 distributions A,B,C,D coupled this way (A,B)and (C,D).The length of data in A is equal to the length of data in B, sameapplies to C and D but length(A)!=length(C).Now, the argument I would like to make is that A and B are more similarthan C and D and show a couple of numbers to prove this.Bottom line: provided my data lists are long enough, does this distancescale with the number of data? and if they do, how should I normalizethis distance to compare the results?

Cheers

Lorenzo

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Measure Difference Between Two Distributions

Reply via email to