Re: [R] Bootstrap or Wilcoxons' test?

David Winsemius Sat, 14 Feb 2009 09:48:38 -0800


On Feb 14, 2009, at 3:23 AM, Thomas Lumley wrote:

On Fri, 13 Feb 2009, David Winsemius wrote:
I must disagree with both this general characterization of theWilcoxon test and with the specific example offered. First, weought to spell the author's correctly and then clarify that it isthe Wilcoxon rank-sum test that is being considered. Next, the WRStest is a test for differences in the location parameter ofindependent samples conditional on the samples having been drawnfrom the same distribution. The WRS test would have nodiscriminatory power for samples drawn from the same distributionhaving equal location parameters but only different with respect tounequal dispersion. Look at the formula, for Pete's sake. Itsummarizes differences in ranking, so it is in fact designed NOT tobe sensitive to the spread of the values in the sample. It wouldhave no power, for instance, to test the variances of two samples,both with a mean of 0, and one having a variance of 1 with theother having a variance of 3. One can think of the WRS as a testfor unequal medians.
One can, and it may be helpful to do so, as long as one knows itisn't actually true. Unfortunately, some text books claim orstrongly imply it is true.

Yes. I have been corrected on that point before, which was why a chosethe words I did. Doing a Google search on "derivation wilcoxon rank-sum test", the first hit is to a text "Introductory Biostatistics" byLe that is an example of such a text ... and many others further downthe hit list.

To make the test consistent for differences in the median you haveto know in advance that the distributions differ only by a locationshift, and then it is also consistent for differences in mean (or inany other location parameter).

That is a typical assumption in the derivation of samplingdistributions of the WRS W-statistic, is it not?

Troendle's article in Statistics and Medicine 18, 2763-2773 (1999)(would only be available to subscribers and libraries):

http://www3.interscience.wiley.com.online.uchc.edu/journal/66002289/abstract

An interesting on-line accessible discussion by O'Brien and Castellanoe:
http://www.amstat.org/sections/SRMS/Proceedings/y2005/Files/JSM2005-000930.pdf

Googling also brought up a Univ Of Minn website that has r scriptsillustrating permutation tests (including WRS) from Hollander andWolfe and a page for the WRS:


http://www.stat.umn.edu/geyer/old/5601/examp/perm.html

http://www.stat.umn.edu/geyer/5601/examp/ranksum.html#test

Also, the operating characteristics aren't particularly similar to areal test for medians, which has pretty low efficiency at the Normallocation-shift model (2/pi, IIRC) and is much more sensitive to tiesin the data.

My memory from Conover and Iman (only having seen the first edition)was that the Pittman efficiency of the WRS in the Gaussian case ofunequal means was around 85% relative to the t-test. I suppose thechoice of a central measure for reporting ought to be based on thepurposes of investigation. If one is planning classification, and thedistributions were skewed, then the median might be preferable becauseit is less subject to sampling effects:


> var( apply( sapply(1:500, function(x) rlnorm(20)), 2, median))
[1] 0.08123678
>
>
> var( apply( sapply(1:500, function(x) rlnorm(20)), 2, mean))
[1] 0.2168887

Thank you for the clarification.

--
David Winsemius

And I could go on and on about non-transitivity, but I won't. Anyonewho is interested can Google for 'Efron dice'.
      -thomas


Thomas Lumley                   Assoc. Professor, Biostatistics
tlum...@u.washington.edu        University of Washington, Seattle


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Bootstrap or Wilcoxons' test?

Reply via email to