Re: [R] Wilcoxon signed rank test and its requirements

Thomas Lumley Fri, 25 Jun 2010 12:05:09 -0700

On Thu, 24 Jun 2010, Atte Tenkanen wrote:

On Jun 23, 2010, at 9:58 PM, Atte Tenkanen wrote:

Thanks. What I have had to ask is that

how do you test that the data is symmetric enough?
If it is not, is it ok to use some data transformation?

when it is said:

"The Wilcoxon signed rank test does not assume that the data are
sampled from a Gaussian distribution. However it does assume that
the data are distributed symmetrically around the median. If the
distribution is asymmetrical, the P value will not tell you much
about whether the median is different than the hypothetical value."


You are being misled. Simply finding a statement on a statistics
software website, even one as reputable as Graphpad (???), does not
mean that it is necessarily true. My understanding (confirmed
reviewing "Nonparametric statistical methods for complete and censored

data" by M. M. Desu, Damaraju Raghavarao, is that the Wilcoxon signed-

rank test does not require that the underlying distributions be
symmetric. The above quotation is highly inaccurate.

--
David.


Thanks. Unfortunately, I can't follow the reference at all, but I read this in 
that way that I can be carefree as far as the underlying distribution is 
concerned?

Is there any other authoritative reference where that is just stated in a way "test 
does not require that the underlying distributions be   symmetric or normal".


The statement from GraphPad is correct, but for a different question.  Let me 
expound.

First let us consider means:

If you have paired samples X1.. Xn and Y1..Yn you could ask if the mean of X is 
equal to the mean of Y, or if the mean of (X-Y) is zero.   These are equivalent 
questions, because of the way the mean is defined.   So the paired t-test, 
which answers the first question, and the one-sample t-test, which answers the 
second question, are equivalent.  They have no assumptions (other than 
sufficient sample size for the means to be Normally distributed).


Now, let us consider medians.
f you have paired samples X1.. Xn and Y1..Yn you could ask if the median of X 
is equal to the median of Y, or if the median of (X-Y) is zero.  The first 
question can be answered by any standard test (though there are ways to do it). 
 The second is answered by the sign test.  They are not at all equivalent: it 
is possible for the median of X to be larger than the median of Y but the 
median of (X-Y) to be negative.   The non-equivalence is true for essentially 
all statistics except for the mean.

Now, let us consider the Wilcoxon signed-rank test.
This can be characterized precisely as a test of the null hypothesis that the 
median pairwise mean of  X-Y is zero. That is, take all n(n-1)/2 pairs of 
(X-Y)s.  Take the mean of each pair to get n(n-1)/2 pairwise means. Take the 
median of these numbers.  The p-value will be 0.5 one-sided or 1.0 two-sided 
when this median pairwise mean is exactly zero.  The median pairwise mean is 
also sometimes known as the Hodges-Lehmann estimator (though this is strictly 
speaking a more general term).

As David correctly points out, no assumptions are needed for the Wilcoxon signed-rank 
test to be a test of *this* null hypothesis.   The problem is that this may not be the 
null hypothesis you care about.  As GraphPad correctly points out, "the P value will 
not tell you much about whether the *median* is different than the hypothetical 
value" because the median is not the same as the median pairwise mean.  It is 
entirely possible for the median difference to be positive and the median pairwise mean 
difference to be zero or negative.

If you assume that the distribution of differences X-Y is symmetric, then the 
Wilcoxon signed-rank test also tests the null hypothesis that the median of X-Y 
is zero (and that the mean of X-Y is zero), because these null hypotheses are 
equivalent for a symmetric distribution.  That's what GraphPad is saying

You could also assume that the distributions X and Y are stochastically 
ordered.  This basically implies that the direction of difference is the same 
no matter what location statistic you use to measure it. If X was before some 
intervention and Y was afterwards you would basically be assuming that the 
intervention is either beneficial for everyone or harmful for everyone (up to 
measurement error). Under this assumption, the signed rank test also tells you 
reliably about differences in medians.

To some extent this is a philosophical issue.  My preference is to know exactly 
what a test is doing and to make these distinctions.   Many other people, 
including reputable experts like Frank Harrell, believe (I think) that 
simplifying assumptions such as stochastic ordering are a pretty good 
approximation in a lot of situations, so it isn't necessary to always make 
these distinctions.


     -thomas

Thomas Lumley                   Assoc. Professor, Biostatistics
tlum...@u.washington.edu        University of Washington, Seattle

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Wilcoxon signed rank test and its requirements

Reply via email to