Re: [R] Problems with normality req. for ANOVA

David Winsemius Mon, 02 Aug 2010 12:35:51 -0700

In a general situation of observational studies, your point isundoubtedly true, and apparently you believe it to be true even in thesetting of designed experiments. Perhaps I should have confined myselfto my first sentence.


--
David.



On Aug 2, 2010, at 2:05 PM, Bert Gunter wrote:

David et. al:
I take issue with this. It is the lack of independence that is themajor issue. In particular, clustering, split-plotting, and so forthdue to "convenience order" experimentation, lack of randomization,exogenous effects like the systematic effects due to measurementmethod/location have the major effect on inducing bias anddistorting inference. Normality and unequal variances typically paleto insignificance compared to this.
Obviously, IMHO.
Note 1: George Box noted this at least 50 years ago in the early'60's when he and Jenkins developed arima modeling.
Note 2: If you can, have a look at Jack Youden's classic paper"Enduring Values", which comments to some extent on these issues,here: http://www.jstor.org/pss/1266913
Cheers,
Bert


Bert Gunter
Genentech Nonclinical Biostatistics
On Mon, Aug 2, 2010 at 10:32 AM, David Winsemius <dwinsem...@comcast.net> wrote:
On Aug 2, 2010, at 9:33 AM, wwreith wrote:
I am conducting an experiment with four independent variables eachof whichhas three or more factor levels. The sample size is quite large i.e.severalthousand. The dependent variable data does not pass a normality testbut"visually" looks close to normal so is there a way to compute theaffectthis would have on the p-value for ANOVA or is there a way toperform annonparametric test in R that will handle this many independentvariables.Simply saying ANOVA is robust to small departures from normality isnot
going to be good enough for my client.
The statistical assumption of normality for linear models do notapply to the distribution of the dependent variable, but rather tothe residuals after a model is estimated. Furthermore, it is thehomoskedasticity assumption that is more commonly violated and alsogreater threat to validity. (And if you don't already know both ofthese points, then you desperately need to review your basicmodeling practices.)
 I need to compute an error amount for
ANOVA or find a nonparametric equivalent.
You might get a better answer if you expressed the first part ofthat question in unambiguous terminology. What is "error amount"?
For the second part, there is an entire Task View on RobustStatistical Methods.
--

David Winsemius, MD
West Hartford, CT


David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Problems with normality req. for ANOVA

Reply via email to