John Malone wrote:
Hi!

I'm trying to implement an outlier test once/row in a large dataframe.
Ideally, I'd do this then add the Pvalue results and the number flagged as
an outlier as two new separate columns to the dataframe.  Grubbs outlier
test requires a vector and I'm confused how to make each row of my dataframe
a vector, followed by doing a Grubbs test for each row containing the vector
of numbers I want to perform the outlier test on.

I'm new to R and no doubt this is a simple problem. Any help you might
provide would be greatly appreciated.

Many thanks in advance!!

        [[alternative HTML version deleted]]


John - you would be making a strong normality assumption. You might reject H0 using Grubbs' test just because of non-normality, or you might fail to reject it just because of non-normality. Is it really this straitforward to declare something an outlier? What does outlier really mean?

The following is must reading.

@Article{fin06cal,
  author =               {Finney, David J.},
title = {Calibration guidelines challenge outlier practices},
  journal =      The American Statistician,
  year =                 2006,
  volume =               60,
  pages =                {309-313},
  annote =               {anticoagulant
therapy;bias;causation;ethics;objectivity;outliers;guidelines for
treatment of outliers;overview of types of outliers;letter to the editor and reply 61:187 May 2007}


--
Frank E Harrell Jr   Professor and Chair           School of Medicine
                     Department of Biostatistics   Vanderbilt University

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to