Shane:  Just to add some practical advice on top of Rich's interpretation
of the censoring process (which is correct), my recent experiences with
analyzing below-detection limit chemical concentrations in water using
left-censoring estimators indicates that historically people have not
always indicated the below-detection limit data in a consistent fashion.
 So often people would include either a < with the detection limit value
(e.g, <5), possibly also negative values (e.g., -5, negatives could be used
as a flag for censoring if the real measurement scale is strictly positive
as it is for chemical concentrations, but it has risky potential side
effects), and possibly a separate column variable like your Censored
TRUE/FALSE.  I'm wondering if it is possible that your data format has a
mix of these approaches.  It seems like most recent statistical software
for censoring usually wants a column variable for the measured response
(Y), where observations of Y that are below (or above) detection-limit
(censored) have the detection limit (censoring) value; and then a second
column like your Censored column (either with TRUE/FALSE or 0/1 indicators).

Brian

Brian S. Cade, PhD

U. S. Geological Survey
Fort Collins Science Center
2150 Centre Ave., Bldg. C
Fort Collins, CO  80526-8818

email:  ca...@usgs.gov <brian_c...@usgs.gov>
tel:  970 226-9326



On Thu, Mar 14, 2013 at 9:20 AM, Rich Shepard <rshep...@appl-ecosys.com>wrote:

> On Thu, 14 Mar 2013, Shane Carey wrote:
>
>  Thanks for your reply. My data frame contains the value and a true/false
>> to indicate whether they are censored or not.
>>
>> So I have something like:
>>
>> Data          Censored
>> -1.2             TRUE
>> -5.5             TRUE
>> 5                 FALSE
>>
>
>  These negative values are actual values so do these have to be made
>> absolute?
>>
>
> Shane,
>
>   If your data represent chemical concentrations of some sort then, yes,
> the data must all be positive. You cannot have a chemical concentration
> less
> than zero. If the negative values are actual, then what is the reporting
> level? And what do they measure that is really a censored value?
>
>   The concept of censored data, whether right or left censored, is that
> there is no way of knowing the actual numeric value. In right-censored
> survival analyses (e.g., medical trials) the death of an individual is
> recorded. But, when the study ends for whatever reason, there are still
> subjects alive and there's no way of knowing how long after the end of the
> study they die. Ergo, their age at death is unknown or censored.
>
>   With left censored data such as chemical constituent concentrations in
> air, water, or some other medium, there is a concentration below which the
> instruments cannot distinguish it from noise. All we know is that the
> constituent is present but its concentration is somewhere between zero and
> the detection/reporting limit. Therefore, having a number that is below
> this
> detection/reporting limit is meaningless, and it cannot be negative. That's
> why it is flagged as being censored. The cenmle() function assumes these
> conditions to be true.
>
>   Please keep this thread on the mail list so others can participate and
> learn from the conversation.
>
> Rich
>
> ______________________________**________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/**listinfo/r-help<https://stat.ethz.ch/mailman/listinfo/r-help>
> PLEASE do read the posting guide http://www.R-project.org/**
> posting-guide.html <http://www.R-project.org/posting-guide.html>
> and provide commented, minimal, self-contained, reproducible code.
>

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to