Re: [R] multivariate, hierarchical model

Omnia Abdulrazeg Thu, 02 May 2013 08:47:36 -0700

in addition, I get this error whenever I want to plot the predicted values
plot(augPred(mod2))Error in sprintf(gettext(fmt, domain = domain), ...) :   
invalid type of argument[1]: 'symbol'

From: om...@hotmail.com
To: r-help@r-project.org
Subject: [R] multivariate, hierarchical model
Date: Thu, 2 May 2013 10:26:57 +0000

Sorry for the last email, sent too early. 
I have a small data set that has a hierarchical structure. It has both temporal 
(year, months) and spatial (treatment code and zone code). The following 
explains the data:

WSZ_Code                  the
water supply zone code (1 to 8)

Treatment_Code         the
treatment plant which supplies each water supply zone (1 to 4)           

Year                             year
of sampling (1996 - 2000)

Month                         month
of the year of sampling, 1=January

TTHM                          the
total trihalomethane concentration (ìg/L) in the tap water sample 

CL2_Free                     concentration
of free chlorine (mg/L) in the water sample - indicates the level of the
disinfecting chlorine dose which has not reacted with organic matter in the
water.

BrO3                            concentration
of bromate (?g/L) in the water sample. BrO3 is formed during certain types of
water treatment. This variable contains some missing values due to information
not being recorded for the sample - these are denoted by "NA".

Colour                          measure
of colour of the water sample - this is one possible indicator of the level of
organic matter in the water. Units = Hazen.

pH                               pH
of the water sample.

Turbidity         measure of "cloudiness" of
the water sample - caused by particles suspended in the water. Units = FTU

The aim of the analysis is to produce exposure estimates for TTHM. I has split 
the months into seasons as there appears to be a seasonality trend. I've also 
carried out multiple imputation for any missing values. I've done a correlation 
analysis on all the variables and observed which ones are well correlated with 
TTHM. It appears that CL2_Free is the most significantly correlated so i've 
decided to include that in the final model. Positively skewed
I'm thinking of doing a mixed effects model with random intercepts as the 
treatment code and zones within the treatment cose and random slopes as the 
seasons.

mod2 <- lme(tthm ~ cl2free, random= ~ seasons| treatcode/loc_code)
but that doesn't work. 
these seems to work good:

mod2 <- lme(tthm ~ cl2free, random= ~ 1| loc_code, data=new.data, method="ML")

mod3 <- lme(tthm ~ cl2free, random= ~ 1| treatcode/loc_code, data=new.data, 
method ="ML")
mod2 has a lower AIC, so it appears better.
Should I group (merge) treatment code and location code?
Also, cl2free has a very positively skewed distribution, should I transform it? 
I want to incorporate seasons, possibly as a random slope, but R doesn't seem 
to like it.
Help greatly appreciated (and sorry for long email).
Omnia                                                                           

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] multivariate, hierarchical model

Reply via email to