On Thu, 4 Feb 2010, Luciano La Sala wrote:

Dear R crew:

I think I am in the right mailing list. I have a very simple dataset consisting 
of two variables: cestode intensity and chick size (defined as CAPI). Intensity 
is clearly overdispersed, with way too many zeroes. I'm interested in looking 
at the association between these two variables, i.e. how well does chick size 
predict tape intensity?

I fit a zero inflated negat. binomial model using the "pscl" package.

I built my model as follows and got the output below.

model <- zeroinfl(Int_Cesto ~ CAPI, dist = "negbin", EM = TRUE)
model

Call:
zeroinfl(formula = Int_Cesto ~ CAPI, dist = "negbin", EM = TRUE)

Count model coefficients (negbin with log link):
(Intercept)         CAPI
  -2.99182      0.06817
Theta = 0.4528

Zero-inflation model coefficients (binomial with logit link):
(Intercept)         CAPI
   12.1364      -0.1572

summary(model)

Call:
zeroinfl(formula = Int_Cesto ~ CAPI, dist = "negbin", EM = TRUE)

Pearson residuals:
    Min       1Q   Median       3Q      Max
-0.62751 -0.38842 -0.21303 -0.06899  7.29566

Count model coefficients (negbin with log link):
           Estimate Std. Error z value Pr(>|z|)
(Intercept) -2.99182    3.39555  -0.881   0.3783
CAPI         0.06817    0.04098   1.664   0.0962 .
Log(theta)  -0.79222    0.45031  -1.759   0.0785 .

Zero-inflation model coefficients (binomial with logit link):
           Estimate Std. Error z value Pr(>|z|)
(Intercept) 12.13636    3.71918   3.263  0.00110 **
CAPI        -0.15720    0.04989  -3.151  0.00163 **
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Theta = 0.4528
Number of iterations in BFGS optimization: 1
Log-likelihood: -140.2 on 5 Df

QUESTIONS

1. Is my model adequately specified?

Hard to say from only the output. But given that you only have these two variables it seems like a natural model. You could also compare it with the corresponding hurdle() model.

In both models, you can look at observed and expected frequencies for the counts 0, 1, 2, etc.

2. CAPI is included in block 1 of output containing negative binomial regression coefficients the variable, and in block 2 corresponding to the inflation model. Does this make sense? If so...

As I said above: It seems natural to me but I don't have any background knowledge in this application.

3. How should one interprete these results?

count_CAPI: The mean "Int_Cesto" seems to increase slightly with CAPI but
  not very much.
count_theta: There is clear overdispersion (compared to Poisson).
zero_CAPI: The probability for an inflated zero "Int_Cesto" decreases clearly with CAPI.

See
  vignette("countreg", package = "pscl")
for a hands-on introduction to Poisson and negative binomial models with and without excess zeros.

hth,
Z


Thanks in advance!
LFLS


     Yahoo! Cocina

Encontra las mejores recetas con Yahoo! Cocina.


http://ar.mujer.yahoo.com/cocina/

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to