On 23.12.2011 14:54, Brendan Halpin wrote:
I've been running a glmer logit on a very large data set (600k obs).

Running on a 10% subset works correctly, but for the complete data set,
R completes apparently without error, but does not display the results.
Given these jobs take about 200 hours, it's very hard to make progress
by trial and error.

I append the code and the sample and complete output. As is apparent, I
upgraded R during the complete run, but I recall testing on the
subsample with the earlier version too. I am also assuming that
upgrading R will not affect the running process -- is this true?

Err, this depends on the platform and the way you are using R.
If you change some parts of R while that is running, it may result into unexpected behaviour or a crash if R accesses the files after the upgrade, of course.

Best wishes,
Uwe Ligges




I'd be grateful for any leads. In the meantime I'll be running with
larger subsamples!

Regards,

Brendan Halpin


- code ---------------------------------------------------------------
library(arm)
library(foreign)
mlm<- read.dta("../workingdata.dta")
attach(mlm)

gender<- as.factor(stu_gend)

yr<- year - 1998
failure<- (lmer(fail ~
               1 + cao + subj1 + subj2 + subj3 + gender + yr + ageentry + 
as.factor(yrs5)
                 + modsize  + meancao + depfemr + (1|deptno) + (1|modinst)  + 
(1|ulid) ,
               na.action = na.exclude, family = binomial (link="logit")))

display(failure, digits=5, detail=TRUE)
----------------------------------------------------------------------

- output with 10% sample data ----------------------------------------
R version 2.14.0 (2011-10-31)
Copyright (C) 2011 The R Foundation for Statistical Computing
ISBN 3-900051-07-0
Platform: i486-pc-linux-gnu (32-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

   Natural language support but running in an English locale

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

library(arm)

arm (Version 1.4-13, built: 2011-6-19)
Working directory is /home/brendan/work/mlmmarks/genderECSR
library(foreign)
mlm<- read.dta("../worksample-random1.dta")
attach(mlm)

gender<- as.factor(stu_gend)

yr<- year - 1998
failure<- (lmer(fail ~
+               1 + cao + subj1 + subj2 + subj3 + gender + yr + ageentry + 
as.factor(yrs5)
+                 + modsize  + meancao + depfemr + (1|deptno) + (1|modinst)  + (1|ulid) , 
na.action = na.exclude, family = binomial (link="logit")))

display(failure, digits=5, detail=TRUE)
glmer(formula = fail ~ 1 + cao + subj1 + subj2 + subj3 + gender +
     yr + ageentry + as.factor(yrs5) + modsize + meancao + depfemr +
     (1 | deptno) + (1 | modinst) + (1 | ulid), family = binomial(link = 
"logit"),
     na.action = na.exclude)
                  coef.est  coef.se   z value   Pr(>|z|)
(Intercept)        2.63826   0.97870   2.69568   0.00702
cao               -2.08963   0.11987 -17.43314   0.00000
subj1              0.02608   0.23573   0.11064   0.91190
subj2             -0.55668   0.32759  -1.69932   0.08926
subj3             -1.57120   0.30664  -5.12400   0.00000
genderM            0.36368   0.09188   3.95845   0.00008
yr                 0.06067   0.01658   3.65996   0.00025
ageentry          -0.00720   0.04338  -0.16598   0.86817
as.factor(yrs5)1  -0.25181   0.05712  -4.40806   0.00001
as.factor(yrs5)2  -0.54725   0.07601  -7.20005   0.00000
as.factor(yrs5)3  -1.07483   0.08660 -12.41184   0.00000
as.factor(yrs5)4  -1.22447   0.14373  -8.51932   0.00000
as.factor(yrs5)5  -1.55032   0.31342  -4.94653   0.00000
modsize            0.03387   0.02533   1.33733   0.18112
meancao            1.08747   0.10748  10.11780   0.00000
depfemr           -1.49097   0.49350  -3.02122   0.00252

Error terms:
  Groups   Name        Std.Dev.
  modinst  (Intercept) 1.14308
  ulid     (Intercept) 1.54030
  deptno   (Intercept) 0.52497
  Residual             1.00000
---
number of obs: 63254, groups: modinst, 9076; ulid, 2275; deptno, 26
AIC = 30275.2, DIC = 30237.2
deviance = 30237.2

Loading required package: MASS
Loading required package: Matrix
Loading required package: lattice

Attaching package: ‘Matrix’

The following object(s) are masked from ‘package:base’:

     det

Loading required package: lme4

Attaching package: ‘lme4’

The following object(s) are masked from ‘package:stats’:

     AIC, BIC

Loading required package: R2WinBUGS
Loading required package: coda

Attaching package: ‘coda’

The following object(s) are masked from ‘package:lme4’:

     HPDinterval

Loading required package: abind
Loading required package: foreign

Attaching package: ‘arm’

The following object(s) are masked from ‘package:coda’:

     traceplot
----------------------------------------------------------------------

- output with complete data ------------------------------------------
R version 2.13.1 (2011-07-08)
Copyright (C) 2011 The R Foundation for Statistical Computing
ISBN 3-900051-07-0
Platform: i486-pc-linux-gnu (32-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

   Natural language support but running in an English locale

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

library(arm)

arm (Version 1.4-13, built: 2011-6-19)
Working directory is /home/brendan/work/mlmmarks/genderECSR
library(foreign)
mlm<- read.dta("../workingdata.dta")
attach(mlm)

gender<- as.factor(stu_gend)

yr<- year - 1998
failure<- (lmer(fail ~
+               1 + cao + subj1 + subj2 + subj3 + gender + yr + ageentry + 
as.factor(yrs5)
+                 + modsize  + meancao + depfemr + (1|deptno) + (1|modinst)  + (1|ulid) , 
na.action = na.exclude, family = binomial (link="logit")))
Loading required package: MASS
Loading required package: Matrix
Loading required package: lattice

Attaching package: ‘Matrix’

The following object(s) are masked from ‘package:base’:

     det

Loading required package: lme4

Attaching package: ‘lme4’

The following object(s) are masked from ‘package:stats’:

     AIC, BIC

Loading required package: R2WinBUGS
Loading required package: coda

Attaching package: ‘coda’

The following object(s) are masked from ‘package:lme4’:

     HPDinterval

Loading required package: abind
Loading required package: foreign

Attaching package: ‘arm’

The following object(s) are masked from ‘package:coda’:

     traceplot
----------------------------------------------------------------------


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to