I cannot reproduce this on our x86_64 Fedora systems (and I tried all the usual tricks such as gctorture and valgrind to provoke a problem). And I have fitted much larger GLMs many times over the last decade, so your 'bug summary' cannot be the whole story.

Your example is random and you haven't set a seed: to eliminate that there is something specific about the data you tried can you set one and tell us which failed.

One possibility is a compiler optimization bug, so can you please tell us what compilers were used with what flags to build this version of R, and if you built it yourself try it without optimization. (The machines I used had GCC 4.3.2 and 4.4.1 with CFLAGS="-g -O3 -Wall -pedantic -mtune=core2" FFLAGS="-g -O -mtune=core2": higher levels of optimization have known problems with recent x86_64 versions of gfortran, and I am wondering if that is an underlying issue.)


On Thu, 17 Dec 2009, adr...@maths.uwa.edu.au wrote:

Bug summary:
            glm() causes a segfault if the argument 'data'
            is a data frame with more than 16384 rows.

Bug demonstration:

-------input ---------------
     N <- 16400
     df <- data.frame(x=runif(N, min=1,max=2),y=rpois(N, 2))
     glm(y ~ x, family=poisson, data=df)

------ output ---------------
     *** caught segfault ***
     address (nil), cause 'unknown'

     Traceback:
1: ifelse(y == 0, 1, y/mu)
2: dev.resids(y, mu, weights)
3: glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,     mustart = mustart, offset = offset, family = family,
control = control,     intercept = attr(mt, "intercept") > 0)
4: glm(y ~ x, family = poisson, data = df)

--------------------------------

The code generates a segfault if the value of 'N' is greater than 16384.

regards
Adrian Baddeley

////////////////////////////////////////////////////////////

--please do not edit the information below--

Version:
platform = x86_64-unknown-linux-gnu
arch = x86_64
os = linux-gnu
system = x86_64, linux-gnu
status =
major = 2
minor = 10.1
year = 2009
month = 12
day = 14
svn rev = 50720
language = R
version.string = R version 2.10.1 (2009-12-14)

Locale:
LC_CTYPE=en_AU.UTF-8;LC_NUMERIC=C;LC_TIME=en_AU.UTF-8;LC_COLLATE=en_AU.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_AU.UTF-8;LC_PAPER=en_AU.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_AU.UTF-8;LC_IDENTIFICATION=C

Search Path:
.GlobalEnv, package:stats, package:graphics, package:grDevices,
package:utils, package:datasets, package:methods, Autoloads, package:base

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


--
Brian D. Ripley,                  rip...@stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

Reply via email to