Hi all, I'm running into some computer issues when trying to run a binomial model for spatially correlated data using glmmPQL and was wondering if anyone could help me out. My whole dataset consists of about 300,000 points for which I have a suite of environmental variables (I'm trying to come up with a habitat model for a species of seal, using real (presence) and simulated dives (absence) as my response variable). Since my dataset is so large, I split it into thirds and ran the model without spatial correlation. However, when checking my results, I did get a spatial correlation in the residuals, which I'm trying to incorporate using a variogram (spherical). The problem is that when I run it for my 1/3 of my data (about 70k points), the calculatations go on forever. I was running the first model for over a week and was still not getting past the first iteration.
This is the model I used M1.f.Spa <- glmmPQL(presence ~ sst + tmax100 + tbot + sss + ssu + tmax100d + sbot, random = ~1|fPTT, family = binomial, correlation = corSpher(c(91323.53,0.4279603), form =~ x+y, nugget = TRUE), data = sample.df1) This week, I tried a different approach and split the 70k subset into smaller datasets of 10,000 points, and now the model runs much faster (just a couple of hours tops), Yet the output of these models changes with regards to the significance of one variable, which makes me think that I need a larger dataset than 10,000 points. Does someone have a suggestion on how to improve/run this code with the spatial correlation for a larger dataset than 10,000 which wouldn't take weeks to run? I'm working with an Intel Core i7, 12 Gb RAM (plus a couple of 100 Gbs in virtual memory), in Windows 7 64-bit. Thanks for your help, ---------------------------------------- Luis A. Huckstadt, Ph.D. Department of Ecology and Evolutionary Biology University of California Santa Cruz Long Marine Lab 100 Shaffer Road Santa Cruz, CA 95060 [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.