[R] How to checkpoint-restart R jobs in batch mode?

Mizanur Khondoker Tue, 14 Oct 2008 07:15:06 -0700

Dear list,

Most high performance computing clusters/grid engines  have some
restrictions on how long a job can be run in batch mode.
The cluster I am using has maximum of 48 hours limit, but my job would take
far more than that.


I know that it is possible to checkpoint jobs without modifying the code if
some specialized software (e.g., BLCR ) is installed on the grid engine.

However, I am looking for a solution when this kind of facility is not
available on the cluster,  for example , by modifying the code so that the
job can checkpoint and restart by itself.

Does anyone have any  experience or idea of doing so? Any help would be
greatly appreciated.

-- 
Mizanur Khondoker
Division of Pathway Medicine (DPM)
The University of Edinburgh Medical School
The Chancellor's Building
49 Little France Crescent
Edinburgh EH16 4SB
United Kingdom

Tel:  +44 (0) 131 242 6287
Fax: +44 (0) 131 242 6244
http://www.pathwaymedicine.ed.ac.uk/

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] How to checkpoint-restart R jobs in batch mode?

Reply via email to