thanks for the insights. comedic levity included... :) running the job twice is likely going to be our solution. it's painful when you have multiple people running multiple jobs, in that it wastes resources, but such is life.
i was intrigued by Joe's suggestion of snapshot'ing kvm instances. i might look into that as an academic exercise. i knew you could pause/snapshot/resume an instance, but i've never tried to resume a saved-off snapshot, only restart one. if one could resume a snapshot and have the computation pick up exactly where it was paused, that might be nifty.
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf