cluster:190

Differences

This shows you the differences between two versions of the page.

cluster:190 [2020/01/24 13:33]
hmeij07
cluster:190 [2020/09/28 07:38]
hmeij07
Line 11: Line 11:
  
 This is a replacement for the BLCR methods we used, [[cluster:147|BLCR Checkpoint in OL3 - serial]] or [[cluster:148|BLCR Checkpoint in OL3 - parallel]]. BLCR is no longer being developed, and today's brief power outage removed the BLCR kernel module HPCC-wide, so learn DMTCP. I have not provided wrappers, but you can follow the same logic as we used with BLCR.
 +
 +Write your checkpoint files in ''/sanscratch/checkpoints/JOBPID'' so they do not count against your quota. The scheduler will not create this directory for you; you must create it in your submit job. Directories are automatically deleted once they are 120 days old.
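
Creating the per-job checkpoint directory from a submit job could look roughly like the sketch below. The helper name ''make_checkpoint_dir'' is hypothetical (it is not part of DMTCP or the scheduler), and using the shell PID as the JOBPID value is only an assumption; substitute whatever unique job identifier your scheduler provides.

```shell
#!/bin/bash
# Sketch only: make_checkpoint_dir is a hypothetical helper, not part of
# DMTCP or the scheduler.
#   $1 = checkpoint root (for example /sanscratch/checkpoints)
#   $2 = unique job identifier (the JOBPID part of the path)
make_checkpoint_dir() {
    # The scheduler does not create this directory, so create it here.
    # -p also creates missing parents and succeeds if the directory exists.
    mkdir -p "$1/$2" || return 1
    # Print the full path so the caller can capture it.
    printf '%s\n' "$1/$2"
}

# Possible use in a submit job, with the shell PID as a stand-in identifier:
# CKPT_DIR=$(make_checkpoint_dir /sanscratch/checkpoints "$$") || exit 1
```

Capturing the printed path in a variable keeps the rest of the submit job free of hard-coded paths, and stopping the job when the directory cannot be created avoids running without a place to write checkpoints.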
  
 <​code>​ <​code>​
cluster/190.txt · Last modified: 2020/09/28 07:38 by hmeij07