Differences
This shows you the differences between two versions of the page.
Both sides previous revision
Previous revision
|
Next revision
Both sides next revision
|
cluster:198 [2020/12/02 15:17] hmeij07 [GPU checkpoint/restart] |
cluster:198 [2020/12/02 15:19] hmeij07 [GPU checkpoint/restart] |
* [[https://arxiv.org/pdf/2008.10596.pdf|CRAC: Checkpoint-Restart Architecture for CUDA with Streamsand UVM]] | * [[https://arxiv.org/pdf/2008.10596.pdf|CRAC: Checkpoint-Restart Architecture for CUDA with Streamsand UVM]] |
| |
* git site (notice early development): https://github.com/DMTCP-CRAC/CRAC-early-development | * git site (notice __early development__): https://github.com/DMTCP-CRAC/CRAC-early-development |
| |
Some quotes from the article ... //CRAC provides the ability to save and restore the state of CUDA by first using CUDA-specific save/restore operations, and then delegating to a traditional checkpoint-restart package.... In the end, the support of DMTCP for process virtualization and plugins makes it easier to add modular support for CUDA without having to excessively understand details of the internals of the host checkpointing package.// | Some quotes from the article ... //CRAC provides the ability to save and restore the state of CUDA by first using CUDA-specific save/restore operations, and then delegating to a traditional checkpoint-restart package.... In the end, the support of DMTCP for process virtualization and plugins makes it easier to add modular support for CUDA without having to excessively understand details of the internals of the host checkpointing package.// |