Skip to content

Heraklion

Compare
Choose a tag to compare
@leobago leobago released this 10 Jul 15:26
· 748 commits to master since this release
bae938b

Release

This release includes the new version of differential checkpointing, a complete implementation of incremental checkpointing, full support for GPU checkpointing and full support for HDF5 checkpointing, including the option for checkpointing into a single file (N-1) and restarting with a different number of processes.

Changelog

  • New major feature allowing users to checkpoint data allocated in the GPU device memory.
  • New implementation of differential checkpointing that addresses performance issues for highly fragmented differential updates.
  • New major feature allowing users to use incremental checkpointing for CPU and GPU data by adding one by one the variables to the checkpoint file.
  • New major feature for DCPPosix allowing to recover from last non-corrupted checkpoint file.
  • New examples in the examples/GPU directory that checkpoint GPU data.
  • New major feature allowing to restart with a different number of processes using a shared HDF5 checkpoint file.
  • New unitary tests for the new features.
  • New configurable/flexible local test structure.
  • Fixed Bug of RecoverVar.
  • Fixed Bug on DCP recovery.
  • Complete and full code documentation generated with Doxygen.