Skip to content

Conversation

coryMosaicML
Copy link
Contributor

@coryMosaicML coryMosaicML commented Feb 28, 2023

This PR adds backwards compatibility for loading checkpoints saved with the EMA algorithm from composer 0.12.1 and earlier.

Here is a plot showing a checkpoint saved on composer 0.12.1 resuming training on this PR branch.
Screen Shot 2023-02-27 at 10 56 55 PM

@mvpatel2000
Copy link
Contributor

Can you please add screenshots showing you tested resuming an old checkpoint from v12 on 1 GPU?

@coryMosaicML
Copy link
Contributor Author

coryMosaicML commented Feb 28, 2023

Can you please add screenshots showing you tested resuming an old checkpoint from v12 on 1 GPU?

Added one at the top.

Copy link
Contributor

@mvpatel2000 mvpatel2000 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for the hotfix

@mvpatel2000 mvpatel2000 merged commit f4af779 into mosaicml:dev Mar 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants