Skip to content

Conversation

ananyahjha93
Copy link
Contributor

Fixes #664 .

@ananyahjha93 ananyahjha93 requested a review from epwalsh July 17, 2024 08:36
@ananyahjha93 ananyahjha93 merged commit d627c94 into main Jul 17, 2024
10 of 12 checks passed
@ananyahjha93 ananyahjha93 deleted the ddp-ckpt-fix branch July 17, 2024 21:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

DDP training tries to save sharded checkpoint on the last step
3 participants