known issues in v0.4 & breaking changes after v0.4 #1902
eric-haibin-lin
announced in
Announcements
Replies: 1 comment
-
SFT trainer enables gradient checkpointing by default #1889
Summary
known issues in v0.4.0 release
PPO
vf_loss factor #2016
SFT
Breaking changes after v0.4 (main branch)
Megatron
SFT
PPO ray trainer
simple_timer in [perf] feat: Add verl profiling support from Nvidia Nsight System #1820. We noticed many people import it although it's a private function.
Checkpoint manager
checkpoint_config as the keyword to replace checkpoint_contents
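For downstream code that still passes the old keyword, a small shim on the caller side can ease the rename. The sketch below is only an illustration: migrate_checkpoint_kwargs is a hypothetical helper, not part of verl, and it assumes the value can be carried over unchanged, which may not hold if the new keyword expects a different type.

```python
import warnings


def migrate_checkpoint_kwargs(kwargs: dict) -> dict:
    """Return a copy of kwargs with checkpoint_contents renamed to checkpoint_config.

    Hypothetical caller-side helper; not part of verl. It only renames the
    keyword and does not convert the value.
    """
    migrated = dict(kwargs)
    if "checkpoint_contents" in migrated:
        if "checkpoint_config" in migrated:
            # Both keywords present is ambiguous, so refuse instead of guessing.
            raise TypeError("pass only checkpoint_config, not both keywords")
        warnings.warn(
            "checkpoint_contents is replaced by checkpoint_config",
            DeprecationWarning,
            stacklevel=2,
        )
        migrated["checkpoint_config"] = migrated.pop("checkpoint_contents")
    return migrated
```

Calling the helper on a kwargs dict before forwarding it keeps old call sites working while emitting a DeprecationWarning that points at the caller.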
Deprecations after v0.4 (main branch)