Breaking & deprecation changes since v0.3 #943
Replies: 4 comments
-
config(sft_trainer.yaml) change the default value to make them same as PPO (trainer.default_local_dir, trainer.default_hdfs_dir): #1451 |
Beta Was this translation helpful? Give feedback.
-
utils
|
Beta Was this translation helpful? Give feedback.
-
entropy_coeff now defaults to 0 #1770 |
Beta Was this translation helpful? Give feedback.
-
Breaking changes since v0.4 recorded in #1902 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This is a change log specifically focused on breaking changes so that the community is aware.
class interfaces
SFTDataset interface
RLHFDataset interface
FSDPSFTTrainer:
config
If you come across usage such as
+{config}={value}
that throw an error:These breakings are usually caused by consolidating a config term into the yaml file. Please try removing the
+
to fix such errors.Here are the changed configs:
trainer.val_before_train
: [BREAKING config] fix: move val_before_train to config yaml. Using trainer.val_before_train instead of +trainer.val_before_train going forward #820sft_trainer.yaml
)data.{prompt,response}_dict_keys
: [ci] feat: improve CI speed #1032Beta Was this translation helpful? Give feedback.
All reactions