Skip to content

Conversation

dwadden
Copy link
Contributor

@dwadden dwadden commented Aug 16, 2024

Use hf_olmo instead of old_style and transformers for new_style. Add docstring explaining the difference and default to using hf_olmo, which is standard for internal development.

I've run this and it seems to work fine.

Use `hf_olmo` instead of `old_style` and `transformers` for `new_style`.
Add docstring explaining the difference and default to using `hf_olmo`,
which is standard for internal development.
@dwadden dwadden requested a review from 2015aroras August 16, 2024 19:17
Copy link
Collaborator

@2015aroras 2015aroras left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@@ -111,7 +124,7 @@ def s3_unshard_to_hf(args):

# Convert to HF.
print("Converting to HF.")
if args.old_style_hf:
if args.checkpoint_style == "hf_olmo":
# Convert to old-style checkpoint.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: update the comment

@dwadden dwadden merged commit e5217cf into main Aug 16, 2024
10 of 11 checks passed
@dwadden dwadden deleted the dave/checkpoint_style_naming branch August 16, 2024 21:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants