-
Notifications
You must be signed in to change notification settings - Fork 652
Add OLMo April + July 2024 Checkpoints & Config #547
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
configs/official/OLMo-1.7-7B.yaml
Outdated
paths: | ||
######### NON WEB DATA ######### | ||
# ~> GUTENBERG BOOKS (5.256 GT) | ||
- s3://ai2-llm/preprocessed/olmo-mix/v1_6-decontaminated/books/gpt-neox-olmo-dolma-v1_5/part-0-00000.npy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
https
data paths not yet available
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
@2015aroras - do you think that for 1.7 we should have two configs? I understand that training was stopped, config changed, then resumed. |
Hi, thank you so much for the work! Is there an estimated date of when all the details including the https links of the training data and the data order files are going to be released? |
No description provided.