Closed
Labels
stale: No activity in 60 days on issue or PR
Description
I'm looking for a way to convert model weights between Hugging Face and Megatron-LM, in both directions:
(1) Continual pretraining in Megatron-LM, starting from pretrained Hugging Face weights
(2) Converting Megatron-LM model weights to the Hugging Face format
It shouldn't be too difficult to adjust the layer names and weights, but I'm hoping someone has already done this.
Related: #3 (already closed, but I couldn't find a solution there)
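To make the "adjust layer names" part concrete, here is a rough sketch of the renaming step only. The key names and the `HF_TO_MEGATRON_RULES` table are hypothetical placeholders, not the real names used by either library; the actual mapping depends on the model architecture and on the library versions. Renaming alone may also not be enough, since some checkpoints store attention weights fused or in a different layout, which this sketch does not handle.

```python
import re

# Hypothetical (pattern, replacement) rules, for illustration only.
# Real key names depend on the architecture and library versions.
HF_TO_MEGATRON_RULES = [
    (r"^model\.embed_tokens\.", "embedding.word_embeddings."),
    (r"^model\.layers\.(\d+)\.mlp\.", r"encoder.layers.\1.mlp."),
    (r"^model\.norm\.", "encoder.final_layernorm."),
]


def rename_keys(state_dict, rules):
    """Return a new dict whose keys are rewritten by the first matching rule.

    Keys that match no rule are copied unchanged. Values (the tensors)
    are passed through untouched.
    """
    renamed = {}
    for key, value in state_dict.items():
        new_key = key
        for pattern, replacement in rules:
            new_key, count = re.subn(pattern, replacement, key)
            if count:
                break  # stop at the first rule that applies
        if new_key in renamed:
            raise KeyError(f"two source keys map to {new_key!r}")
        renamed[new_key] = value
    return renamed
```

The same function would cover the reverse direction with an inverted rule table. Since it only touches keys, it should work on a state dict loaded with `torch.load` as well as on a plain dict.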