🎯 Focusing
- WeChat Tencent
- GuangZhou
Pinned
- deepspeedai/Megatron-DeepSpeed (Public, forked from NVIDIA/Megatron-LM): Ongoing research training transformer language models at scale, including: BERT & GPT-2
- deepspeedai/DeepSpeed (Public): DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- tensorflow/recommenders-addons (Public): Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.
- NVIDIA/Megatron-LM (Public): Ongoing research training transformer models at scale
- ISEEKYAN/mbridge (Public): Bridge Megatron-Core to Hugging Face/Reinforcement Learning