NeMo Skills

NeMo-Skills is a collection of pipelines to improve "skills" of large language models (LLMs). We support everything needed for LLM development, from synthetic data generation, to model training, to evaluation on a wide range of benchmarks. Start developing on a local workstation and move to a large-scale Slurm cluster with just a one-line change.

Here are some of the features we support:

News

[Figure: Evaluation Results with pass@1]

[Figure: Evaluation Results with GenSelect]

  • [04/23/2025]: We released the OpenMathReasoning dataset and models!

    • The OpenMathReasoning dataset has 306K unique mathematical problems sourced from AoPS forums with:
      • 3.2M long chain-of-thought (CoT) solutions
      • 1.7M long tool-integrated reasoning (TIR) solutions
      • 566K samples that select the most promising solution out of many candidates (GenSelect)
    • OpenMath-Nemotron models are SoTA open-weight models on math reasoning benchmarks at the time of release!
  • [10/03/2024]: We released the OpenMathInstruct-2 dataset and models!

    • OpenMathInstruct-2 is a math instruction-tuning dataset with 14M problem-solution pairs generated using the Llama3.1-405B-Instruct model.
    • OpenMath-2-Llama models show significant improvements compared to their Llama3.1-Instruct counterparts.
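For readers unfamiliar with pass@1, the metric named in the evaluation results above, here is a minimal sketch of the commonly used unbiased pass@k estimator. It is an illustration only: the function names `pass_at_k` and `mean_pass_at_k` are hypothetical, and nothing here is taken from the NeMo-Skills codebase, which may compute the metric differently.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n sampled solutions, c of them correct.

    Illustrative helper (not a NeMo-Skills API). For k=1 this reduces to c / n.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: every size-k subset contains a correct one.
        return 1.0
    # 1 minus the probability that k samples drawn without replacement are all incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[list[bool]], k: int = 1) -> float:
    """Average pass@k over problems; results[i] holds per-sample correctness flags."""
    scores = [pass_at_k(len(flags), sum(flags), k) for flags in results]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Two hypothetical problems with four sampled solutions each.
    demo = [[True, False, False, False], [True, True, True, True]]
    print(mean_pass_at_k(demo, k=1))
```

With k=1 the estimate is simply the fraction of sampled solutions that are correct, averaged over problems.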

Getting started

To get started, follow these steps, browse the available pipelines, or run ns --help to see all available commands and their options.

You can find more examples of how to use NeMo-Skills in the tutorials page.

We've built and released many popular models and datasets using NeMo-Skills. See all of them in the Papers & Releases documentation.

You can find the full documentation here.

Contributing

We welcome contributions to NeMo-Skills! Please see our Contributing Guidelines for more information on how to get involved.

Disclaimer: This project is strictly for research purposes and is not an official product from NVIDIA.
