Skip to content

Conversation

ashors1
Copy link
Contributor

@ashors1 ashors1 commented Jun 17, 2025

What does this PR do ?

Adds Megatron support for SFT and DPO.

  • SFT convergence on SQuAD:
    sft_megatron

  • DPO convergence on HelpSteer-3:
    nemorl_dpo

Issues

List issues that this PR closes (syntax):

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

ashors1 added 8 commits June 17, 2025 11:33
Signed-off-by: ashors1 <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: ashors1 <ashors@nvidia.com>
…used together

Signed-off-by: ashors1 <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
@ashors1 ashors1 marked this pull request as ready for review June 17, 2025 22:53
ashors1 added 2 commits June 17, 2025 15:53
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Copy link
Contributor

@SahilJain314 SahilJain314 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

minor nits, otherwise lgtm

Signed-off-by: ashors1 <ashors@nvidia.com>
SahilJain314
SahilJain314 previously approved these changes Jun 17, 2025
@terrykong
Copy link
Contributor

@ashors1 can you add convergence to the description?

parthchadha
parthchadha previously approved these changes Jun 17, 2025
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
@SahilJain314 SahilJain314 dismissed stale reviews from parthchadha and themself via 45491a4 June 17, 2025 23:26
@SahilJain314
Copy link
Contributor

(lint check failing, ran linter -- please reapprove)

SahilJain314
SahilJain314 previously approved these changes Jun 17, 2025
@ashors1
Copy link
Contributor Author

ashors1 commented Jun 17, 2025

@ashors1 can you add convergence to the description?

Yeah, do you mean uploading convergence plots @terrykong?

@ashors1 ashors1 added the CI:L1 Run doctests, unit tests, and functional tests label Jun 17, 2025
@terrykong
Copy link
Contributor

Yea, screenshot is okay; just as a record of dtensor vs. mcore convergence at least on one model

@ashors1
Copy link
Contributor Author

ashors1 commented Jun 17, 2025

Yea, screenshot is okay; just as a record of dtensor vs. mcore convergence at least on one model

Done

@SahilJain314 SahilJain314 enabled auto-merge June 17, 2025 23:49
@SahilJain314 SahilJain314 disabled auto-merge June 17, 2025 23:55
@terrykong terrykong enabled auto-merge June 18, 2025 22:07
@terrykong terrykong added this pull request to the merge queue Jun 19, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 19, 2025
Signed-off-by: ashors1 <ashors@nvidia.com>
terrykong
terrykong previously approved these changes Jun 19, 2025
@terrykong terrykong added this pull request to the merge queue Jun 19, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 19, 2025
Signed-off-by: ashors1 <ashors@nvidia.com>
terrykong
terrykong previously approved these changes Jun 19, 2025
@terrykong terrykong added this pull request to the merge queue Jun 19, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 19, 2025
Signed-off-by: ashors1 <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
terrykong
terrykong previously approved these changes Jun 19, 2025
@terrykong terrykong added this pull request to the merge queue Jun 19, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 19, 2025
Signed-off-by: Anna Shors <ashors@nvidia.com>
@terrykong terrykong added this pull request to the merge queue Jun 20, 2025
Merged via the queue into main with commit 3d36d0a Jun 20, 2025
13 of 14 checks passed
@terrykong terrykong deleted the ashors/megatron_sft_dpo branch June 20, 2025 02:39
YzjiaoNvd pushed a commit to YzjiaoNvd/NeMo-RL that referenced this pull request Jul 14, 2025
Signed-off-by: ashors1 <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Sahil Jain <sahilj@nvidia.com>
KiddoZhu pushed a commit that referenced this pull request Jul 28, 2025
Signed-off-by: ashors1 <ashors@nvidia.com>
Signed-off-by: Anna Shors <ashors@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Sahil Jain <sahilj@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI:docs Run doctest
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants