ZhiyuLi-Nvidia
Contributor

What does this PR do?

Addresses the sliding_window issue hit when reproducing the DeepScaleR experiments with deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B. Because the bug comes from upstream Hugging Face (hf) code, this PR overwrites the sliding_window setting during hf model initialization.
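As a rough illustration of the overwrite idea, here is a minimal sketch, not the code from this PR. It assumes the problem is a config that disables sliding-window attention (`use_sliding_window` is false) while still carrying a `sliding_window` size, and it clears that size. The helper name `sliding_window_overrides` and the example values are hypothetical.

```python
from types import SimpleNamespace
from typing import Any


def sliding_window_overrides(config: Any) -> dict[str, Any]:
    """Return config overrides that clear an unused sliding window size.

    Hypothetical helper (not taken from the PR): if the config says
    sliding-window attention is disabled but still carries a window size,
    return an override that sets the size to None.
    """
    overrides: dict[str, Any] = {}
    window_disabled = getattr(config, "use_sliding_window", None) is False
    window_size_set = getattr(config, "sliding_window", None) is not None
    if window_disabled and window_size_set:
        overrides["sliding_window"] = None
    return overrides


# Stand-in for a loaded Hugging Face config; the values are assumptions.
example_config = SimpleNamespace(use_sliding_window=False, sliding_window=4096)
overrides = sliding_window_overrides(example_config)
print(overrides)  # {'sliding_window': None}
```

With transformers, a dict like this could be passed as keyword arguments to `AutoModelForCausalLM.from_pretrained(model_name, **overrides)`, since keyword arguments that match config attributes are applied as config overrides. Whether that matches how this PR wires it in is an assumption.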

Issues

List issues that this PR closes (syntax):

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
@ZhiyuLi-Nvidia changed the title from "sliding_window_overwrite" to "bug: sliding_window_overwrite" May 7, 2025
@ZhiyuLi-Nvidia changed the title from "bug: sliding_window_overwrite" to "fix: sliding_window_overwrite" May 7, 2025
ZhiyuLi-Nvidia and others added 5 commits May 7, 2025 13:47
Co-authored-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
Signed-off-by: ZhiyuLi-Nvidia <zhiyul@NVIDIA.com>
Co-authored-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
Signed-off-by: ZhiyuLi-Nvidia <zhiyul@NVIDIA.com>
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
Signed-off-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
@parthchadha added this pull request to the merge queue May 8, 2025
@github-merge-queue bot removed this pull request from the merge queue due to failed status checks May 9, 2025
@parthchadha added this pull request to the merge queue May 9, 2025
@terrykong mentioned this pull request May 9, 2025
3 tasks
Merged via the queue into main with commit 35a0e09 May 9, 2025
21 checks passed
@parthchadha deleted the zhiyul/sliding_window_fix branch May 9, 2025 17:52
YzjiaoNvd pushed a commit to YzjiaoNvd/NeMo-RL that referenced this pull request Jun 10, 2025
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
Signed-off-by: ZhiyuLi-Nvidia <zhiyul@NVIDIA.com>
Signed-off-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
Co-authored-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
4 participants