Skip to content

Conversation

yuki-97
Copy link
Contributor

@yuki-97 yuki-97 commented Jun 24, 2025

Under some conditions, ray stops respecting non-kwargs arguments.

This PR supports passing args/kwargs inrun_all_workers_multiple_data and run_all_workers_sharded_data, so that we can pass through kwargs instead of passing an unclear param data.

@yuki-97 yuki-97 added the CI:L1 Run doctests, unit tests, and functional tests label Jun 24, 2025
@yuki-97 yuki-97 added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jun 24, 2025
@yuki-97 yuki-97 force-pushed the yukih/worker-group-args branch from 0da5ef1 to eef6d04 Compare June 24, 2025 11:08
@yuki-97 yuki-97 requested a review from SahilJain314 June 24, 2025 11:09
Copy link
Contributor

@SahilJain314 SahilJain314 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How should we handle the fact that passing args will fail under certain circumstances? Can we assert fail the args or remove them entirely for now? (until we figure out what's going on)? I don't want people to run into this as a sharp edge.

yuki-97 added 3 commits June 30, 2025 07:29
…ers_sharded_data

Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
@yuki-97 yuki-97 force-pushed the yukih/worker-group-args branch from eef6d04 to 1eab584 Compare June 30, 2025 08:29
@yuki-97 yuki-97 added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jun 30, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
@yuki-97 yuki-97 force-pushed the yukih/worker-group-args branch from 1eab584 to 347d02c Compare June 30, 2025 08:31
@yuki-97 yuki-97 added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jun 30, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
@yuki-97 yuki-97 added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jun 30, 2025
@yuki-97
Copy link
Contributor Author

yuki-97 commented Jun 30, 2025

How should we handle the fact that passing args will fail under certain circumstances? Can we assert fail the args or remove them entirely for now? (until we figure out what's going on)? I don't want people to run into this as a sharp edge.

@SahilJain314 Added assert. Also add an issue #582 to trace ray stops respecting non-kwargs arguments.

Copy link
Contributor

@SahilJain314 SahilJain314 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

assert fail LGTM for now. We can run the CI to make sure all instances are covered.

@SahilJain314 SahilJain314 added this pull request to the merge queue Jun 30, 2025
Merged via the queue into main with commit 2d79359 Jul 1, 2025
20 of 23 checks passed
@SahilJain314 SahilJain314 deleted the yukih/worker-group-args branch July 1, 2025 00:36
xxman-google pushed a commit to xxman-google/NeMo-RL that referenced this pull request Jul 2, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
therealnaveenkamal pushed a commit to therealnaveenkamal/RL that referenced this pull request Jul 7, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
YzjiaoNvd pushed a commit to YzjiaoNvd/NeMo-RL that referenced this pull request Jul 14, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
KiddoZhu pushed a commit that referenced this pull request Jul 28, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI:L1 Run doctests, unit tests, and functional tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants