refactor with block for sampling parameter update #2

zhaochenyang20 · 2025-05-24T23:34:52Z

Checklist Before Starting

Search for similar PR(s).

What does this PR do?

I updated several comments and the `with` block usage. One question is left here:

        # The format of the model weights to be loaded.
        # TODO(chenyang): why we set `load_format` to `dummy`?

        # "auto" will try to load the weights in the safetensors format and
        # fall back to the pytorch bin format if safetensors format is not
        # available.
        # "pt" will load the weights in the pytorch bin format.
        # "safetensors" will load the weights in the safetensors format.
        # "dummy" will initialize the weights with random values, which is
        # mainly for profiling.
        # "bitsandbytes" will load the weights using bitsandbytes quantization.
        # "npcache" will load the weights in pytorch format and store a numpy
        # cache to speed up the loading.
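
For readers unfamiliar with the `with` block pattern this PR refactors, here is a minimal sketch of a context manager that temporarily overrides sampling parameters and restores them on exit. The class name `RolloutSketch`, the dict-based `sampling_params`, and the exact restore logic are assumptions for illustration, not the code in this PR.

```python
from contextlib import contextmanager


class RolloutSketch:
    """Hypothetical stand-in for a rollout worker; not the actual verl class."""

    def __init__(self, **defaults):
        # Assumption: sampling parameters are kept in a plain dict.
        self.sampling_params = dict(defaults)

    @contextmanager
    def update_sampling_params(self, **overrides):
        # Remember the old values of keys that already exist,
        # and note which keys are newly introduced by the override.
        saved = {k: self.sampling_params[k] for k in overrides if k in self.sampling_params}
        added = [k for k in overrides if k not in self.sampling_params]
        self.sampling_params.update(overrides)
        try:
            yield self.sampling_params
        finally:
            # Restore the previous state even if the body raised.
            self.sampling_params.update(saved)
            for k in added:
                self.sampling_params.pop(k, None)


rollout = RolloutSketch(temperature=1.0, top_p=0.9)
with rollout.update_sampling_params(temperature=0.0, n=1):
    inside = dict(rollout.sampling_params)  # overrides are active here
after = dict(rollout.sampling_params)       # defaults are back here
```

Wrapping the restore step in `try`/`finally` means the defaults come back even when generation inside the block fails, which is the main reason to prefer a `with` block over manual set-and-reset calls.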

zhaochenyang20 · 2025-05-25T03:51:23Z

Well, I am fine with the `load_format`. But I think we seldom use `dummy` 😂

zyzshishui · 2025-05-25T06:07:43Z

verl/workers/rollout/sglang_rollout/sglang_rollout.py

-    non_pad_index = torch.nonzero(prompt_token_ids != pad_token_id, as_tuple=False)[0][0]
+    non_pad_index = torch.nonzero(prompt_token_ids != pad_token_id, as_tuple=False)[0][
+        0
+    ]


That's not graceful; please roll back to a single line.
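
For context on what the line under review computes: it looks up the index of the first token that is not padding in a left-padded prompt. A plain-Python sketch of the same idea follows; the helper names `first_non_pad_index` and `strip_left_padding` are hypothetical and do not appear in the PR, and the real code operates on torch tensors rather than lists.

```python
def first_non_pad_index(token_ids, pad_token_id):
    """Return the position of the first token that is not padding."""
    for i, tok in enumerate(token_ids):
        if tok != pad_token_id:
            return i
    # Assumption: an all-padding prompt is treated as an error here.
    raise ValueError("prompt contains only padding tokens")


def strip_left_padding(token_ids, pad_token_id):
    """Drop leading pad tokens, keeping everything from the first real token on."""
    return token_ids[first_non_pad_index(token_ids, pad_token_id):]


padded = [0, 0, 0, 11, 12, 0, 13]
index = first_non_pad_index(padded, pad_token_id=0)
stripped = strip_left_padding(padded, pad_token_id=0)
```

Only the leading padding is removed; a pad id that appears later in the sequence is left in place.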

Co-authored-by: Bihan Rana <bihan@Bihans-MacBook-Pro.local> Co-authored-by: peterschmidt85 <andrey.cheptsov@gmail.com>

refactor with block for sampling parameter update

0c325e9

zyzshishui reviewed May 25, 2025

Fix review

dbb568d

zyzshishui merged commit 7be85f7 into refactor May 25, 2025
2 checks passed

zyzshishui pushed a commit that referenced this pull request May 27, 2025

Add dstack example (#2) (volcengine#1706)

54b2677

Co-authored-by: Bihan Rana <bihan@Bihans-MacBook-Pro.local> Co-authored-by: peterschmidt85 <andrey.cheptsov@gmail.com>
