Skip to content

Conversation

BearBiscuit05
Copy link
Collaborator

No description provided.

vermouth1992
vermouth1992 previously approved these changes Apr 17, 2025
@vermouth1992 vermouth1992 merged commit 0bdf7f4 into volcengine:main Apr 18, 2025
18 checks passed
@BearBiscuit05 BearBiscuit05 deleted the tmp_qwen_moe branch April 18, 2025 01:46
yellowbee686 pushed a commit to yellowbee686/verl that referenced this pull request Apr 18, 2025
vermouth1992 pushed a commit that referenced this pull request Apr 24, 2025
## Motivation
This is a fix for the issue where the `weight_loader` in FusedMoe of the
vLLM code could not be used correctly during the resharding phase,
addressed in #923, #1137, and #1139 respectively. Currently, the results
of these PRs can be used together, allow both FSDP and Megatron to use
the same function, reducing code maintenance costs.
yuchenwang3 pushed a commit to yuchenwang3/verl that referenced this pull request Apr 25, 2025
ScottCTD pushed a commit to ScottCTD/verl that referenced this pull request May 5, 2025
## Motivation
This is a fix for the issue where the `weight_loader` in FusedMoe of the
vLLM code could not be used correctly during the resharding phase,
addressed in volcengine#923, volcengine#1137, and volcengine#1139 respectively. Currently, the results
of these PRs can be used together, allow both FSDP and Megatron to use
the same function, reducing code maintenance costs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants