@ch-wan ch-wan commented Aug 1, 2025

Motivation

This PR introduces --moe-a2a-backend and deprecates --enable-ep-moe and --enable-deepep-moe.
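
For readers unfamiliar with this kind of flag migration, the following is a minimal sketch of how two deprecated boolean switches could be folded into one selector option using `argparse`. It is not the code from this PR. The backend values `none` and `deepep`, the helper name `parse_moe_args`, and the choice to only warn (without changing the backend) for `--enable-ep-moe` are all assumptions made for illustration.

```python
import argparse
import warnings

# Assumed backend names, for illustration only; the PR text does not list them.
A2A_BACKENDS = ("none", "deepep")


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # New selector option that replaces the boolean switches.
    parser.add_argument("--moe-a2a-backend", choices=A2A_BACKENDS, default="none")
    # Deprecated switches are still accepted so existing launch scripts keep working.
    parser.add_argument("--enable-ep-moe", action="store_true",
                        help="Deprecated.")
    parser.add_argument("--enable-deepep-moe", action="store_true",
                        help="Deprecated; use --moe-a2a-backend instead.")
    return parser


def parse_moe_args(argv):
    """Parse argv and translate deprecated flags (hypothetical helper)."""
    args = build_parser().parse_args(argv)
    if args.enable_deepep_moe:
        warnings.warn(
            "--enable-deepep-moe is deprecated; use --moe-a2a-backend deepep",
            DeprecationWarning,
            stacklevel=2,
        )
        # Assumed mapping: the old DeepEP switch selects the 'deepep' backend.
        args.moe_a2a_backend = "deepep"
    if args.enable_ep_moe:
        # Assumption: this sketch only warns here and leaves the backend as-is.
        warnings.warn(
            "--enable-ep-moe is deprecated",
            DeprecationWarning,
            stacklevel=2,
        )
    return args
```

Keeping the old switches in the parser and translating them in one place means downstream code only needs to read `moe_a2a_backend`, while users of the old flags get a warning instead of a hard failure.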

Modifications

Accuracy Test

Benchmark & Profiling

Checklist


@ch-wan ch-wan changed the title from "[5/N] [wip] MoE Refactor: Update MoE parallelism arguments" to "[5/N] MoE Refactor: Update MoE parallelism arguments" Aug 1, 2025
@ch-wan ch-wan merged commit 6c88f6c into sgl-project:main Aug 1, 2025
28 of 62 checks passed
@ch-wan ch-wan deleted the cheng/refactor/update-ep-args branch August 1, 2025 08:20
TianQiLin666666 pushed a commit to TianQiLin666666/sglang that referenced this pull request Aug 1, 2025
@ch-wan ch-wan mentioned this pull request Aug 2, 2025
22 tasks
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 17, 2025
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 18, 2025