Skip to content

Conversation

lifuhuang
Copy link
Collaborator

Motivation

Allowing customer to limit the maximal # of LoRAs loaded in CPU.

Modifications

Accuracy Test

Benchmark & Profiling

Checklist

Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Copy link
Collaborator

@Fridge003 Fridge003 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zhyncs zhyncs merged commit 8675bdf into main Aug 3, 2025
121 of 128 checks passed
@zhyncs zhyncs deleted the lifuhuang/max-lora branch August 3, 2025 07:02
htiennv pushed a commit to htiennv/sglang that referenced this pull request Aug 5, 2025
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 17, 2025
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants