yossiovadia
Contributor

Additional fix needed for #320 so that --download-model uses HuggingFace (HF) instead of a PVC.

Current behavior: --download-model Qwen/Qwen3-0.6B → pvc://model-pvc/Qwen/Qwen3-0.6B → Broken hybrid mode
Fixed behavior: --download-model Qwen/Qwen3-0.6B → hf://Qwen/Qwen3-0.6B → Proper HuggingFace download
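
The before/after mapping above can be sketched in a few lines. This is only an illustration of the intended behavior, not the deployer's actual code: the function names `model_uri_for_download` and `legacy_pvc_uri` are hypothetical, and the PVC name `model-pvc` is taken from the example in this description.

```python
def model_uri_for_download(model_id: str) -> str:
    """Hypothetical helper: map a --download-model value to a model URI."""
    cleaned = model_id.strip().strip("/")
    if not cleaned:
        raise ValueError("model id must not be empty")
    # Fixed behavior: point straight at HuggingFace instead of a PVC path.
    return f"hf://{cleaned}"


def legacy_pvc_uri(model_id: str, pvc_name: str = "model-pvc") -> str:
    """Hypothetical helper: the pre-fix mapping, shown only for comparison."""
    cleaned = model_id.strip().strip("/")
    return f"pvc://{pvc_name}/{cleaned}"


if __name__ == "__main__":
    print(legacy_pvc_uri("Qwen/Qwen3-0.6B"))          # pre-fix URI
    print(model_uri_for_download("Qwen/Qwen3-0.6B"))  # post-fix URI
```

Keeping the mapping in one small function makes the scheme choice (hf:// versus pvc://) easy to test in isolation.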

Tested a full deployment (this time including inspecting the pod):

INFO 06-18 01:46:26 [cli_args.py:297] non-default args: {'port': 8001, 'served_model_name': ['Qwen/Qwen3-0.6B']}

Member

@nerdalert nerdalert left a comment

LGTM

@nerdalert nerdalert merged commit aff7242 into llm-d:main Jun 23, 2025
2 of 3 checks passed
nerdalert pushed a commit to nerdalert/llm-d-deployer that referenced this pull request Jun 23, 2025
2 participants