zhuangqh
Collaborator

Reason for Change:

Requirements

  • added unit tests and e2e tests (if applicable).

Issue Fixed:

Notes for Reviewers:

Title

Add support for deepseek-r1 and deepseek-v3 models


Description

  • Added support for deepseek-r1 and deepseek-v3 models

  • Registered new models in plugin registration

  • Defined constants and runtime parameters for new models

  • Updated supported_models.yaml with new models


Changes walkthrough 📝

Relevant files

Enhancement

model.go: Add deepseek-r1 and deepseek-v3 support
presets/workspace/models/deepseek/model.go (+101/-0)

  • Registered deepseek-r1 and deepseek-v3 models in plugin registration
  • Defined constants for new models
  • Added runtime parameters for deepseek-r1 and deepseek-v3
  • Implemented GetInferenceParameters, GetTuningParameters,
    SupportDistributedInference, and SupportTuning methods for new models

Configuration changes

supported_models.yaml: Update supported models
presets/workspace/models/supported_models.yaml (+13/-0)

  • Added entries for deepseek-r1-0528 and deepseek-v3-0324 models
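
For readers unfamiliar with the registration pattern the walkthrough refers to, the following is a minimal sketch, not the code from this PR. The Registry type, the reduced Model interface, the constant values, and the boolean return values are all assumptions made for illustration; only the method names SupportDistributedInference and SupportTuning and the two model names come from the walkthrough.

```go
package main

import (
	"fmt"
	"sort"
)

// Hypothetical preset names. The values mirror the supported_models.yaml
// entries named in the walkthrough; the real constants may differ.
const (
	PresetDeepSeekR1Model = "deepseek-r1-0528"
	PresetDeepSeekV3Model = "deepseek-v3-0324"
)

// Model is a reduced, hypothetical stand-in for the interface the PR implements.
type Model interface {
	SupportDistributedInference() bool
	SupportTuning() bool
}

// The return values below are placeholders, not taken from the PR.
type deepseekR1 struct{}

func (deepseekR1) SupportDistributedInference() bool { return true }
func (deepseekR1) SupportTuning() bool               { return false }

type deepseekV3 struct{}

func (deepseekV3) SupportDistributedInference() bool { return true }
func (deepseekV3) SupportTuning() bool               { return false }

// Registry maps preset names to model implementations (hypothetical type).
type Registry struct {
	models map[string]Model
}

func NewRegistry() *Registry {
	return &Registry{models: make(map[string]Model)}
}

// Register rejects duplicate names so a copy-pasted registration fails loudly.
func (r *Registry) Register(name string, m Model) error {
	if _, exists := r.models[name]; exists {
		return fmt.Errorf("model %q is already registered", name)
	}
	r.models[name] = m
	return nil
}

// Names returns the registered preset names in sorted order.
func (r *Registry) Names() []string {
	names := make([]string, 0, len(r.models))
	for name := range r.models {
		names = append(names, name)
	}
	sort.Strings(names)
	return names
}

func main() {
	r := NewRegistry()
	for name, m := range map[string]Model{
		PresetDeepSeekR1Model: deepseekR1{},
		PresetDeepSeekV3Model: deepseekV3{},
	} {
		if err := r.Register(name, m); err != nil {
			panic(err)
		}
	}
	fmt.Println(r.Names())
}
```

Rejecting duplicate names at registration time is a design choice of this sketch; it would not catch a model registered under the right name that returns another model's parameters.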

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Duplicate Metadata

The Metadata field in the GetInferenceParameters method for both deepseekR1 and deepseekV3 uses PresetDeepSeekR1Model. This might be incorrect; deepseekV3 should probably use PresetDeepSeekV3Model.

return &model.PresetParam{

Inconsistent Model Names

The ModelName field in the VLLM struct for both deepseekR1 and deepseekV3 uses PresetDeepSeekR1Model. Each model should use its own constant: PresetDeepSeekR1Model for deepseekR1 and PresetDeepSeekV3Model for deepseekV3.

ModelRunParams:       deepseekR1RunParamsVLLM,
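
Since the guide reports no relevant tests, a small consistency check could catch both observations. The sketch below is an assumption-heavy illustration: PresetParam is reduced to two string fields standing in for Metadata and ModelName, the constant values are hypothetical, and the deepseekV3 method deliberately reproduces the pattern the observations describe rather than quoting the PR's code.

```go
package main

import "fmt"

// Hypothetical preset names; the real constants live in model.go.
const (
	PresetDeepSeekR1Model = "deepseek-r1-0528"
	PresetDeepSeekV3Model = "deepseek-v3-0324"
)

// PresetParam is a reduced, hypothetical version of model.PresetParam.
type PresetParam struct {
	MetadataName string // stands in for Metadata: metadata.MustGet(name)
	ModelName    string // stands in for the vLLM ModelName field
}

type inferenceModel interface {
	GetInferenceParameters() *PresetParam
}

type deepseekR1 struct{}

func (deepseekR1) GetInferenceParameters() *PresetParam {
	return &PresetParam{MetadataName: PresetDeepSeekR1Model, ModelName: PresetDeepSeekR1Model}
}

type deepseekV3 struct{}

// Deliberately reproduces the pattern the review describes: R1 names in V3.
func (deepseekV3) GetInferenceParameters() *PresetParam {
	return &PresetParam{MetadataName: PresetDeepSeekR1Model, ModelName: PresetDeepSeekR1Model}
}

// mismatches reports every name-dependent field that differs from the
// preset name the model is expected to be registered under.
func mismatches(registered string, m inferenceModel) []string {
	p := m.GetInferenceParameters()
	var out []string
	if p.MetadataName != registered {
		out = append(out, fmt.Sprintf("%s: Metadata uses %s", registered, p.MetadataName))
	}
	if p.ModelName != registered {
		out = append(out, fmt.Sprintf("%s: ModelName uses %s", registered, p.ModelName))
	}
	return out
}

func main() {
	for _, msg := range mismatches(PresetDeepSeekR1Model, deepseekR1{}) {
		fmt.Println(msg)
	}
	for _, msg := range mismatches(PresetDeepSeekV3Model, deepseekV3{}) {
		fmt.Println(msg)
	}
}
```

In a real test file the same comparison would typically sit in a table-driven test over every registered model.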

PR Code Suggestions ✨

Explore these optional code suggestions:

Category: Possible issue
Impact: Medium
Suggestion: Correct preset name

Use the correct preset name for deepseekV3.

presets/workspace/models/deepseek/model.go [194]

-Metadata:                  metadata.MustGet(PresetDeepSeekR1Model),
+Metadata:                  metadata.MustGet(PresetDeepSeekV3Model),

Suggestion importance[1-10]: 8

Why: The Metadata field for deepseekV3 incorrectly uses PresetDeepSeekR1Model. This should be corrected to PresetDeepSeekV3Model to match the intended model registration.
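
Beyond the one-line fix, one way to make this kind of copy-paste slip harder is to derive every name-dependent field from a single argument. This is only a sketch of that idea, not something proposed in the PR: newPresetParam, the reduced PresetParam struct, and the constant values are hypothetical.

```go
package main

import "fmt"

// Hypothetical preset names; the real constants live in model.go.
const (
	PresetDeepSeekR1Model = "deepseek-r1-0528"
	PresetDeepSeekV3Model = "deepseek-v3-0324"
)

// PresetParam is a reduced, hypothetical version of model.PresetParam.
type PresetParam struct {
	MetadataName string // stands in for Metadata: metadata.MustGet(name)
	ModelName    string // stands in for the vLLM ModelName field
}

// newPresetParam fills both name-dependent fields from one preset name,
// so the two cannot drift apart within a single model definition.
func newPresetParam(presetName string) *PresetParam {
	return &PresetParam{MetadataName: presetName, ModelName: presetName}
}

func main() {
	for _, name := range []string{PresetDeepSeekR1Model, PresetDeepSeekV3Model} {
		p := newPresetParam(name)
		fmt.Printf("%s -> metadata=%s model=%s\n", name, p.MetadataName, p.ModelName)
	}
}
```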

Signed-off-by: zhuangqh <zhuangqhc@gmail.com>

Fei-Guo merged commit dc46418 into kaito-project:main on Aug 4, 2025
18 of 20 checks passed
github-project-automation bot moved this from In Progress to Done in KAITO Roadmap on Aug 4, 2025