Skip to content

Conversation

AlexKer
Copy link
Contributor

@AlexKer AlexKer commented Jul 29, 2025

This PR adds Baseten Model APIs with support of the DeepSeek, Llama and Kimi K2 models. We can now select Baseten as a provider in the Cline dropdown menu. This is similar to #4943.

This was tested locally with BASETEN_API_KEY and various unit tests (not included in PR).

  • 🐛 Bug fix (non-breaking change which fixes an issue)
  • ✨ New feature (non-breaking change which adds functionality)
  • 💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • ♻️ Refactor Changes
  • 💅 Cosmetic Changes
  • 📚 Documentation update
  • 🏃 Workflow Changes

Pre-flight Checklist

  • Changes are limited to a single feature, bugfix or chore (split larger changes into separate PRs)
  • Tests are passing (npm test) and code is formatted and linted (npm run format && npm run lint)
  • I have created a changeset using npm run changeset (required for user-facing changes)
  • I have reviewed contributor guidelines

Screenshots

Loom demo link: https://www.loom.com/share/acef33c66eca4a9494e947924fda79f0

Additional Notes


Important

Adds Baseten as a new provider with model support and integrates it into the API, state management, and UI components.

  • Behavior:
    • Adds Baseten as a provider option in ApiOptions.tsx and BasetenProvider.tsx.
    • Supports DeepSeek, Llama, and Kimi K2 models in baseten.ts.
    • Implements model refresh logic in refreshBasetenModels.ts.
  • Models:
    • Adds Baseten models to models.proto and api.ts.
    • Updates api-configuration-conversion.ts for Baseten model info conversion.
  • State Management:
    • Updates state.ts and state-keys.ts to handle Baseten API keys and model configurations.
    • Modifies ExtensionStateContext.tsx to manage Baseten models.
  • UI Components:
    • Introduces BasetenModelPicker.tsx for model selection.
    • Updates providerUtils.ts for Baseten model normalization.
  • Misc:
    • Updates README.md to include Baseten in supported providers list.

This description was created by Ellipsis for f34c120. You can customize this summary. It will automatically update as commits are pushed.

Copy link

changeset-bot bot commented Jul 29, 2025

🦋 Changeset detected

Latest commit: 76f93e8

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package
Name Type
claude-dev Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

@pashpashpash
Copy link
Contributor

Hey! Thanks for working on this @AlexKer! This is directionally correct, but looks like you got avalanched with merge conflicts. Do you mind resolving those and ping me for more thorough review + testing?

@AlexKer
Copy link
Contributor Author

AlexKer commented Jul 30, 2025

@pashpashpash looks like a new provider huaweiCloudMaas was added since Monday, so ran into some conflicts. Should all be resolved now:)

Copy link
Contributor

@arafatkatze arafatkatze left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix the token counting logic but emitting usage once

@AlexKer
Copy link
Contributor Author

AlexKer commented Aug 4, 2025

@arafatkatze @pashpashpash fixed! Thanks for your review, please merge when ready!

@arafatkatze
Copy link
Contributor

@AlexKer Your API is not persisted on the frontend. Please test the PR and fix the frontend and make sure you cna actually make a request that goes through

@AlexKer
Copy link
Contributor Author

AlexKer commented Aug 5, 2025

@arafatkatze fixed, also added support for 2 new qwen3 models we added the past week. Here is a video showing it working: https://www.loom.com/share/ba73fcbe26e6477490852676bb096527?t=11&sid=2167adb0-880f-4b3e-beef-5ada27378e91

@arafatkatze
Copy link
Contributor

image These two models didn't actually support images image

@AlexKer
Copy link
Contributor Author

AlexKer commented Aug 5, 2025

@arafatkatze thanks for catching those! changed.

@arafatkatze
Copy link
Contributor

@AlexKer This is a fast provider and I liked playing with it. I did not verify that the maxTokens, contextWindow and pricing of the models is accurate based on your documentation.

I can merge this PR provided you take the responsibility of their accuracy.

@AlexKer
Copy link
Contributor Author

AlexKer commented Aug 5, 2025

@arafatkatze I'm glad you enjoyed testing it -- much appreciated for your fast feedback. I can take responsibility in the accuracy based on our docs. We will look into a way of dynamically updating these values in the future.

@arafatkatze arafatkatze merged commit e3c966f into cline:main Aug 5, 2025
7 of 8 checks passed
@arafatkatze
Copy link
Contributor

Merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants