
Bug fix: VSCode LM API token counting for Claude models #5051


Merged
6 commits merged into cline:main on Jul 28, 2025

Conversation

@johnib (Contributor) commented Jul 20, 2025

Related Issue

Issues: #4584, #4027

Description

When using Cline with the VSCode LM API as the provider, token counting is wrong:

  1. It counts the system prompt twice.
  2. It passes the message object, rather than the message's text content, to the VSCode LM API's .countTokens() method, so the reported count is always 4.

The impact of these two bugs is that Cline cannot monitor context window usage, and therefore cannot automatically condense the conversation when it reaches the 80% threshold. Eventually the context window is exceeded and the entire Cline conversation breaks, because when the window is exceeded, the GitHub Copilot (GHCP) API truncates the conversation from the beginning, omitting Cline's system prompt.

See issue #4027 for an example.
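For illustration, here is a minimal sketch of the buggy pattern described above. The names and shapes are hypothetical, not the actual Cline source (which lives in src/api/providers/vscode-lm.ts):

```typescript
import * as vscode from "vscode"

// Hypothetical sketch of the two bugs; not the actual Cline implementation.
async function calculateTotalInputTokens(
	model: vscode.LanguageModelChat,
	systemPrompt: string,
	messages: vscode.LanguageModelChatMessage[],
): Promise<number> {
	// Bug 1: the system prompt is counted here even though it is already
	// included in `messages`, so its tokens are counted twice.
	let total = await model.countTokens(systemPrompt)
	for (const msg of messages) {
		// Bug 2: the message object is passed instead of its text content;
		// per this PR, countTokens() then always reports 4.
		total += await model.countTokens(msg)
	}
	return total
}
```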

Test Procedure

For the past several days I have been using a private build of Cline that incorporates this change, in order to validate that it works properly.

Type of Change

  • 🐛 Bug fix (non-breaking change which fixes an issue)
  • ✨ New feature (non-breaking change which adds functionality)
  • 💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • ♻️ Refactor Changes
  • 💅 Cosmetic Changes
  • 📚 Documentation update
  • 🏃 Workflow Changes

Pre-flight Checklist

  • Changes are limited to a single feature, bugfix or chore (split larger changes into separate PRs)
  • Tests are passing (npm test) and code is formatted and linted (npm run format && npm run lint)
  • I have created a changeset using npm run changeset (required for user-facing changes)
  • I have reviewed contributor guidelines

Screenshots

Additional Notes


Important

Fixes token counting for Claude models in VsCodeLmHandler by using a character-to-token ratio and removing double counting of system prompt tokens.

  • Behavior:
    • Fixes token counting for Claude models in VsCodeLmHandler by using a 4:1 character-to-token ratio.
    • Removes double counting of system prompt tokens in calculateTotalInputTokens().
  • Functions:
    • Adds extractTextFromMessage() to extract text from vscode.LanguageModelChatMessage.
    • Adds isClaudeModel() to check if the model is a Claude model.
    • Modifies countTokens() to use a character-to-token ratio for Claude models (see the sketch below).
  • Misc:
    • Removes systemPrompt parameter from calculateTotalInputTokens() in vscode-lm.ts.

This description was created by Ellipsis for 17ecc59.
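A minimal sketch of the fixed flow, assuming the helper names from the summary above; the actual implementation in src/api/providers/vscode-lm.ts may differ in detail:

```typescript
import * as vscode from "vscode"

// Extract plain text from a chat message so we count content, not the
// wrapper object. Assumes `content` is an array of parts (VSCode 1.90+).
function extractTextFromMessage(msg: vscode.LanguageModelChatMessage): string {
	return msg.content
		.filter((part): part is vscode.LanguageModelTextPart => part instanceof vscode.LanguageModelTextPart)
		.map((part) => part.value)
		.join("")
}

// Heuristic Claude detection from the model metadata exposed by VSCode.
function isClaudeModel(model: vscode.LanguageModelChat): boolean {
	return `${model.id} ${model.family} ${model.name}`.toLowerCase().includes("claude")
}

// Count tokens for one message: ~4 characters per token for Claude models,
// VSCode's built-in counter for everything else.
async function countTokens(
	model: vscode.LanguageModelChat,
	msg: vscode.LanguageModelChatMessage,
): Promise<number> {
	const text = extractTextFromMessage(msg)
	if (isClaudeModel(model)) {
		return Math.ceil(text.length / 4)
	}
	return model.countTokens(text)
}
```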

Jonathan Barazany added 6 commits July 7, 2025 10:11
- Reorder imports for better organization
- Add extractTextFromMessage helper method
- Add isClaudeModel detection method
- Use 4:1 character-to-token ratio for Claude models instead of VSCode's inaccurate counting
- Fallback to existing VSCode LM token counting for non-Claude models
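As a quick, hypothetical usage example of the heuristic (reusing the countTokens() sketch above; the vendor and family ids are assumptions that depend on the installed Copilot extension):

```typescript
async function demo(): Promise<void> {
	// Pick a Copilot-provided Claude model; the family id is an assumption.
	const [model] = await vscode.lm.selectChatModels({ vendor: "copilot", family: "claude-3.5-sonnet" })
	if (!model) return // no matching model available

	const msg = vscode.LanguageModelChatMessage.User("Summarize the current file.")
	// 27 characters → ceil(27 / 4) = 7 estimated tokens for a Claude model.
	console.log(await countTokens(model, msg))
}
```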
Copilot AI review requested due to automatic review settings, July 20, 2025 08:59

changeset-bot bot commented Jul 20, 2025

🦋 Changeset detected

Latest commit: 17ecc59

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package:

  • claude-dev (Patch)


Copilot AI (Contributor) left a comment


Pull Request Overview

This PR fixes critical token counting issues in the VSCode LM API provider that were causing context window management failures and conversation breaks. The bug prevented Cline from properly monitoring context usage and automatically condensing conversations when reaching the 80% threshold.

Key changes include:

  • Eliminates double-counting of the system prompt in token calculations
  • Implements character-to-token ratio estimation for Claude models instead of relying on VSCode's inaccurate API
  • Adds proper text extraction from VSCode language model chat messages

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

Files reviewed:

  • src/api/providers/vscode-lm.ts: Core token counting logic fixes and Claude model detection
  • .changeset/smart-jobs-impress.md: Changeset for the bug fix release

@arafatkatze (Contributor) left a comment

Put debug statements in your code and you will see that for the GPT-4 family of models the token counting calculations are inaccurate, so please fix that.

This is not about your PR, as the Claude 4 family of models is correct, btw.

@johnib (Contributor, Author) commented Jul 28, 2025

Put debug statements in your code and you will see that for the GPT-4 family of models the token counting calculations are inaccurate, so please fix that.

This is not about your PR, as the Claude 4 family of models is correct, btw.

Not quite sure I understood your message.

  1. Are you saying that token counting for Claude models is correct in this branch?
  2. Do you want to extend this PR to fix token counting for all models provided by the VSCode LM API?

I don't use the rest of the models, so it's hard for me to verify that.
I can do that, but I would rather separate it from this PR, if that's okay with you.

@arafatkatze (Contributor) commented

Are you saying that token counting for Claude models is correct in this branch?

Yes

Do you want to extend this PR to fix token counting for all models provided by the VSCode LM API?

That would be nice, but I understand that this would be beyond the scope of this PR.

arafatkatze self-requested a review, July 28, 2025 09:09
arafatkatze merged commit 65c21e7 into cline:main, Jul 28, 2025
8 checks passed