Feat: Prompt Caching in SAP AI Core #5399

ncryptedV1 · 2025-08-06T20:48:04Z

Related Issue

None, this is just a minor improvement to the SAP AI Core inference provider.

Description

This PR adds prompt caching for SAP AI Core to recent Claude models (Claude 4 Sonnet/Opus & Claude 3.7 Sonnet), significantly reducing costs and improving response times. For Vertex AI & Azure Open AI inferences, caching already automatically takes place under the hood, just as in the native providers.
Additionally, as part of adding cache support, the provider has been cleaned up and refactored to align closer with the original providers, easing the integration of future changes to those.

Test Procedure

I tested inference with all registered SAP AI Core models using the regular Cline workflow for two separate AI Core instances. Solely Claude 4 Opus did not work as it's not available in SAP AI Core, yet. This is the only place where the changes could break existing functionalities.

Type of Change

🐛 Bug fix (non-breaking change which fixes an issue)
✨ New feature (non-breaking change which adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
♻️ Refactor Changes
💅 Cosmetic Changes
📚 Documentation update
🏃 Workflow Changes

Pre-flight Checklist

Changes are limited to a single feature, bugfix or chore (split larger changes into separate PRs)
Tests are passing (npm test) and code is formatted and linted (npm run format && npm run lint)
I have created a changeset using npm run changeset (required for user-facing changes)
I have reviewed contributor guidelines

Screenshots

/

Additional Notes

While a PR for this already exists in #4683, this PR also adds a slight refactoring to align payload preparation with the native providers. This renders the integration of future adaptations to these providers easier. @tjandy98 @lizzzcai @schardosin

Important

Adds prompt caching for SAP AI Core Claude models and refactors provider for better alignment with native providers.

Behavior:
- Adds prompt caching for SAP AI Core in sapaicore.ts for Claude models (Claude 4 Sonnet/Opus & Claude 3.7 Sonnet).
- Refactors SapAiCoreHandler to align with native providers.
Caching:
- Introduces Bedrock and Gemini namespaces in sapaicore.ts for caching functions.
- Implements prepareSystemMessages, applyCacheControlToMessages, and formatMessagesForConverseAPI in Bedrock.
- Implements processStreamChunk and prepareRequestPayload in Gemini.
Models:
- Updates sapAiCoreModels in api.ts to support prompt caching for specific models.

^{This description was created by}^{for c323fd4. You can customize this summary. It will automatically update as commits are pushed.}

…entation (and make implicit caching clear)

remove: caching support flag for older claude models

changeset-bot · 2025-08-06T20:48:08Z

🦋 Changeset detected

Latest commit: 5e6de55

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
claude-dev	Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

package-lock.json

arafatkatze · 2025-08-07T05:12:25Z

@ncryptedV1 Thanks for the PR, for SAP I would need an enterprise account to be able to set it up and use locally. We have two options

If you have properly tested it yourself and can verify that it works perfectly then if you send some screenshots of it working in a debugger etc and then I will approve and merge the PR.
If you would like me to test this locally, you can email me at ara@cline.bot and then we can have a separate conversation about creds setup etc.

lizzzcai · 2025-08-07T08:19:27Z

@tjandy98 can you help to test this out, thanks.

saoudrizwan · 2025-08-09T03:02:12Z

@tjandy98 please let us know when we are good to go to merge this -- we cannot test this ourselves.

tjandy98 · 2025-08-09T05:48:33Z

Hello, I have tested the changes, caching is working as expected for Gemini. SAP AI Core does not return cache usage information(converse stream) for Claude models at this point.

arafatkatze

@tjandy98 Since you have confirmed that this works I am approving and merging this PR.

arafatkatze · 2025-08-09T07:15:04Z

@tjandy98 You also mentioned

SAP AI Core does not return cache usage information(converse stream) for Claude models at this point.

So feel free to make a followup PR if something needs to change there.

ncryptedV1 added 6 commits July 22, 2025 22:35

add: caching support for bedrock (claude)

f663c0f

Merge branch 'main' into sapaicore-caching

ca66fef

refactor: gemini message handling to adhere closer to original implem…

40860f9

…entation (and make implicit caching clear)

remove: unused bedrock conversion functions

9676bee

fix: payload for converse stream (older claude models)

c267b44

remove: caching support flag for older claude models

add: changeset

c323fd4

ncryptedV1 requested review from saoudrizwan, ocasta181, NightTrek, pashpashpash, dcbartlett, saito-sv and Garoth as code owners August 6, 2025 20:48

celestial-vault reviewed Aug 7, 2025

View reviewed changes

package-lock.json Outdated Show resolved Hide resolved

Update package-lock.json

5e6de55

arafatkatze approved these changes Aug 9, 2025

View reviewed changes

arafatkatze merged commit 759ef87 into cline:main Aug 9, 2025
7 of 8 checks passed

GTxx mentioned this pull request Aug 9, 2025

fix: calibrate input token when using anthropic models of sap ai core… #5469

Merged

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat: Prompt Caching in SAP AI Core #5399

Feat: Prompt Caching in SAP AI Core #5399

Uh oh!

ncryptedV1 commented Aug 6, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

changeset-bot bot commented Aug 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

arafatkatze commented Aug 7, 2025

Uh oh!

lizzzcai commented Aug 7, 2025

Uh oh!

saoudrizwan commented Aug 9, 2025

Uh oh!

tjandy98 commented Aug 9, 2025 •

edited

Loading

Uh oh!

arafatkatze left a comment

Uh oh!

Uh oh!

arafatkatze commented Aug 9, 2025

Uh oh!

Uh oh!

Feat: Prompt Caching in SAP AI Core #5399

Feat: Prompt Caching in SAP AI Core #5399

Uh oh!

Conversation

ncryptedV1 commented Aug 6, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issue

Description

Test Procedure

Type of Change

Pre-flight Checklist

Screenshots

Additional Notes

Uh oh!

changeset-bot bot commented Aug 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

Uh oh!

arafatkatze commented Aug 7, 2025

Uh oh!

lizzzcai commented Aug 7, 2025

Uh oh!

saoudrizwan commented Aug 9, 2025

Uh oh!

tjandy98 commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arafatkatze left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

arafatkatze commented Aug 9, 2025

Uh oh!

Uh oh!

ncryptedV1 commented Aug 6, 2025 •

edited by ellipsis-dev bot

Loading

changeset-bot bot commented Aug 6, 2025 •

edited

Loading

tjandy98 commented Aug 9, 2025 •

edited

Loading