runner: clear cache when shift is not possible #9433

BruceMacD · 2025-03-01T00:27:31Z

Clear the KV cache when the model does not support the shift operation. This adds a KvCacheCanShift() check for models that cannot shift their cache: when the context window fills up, the runner falls back to a full cache clear while preserving the logical token history, so behavior stays as expected.

Fixes: #5975
Fixes: #8074
Fixes: #8571
Fixes: #8599
Fixes: #8602
Fixes: #8614
Fixes: #8924
Fixes: #9010
Fixes: #9047
Fixes: #9064
Fixes: #9105
Fixes: #9171
Fixes: #9248
Fixes: #9410
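
For readers skimming the description, the fallback can be sketched roughly as below. This is a minimal illustration, not the code in this PR: `kvCache`, `slot`, and `shiftSlot` are hypothetical names, the cache is modelled as a simple position counter, and the `canShift` field stands in for whatever KvCacheCanShift() reports for the loaded model.

```go
package main

import (
	"errors"
	"fmt"
)

// kvCache is a hypothetical stand-in for a model's KV cache. It is not the
// runner's real type; it only counts how many positions are cached.
type kvCache struct {
	canShift bool // assumption: mirrors what KvCacheCanShift() would report
	cells    int  // number of cached token positions
}

// shift drops `discard` cached positions, failing when shifting is unsupported.
func (c *kvCache) shift(discard int) error {
	if !c.canShift {
		return errors.New("cache shift not supported")
	}
	c.cells -= discard
	return nil
}

// clear empties the cache entirely.
func (c *kvCache) clear() { c.cells = 0 }

// slot pairs the logical token history with the cache that backs it.
type slot struct {
	inputs []int
	cache  *kvCache
}

// shiftSlot frees room in a full context by discarding `discard` tokens after
// the first `numKeep`. If the cache cannot shift, it is cleared instead and the
// caller is told to reprocess the surviving history.
func shiftSlot(s *slot, numKeep, discard int) (bool, error) {
	if numKeep < 0 || discard <= 0 || numKeep+discard > len(s.inputs) {
		return false, errors.New("invalid numKeep or discard")
	}

	reprocess := false
	if s.cache.canShift {
		if err := s.cache.shift(discard); err != nil {
			return false, err
		}
	} else {
		// Fallback: drop all cached state; the logical history is rebuilt below.
		s.cache.clear()
		reprocess = true
	}

	// The logical token history is trimmed the same way on both paths.
	kept := make([]int, 0, len(s.inputs)-discard)
	kept = append(kept, s.inputs[:numKeep]...)
	kept = append(kept, s.inputs[numKeep+discard:]...)
	s.inputs = kept
	return reprocess, nil
}

func main() {
	s := &slot{inputs: []int{1, 2, 3, 4, 5, 6, 7, 8}, cache: &kvCache{cells: 8}}
	reprocess, err := shiftSlot(s, 2, 3)
	fmt.Println(s.inputs, s.cache.cells, reprocess, err) // [1 2 6 7 8] 0 true <nil>
}
```

In this sketch the caller sees the same trimmed history either way; the only difference is whether the surviving tokens must be run through the model again to repopulate the cache.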

runner/llamarunner/cache.go

runner/ollamarunner/cache.go

runner/ollamarunner/runner.go

BruceMacD requested a review from jessegross March 1, 2025 00:27

BruceMacD mentioned this pull request Mar 1, 2025

runner: reduce deepseek failures by allowing dynamic num_predict behaviour. #9393

Closed

rick-github reviewed Mar 1, 2025

runner/llamarunner/cache.go (resolved)

jmorganca reviewed Mar 1, 2025

runner/llamarunner/cache.go (outdated, resolved)

jessegross reviewed Mar 3, 2025

runner/llamarunner/cache.go (outdated, resolved)

runner/llamarunner/cache.go (outdated, resolved)

runner/llamarunner/cache.go (outdated, resolved)

BruceMacD marked this pull request as draft March 3, 2025 23:46

BruceMacD commented Mar 4, 2025

runner/llamarunner/cache.go (outdated, resolved)

runner/llamarunner/cache.go (resolved)

runner/ollamarunner/cache.go (outdated, resolved)

BruceMacD force-pushed the brucemacd/ctx-shift-err branch from 40faf3c to 68776d9 Compare March 4, 2025 00:10

BruceMacD marked this pull request as ready for review March 4, 2025 00:10

BruceMacD requested a review from jessegross March 4, 2025 00:11

BruceMacD changed the title ~~llamarunner: clear cache when shift is not possible~~ runner: clear cache when shift is not possible Mar 4, 2025

jessegross reviewed Mar 4, 2025

BruceMacD force-pushed the brucemacd/ctx-shift-err branch from 682ea85 to 9c23f11 Compare March 11, 2025 04:24

BruceMacD requested a review from jessegross March 11, 2025 04:26

jessegross reviewed Mar 14, 2025

runner/ollamarunner/runner.go (outdated, resolved)

runner/ollamarunner/cache.go (outdated, resolved)

runner/ollamarunner/cache.go (outdated, resolved)

BruceMacD added 2 commits March 28, 2025 16:23

PR feedback

8ac3b75

BruceMacD force-pushed the brucemacd/ctx-shift-err branch from 9c23f11 to 8ac3b75 Compare March 28, 2025 23:53

BruceMacD requested a review from jessegross March 28, 2025 23:56

jessegross approved these changes Mar 31, 2025

BruceMacD merged commit 66b2539 into main Mar 31, 2025
8 checks passed

BruceMacD deleted the brucemacd/ctx-shift-err branch March 31, 2025 19:54

rick-github mentioned this pull request Apr 13, 2025

Deepseek (various) 236b crashes on run #7867

Closed

rick-github mentioned this pull request Aug 22, 2025

Deepseek2 with large context crashes with "Deepseek2 does not support K-shift" #5975

Closed
