llama.vim: filter server response fields #24

VJHack · 2025-01-14T17:28:06Z

As of 10940, we are now able to serialize only relavent data.

A suggestion was made here to filter out unnecessary fields in the response. This feature ensures that we are transporting the most minimal response from the server to the client.

There are two places where this change was made:

When we make the curl request in ring_update() to asyncronously process extra_context, we filter out all response fields since we don't take any action on the response.
These are the response fields we want to include when making the main fim call

'response_fields':  [ 
                     "content",
                     "timings/prompt_n", 
                     "timings/prompt_ms", 
                     "timings/prompt_per_token_ms",
                     "timings/prompt_per_second",
                     "timings/predicted_n", 
                     "timings/predicted_ms", 
                     "timings/predicted_per_token_ms",
                     "timings/predicted_per_second",
                     "truncated",
                     "tokens_cached",
                     "generation_settings/n_ctx",
                     ],

autoload/llama.vim

formatting comma Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

ggerganov · 2025-01-14T17:56:42Z

autoload/llama.vim

-
-        let l:n_cached  = get(l:response, 'tokens_cached', 0)
-        let l:truncated = get(l:response, 'truncated', v:false)
+        let l:n_ctx = get(l:response, 'generation_settings/n_ctx', 0)


After recent refactoring, the server no longer provides the n_ctx field in the responses. We should remove it from the llama.vim client and instead of displaying:

c: 1234 / 0 | ...

We should simply display:

c: 1234 | ...

I did notice that. I removed all instances of n_ctx field and adjusted the info message accordingly as shown below.

Thank you!

VJHack added 4 commits January 14, 2025 10:56

filter response fields

aa91f6a

clean up

c57ac19

response fields for ring update

5bce411

formatting change

32169e8

VJHack requested a review from ggerganov January 14, 2025 17:30

ggerganov reviewed Jan 14, 2025

View reviewed changes

autoload/llama.vim Outdated Show resolved Hide resolved

Update autoload/llama.vim

fc6f90d

formatting comma Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

ggerganov reviewed Jan 14, 2025

View reviewed changes

ggerganov approved these changes Jan 14, 2025

View reviewed changes

VJHack added 2 commits January 14, 2025 15:16

removed n_ctx

79ee0d1

merge

4185ae4

ggerganov merged commit 9f5cadd into ggml-org:master Jan 15, 2025

ggerganov added a commit that referenced this pull request Feb 12, 2025

resp : fix context size info (#24)

dd1c20f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llama.vim: filter server response fields #24

llama.vim: filter server response fields #24

Uh oh!

VJHack commented Jan 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

ggerganov Jan 14, 2025

Uh oh!

VJHack Jan 14, 2025

Uh oh!

Uh oh!

llama.vim: filter server response fields #24

llama.vim: filter server response fields #24

Uh oh!

Conversation

VJHack commented Jan 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ggerganov Jan 14, 2025

Choose a reason for hiding this comment

Uh oh!

VJHack Jan 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

VJHack commented Jan 14, 2025 •

edited

Loading