Releases: gpustack/llama-box

v0.0.171

16 Aug 03:52
refactor: adjust context size

Signed-off-by: thxCode <thxcode0824@gmail.com>

v0.0.170

13 Aug 03:34
  1. Rebase upstream.
  2. Fix tool calling with --jinja in Qwen3 Coder.

v0.0.169

07 Aug 10:54
  1. Rebase upstream.
  2. Fix MUSA release suffix.

v0.0.168

04 Aug 04:54
  1. Rebase upstream.

v0.0.167

28 Jul 02:23
  1. Rebase upstream;
  2. Bump MUSA to rc4.2.0, cc @yeahdongcn.

v0.0.166

21 Jul 07:56
  1. Rebase upstream.

v0.0.165

17 Jul 11:03
  1. Rebase upstream;
  2. Support Kimi-K2.

v0.0.164

11 Jul 15:50
  1. Rebase upstream;
  2. Fix VRAM being occupied with zero offloading in DL packages.

v0.0.163

10 Jul 01:34
  1. Fix chat failures when the service is deployed inside a Docker container.

v0.0.162

08 Jul 19:59
  1. Simplify samplers;
  2. Refactor management of the embeddings KV cache;
  3. Fix invalid capacity in Darwin RPC server.