Releases: gpustack/llama-box
Releases · gpustack/llama-box
v0.0.171
v0.0.170
- Rebase upstream.
- Fix tool calling with
--jinja
in Qwen3 Coder.
v0.0.169
- Rebase upstream.
- Fix MUSA release suffix.
v0.0.168
- Rebase upstream.
v0.0.167
- Rebase upstream;
- Bump MUSA to rc4.2.0, cc @yeahdongcn .
v0.0.166
- Rebase upstream.
v0.0.165
- Rebase upstream;
- Support Kimi-K2.
v0.0.164
- Rebase upstream;
- Fix zero offloading VRAM occupied in DL packages.
v0.0.163
- Fix failed while chatting service deployed inside a Docker container.
v0.0.162
- Simplify samplers;
- Refactor managing embeddings KV cache;
- Fix invalid capacity in Darwin RPC server.