Fixes

License switch

The license was switched from MIT to Apache-2.0, which is more permissive (has explicit patent grants), so it should be easier to adopt in organizations.

Features

OpenAI compatibility endpoint:
- Support for max_completion_tokens parameter in /v1/chat/completions endpoint
- Support for messages parameter in /v1/chat/completions endpoint
- Support for stream parameter in /v1/chat/completions endpoint

Documentation: https://paddler.intentee.com/docs/migrating-to-paddler/openai-compatibility/

Full Changelog: v2.0.0...v2.1.0

What's Changed

Long story short, we rewrote most of the llama-server, made it scalable, and bundled that with Paddler. This means you do not have to deploy llama-server alongside Paddler anymore (the fewer moving parts, the better). :)

We also have a new, vastly improved admin panel, and a lot of other fixes, improvements, and changes.

Full Changelog: v1.2.1-rc1...v2.0.0

What's Changed

Long story short, we rewrote most of the llama-server, made it scalable, and bundled that with Paddler. This means you do not have to deploy llama-server alongside Paddler anymore (the fewer moving parts, the better). :)

We also have a new, vastly improved admin panel, and a lot of other fixes, improvements, and changes.

Full Changelog: v1.2.1-rc1...v2.0.0-rc1

Fixes

Fix overflow/underflow issues when managing slots

@Propfend

Features

Add TUI dashboard (paddler dashboard --management-addr [HOST]:[PORT]) to be able to easily observe balancer instances from the terminal level (thank you @Propfend for the contribution!)

@zamazan4ik

More meaningful error messages when the agent can't connect to the llama.cpp slot endpoint, or when slot endpoint is not enabled in llama.cpp
Set default logging level to info for agents and balancer to increase the amount of information in the logs (it wasn't clean if the agent was running or not)
Enable LTO optimization for the release builds (see #28) (thank you @zamazan4ik)

Paddler is now rewritten in Rust and uses the Pingora framework for the networking stack. A few minor API changes and reporting improvements are introduced (documented in the README).

This is a stability/quality release that solves some memory related issues, and makes the balancer more resilient to agents randomly going up/down.

Features

Add --local-llamacpp-api-key flag to balancer to support llama.cpp API keys (see: #23)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

License switch

Features

Uh oh!

What's Changed

Uh oh!

What's Changed

Uh oh!

Fixes

Uh oh!

Features

Contributors

Uh oh!

Contributors

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Features

Uh oh!

Releases: intentee/paddler

v2.1.0

License switch

Features

Uh oh!

v2.0.0

What's Changed

Uh oh!

v2.0.0-rc1

What's Changed

Uh oh!

v1.2.1-rc1

Fixes

Uh oh!

v1.2.0

Features

Contributors

Uh oh!

v1.1.0

Contributors

Uh oh!

v1.0.0

Uh oh!

v1.0.0-rc1

Uh oh!

v0.10.0

Uh oh!

v0.9.0

Features

Uh oh!