v0.6.0 - 2025-08-08
This release includes these major changes:
- Added support for DeepSeek-R1/DeepSeek-V3 models.
- Added /v1/chat/completions API for RAGEngine.
- Provided better UX for preferred nodes and cpu nodes.
- Updated documentation with new features, integrations.
- Added NVIDIA A10 GPU to the supported SKUs.
Changelog
Features 🌈
- 253d6aa feat: add /v1/chat/completions API for RAGEngine (#1277)
- dc46418 feat: add deepseek-r1/deepseek-v3 model (#1251)
- bfd271a feat: adding RAGEngine CRD shortName and ServiceReady status column (#1336)
- 45cbbf1 feat: support Preferred node in RAG (#1327)
- ee0c3d0 feat: add
make help
target to Makefile (#1248)
Bug Fixes 🐞
- e39f82f fix: pin phi2 to vllm v0 (#1369)
- 8f4fa75 fix: fix bug where fetch GPU count was failing and defaulting (#1338)
- 06f4cbd fix: resolve pydantic deprecation warnings (#1317)
- 78ef22d fix: image link error in scaler proposals (#1318)
- 2163923 fix: get gpu config from status if preferred nodes provided (#1308)
- 76e099e fix: avoid extra node creation on informercache delay (#1311)
Code Refactoring 💎
- d2ac059 refactor: adopt generator pattern in fine-tuning part (#1292)
- aea0a42 refactor: introduce manifest generator -- part 1 (#1284)
Continuous Integration 💜
- 461f75c ci: update release branch prefix to 'release-' (#1371)
- a42040d ci: Expand trivy scanning to other images (#1161)
Documentation 📘
- 5fe9a44 docs: add release management (#1360)
- ca4b6e2 docs: add chat/completions rag docs and split install/api docs (#1334)
- d50af9d docs: Document Model-As-OCI-Artifacts feature (#1359)
- 3ea17ff docs: fix ConfigMap creation sequence in example docs (#1348)
- fa41991 docs: add aikit to integrations (#1303)
- 1112673 docs: Kaito kubectl cli proposal (#1230)
- 5e750b8 docs: Update documentation to use chat completions API instead of deprecated completions API (#1340)
- 4a75426 docs: verify docs site with Algolia (#1328)
- 336d7be docs: add Headlamp-KAITO to documentation (#1314)
- 03d8665 docs: add search to website using Algolia (#1302)
- ff93426 docs: update installation docs to support different cloud providers (#1247)
- fca7291 docs: publish v0.5.1 docs (#1289)
Maintenance 🔧
- 4ad29f8 chore: bump base image to 0.0.5 (#1364)
- 91b38b1 chore: bump actions/setup-go from 5.4.0 to 5.5.0 (#1345)
- 5acd3c7 chore: rename Go files to use underscore file naming convention (#1361)
- 0389855 chore: fix references to yaml files and rename bugs (#1347)
- 152c0e8 chore: rename .yml to .yaml extension in GH actions for consistency (#1339)
- f00cad7 chore: bump actions/cache from 4.2.2 to 4.2.3 (#1344)
- db679e8 chore: Revert the model used in the rag e2e test back to Phi-3 (#1342)
- ea6216f chore: update node sku for e2e tests (#1341)
- af43eb5 chore: remove unnecessary sleep in test (#1332)
- fc4a847 chore: bump actions/setup-node from 4.3.0 to 4.4.0 (#1324)
- 8c7ac49 chore: reduce verbose logs and unnecessary reconcile (#1312)
- 725da54 chore: bump starlette from 0.40.0 to 0.47.2 in /presets/workspace/dependencies (#1290)
- 3f85a58 chore: bump step-security/harden-runner from 2.12.2 to 2.13.0 (#1287)