Skip to content

v0.6.0

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 08 Aug 07:51
· 29 commits to main since this release
461f75c

v0.6.0 - 2025-08-08

This release includes these major changes:

  • Added support for DeepSeek-R1/DeepSeek-V3 models.
  • Added /v1/chat/completions API for RAGEngine.
  • Provided better UX for preferred nodes and cpu nodes.
  • Updated documentation with new features, integrations.
  • Added NVIDIA A10 GPU to the supported SKUs.

Changelog

Features 🌈

  • 253d6aa feat: add /v1/chat/completions API for RAGEngine (#1277)
  • dc46418 feat: add deepseek-r1/deepseek-v3 model (#1251)
  • bfd271a feat: adding RAGEngine CRD shortName and ServiceReady status column (#1336)
  • 45cbbf1 feat: support Preferred node in RAG (#1327)
  • ee0c3d0 feat: add make help target to Makefile (#1248)

Bug Fixes 🐞

  • e39f82f fix: pin phi2 to vllm v0 (#1369)
  • 8f4fa75 fix: fix bug where fetch GPU count was failing and defaulting (#1338)
  • 06f4cbd fix: resolve pydantic deprecation warnings (#1317)
  • 78ef22d fix: image link error in scaler proposals (#1318)
  • 2163923 fix: get gpu config from status if preferred nodes provided (#1308)
  • 76e099e fix: avoid extra node creation on informercache delay (#1311)

Code Refactoring 💎

  • d2ac059 refactor: adopt generator pattern in fine-tuning part (#1292)
  • aea0a42 refactor: introduce manifest generator -- part 1 (#1284)

Continuous Integration 💜

  • 461f75c ci: update release branch prefix to 'release-' (#1371)
  • a42040d ci: Expand trivy scanning to other images (#1161)

Documentation 📘

  • 5fe9a44 docs: add release management (#1360)
  • ca4b6e2 docs: add chat/completions rag docs and split install/api docs (#1334)
  • d50af9d docs: Document Model-As-OCI-Artifacts feature (#1359)
  • 3ea17ff docs: fix ConfigMap creation sequence in example docs (#1348)
  • fa41991 docs: add aikit to integrations (#1303)
  • 1112673 docs: Kaito kubectl cli proposal (#1230)
  • 5e750b8 docs: Update documentation to use chat completions API instead of deprecated completions API (#1340)
  • 4a75426 docs: verify docs site with Algolia (#1328)
  • 336d7be docs: add Headlamp-KAITO to documentation (#1314)
  • 03d8665 docs: add search to website using Algolia (#1302)
  • ff93426 docs: update installation docs to support different cloud providers (#1247)
  • fca7291 docs: publish v0.5.1 docs (#1289)

Maintenance 🔧

  • 4ad29f8 chore: bump base image to 0.0.5 (#1364)
  • 91b38b1 chore: bump actions/setup-go from 5.4.0 to 5.5.0 (#1345)
  • 5acd3c7 chore: rename Go files to use underscore file naming convention (#1361)
  • 0389855 chore: fix references to yaml files and rename bugs (#1347)
  • 152c0e8 chore: rename .yml to .yaml extension in GH actions for consistency (#1339)
  • f00cad7 chore: bump actions/cache from 4.2.2 to 4.2.3 (#1344)
  • db679e8 chore: Revert the model used in the rag e2e test back to Phi-3 (#1342)
  • ea6216f chore: update node sku for e2e tests (#1341)
  • af43eb5 chore: remove unnecessary sleep in test (#1332)
  • fc4a847 chore: bump actions/setup-node from 4.3.0 to 4.4.0 (#1324)
  • 8c7ac49 chore: reduce verbose logs and unnecessary reconcile (#1312)
  • 725da54 chore: bump starlette from 0.40.0 to 0.47.2 in /presets/workspace/dependencies (#1290)
  • 3f85a58 chore: bump step-security/harden-runner from 2.12.2 to 2.13.0 (#1287)