Skip to content

v0.4.0

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 06 Dec 01:35
· 26 commits to main since this release
6a75817

v0.4.0 - 2024-12-06

Changelog

Features 🌈

  • e0f28f0 feat: Handle HF Remote API Call Format (#751)
  • 0f9a11d feat: support vllm in controller (#635)
  • 5269bd7 feat: bump accelerate to 1.0.0 (#739)
  • f5d0958 feat: Update Llama Endpoint (#738)
  • 2cb5710 feat: add tuning test to preset test (#741)
  • f7e6d66 feat: [SKU modularization] AWS chart changes (#710)
  • 391b398 feat: Add flag for running 1ES Public Models (#733)
  • 0aea28e feat: Custom Dockerfile update BaseImage (#724)
  • 0087e09 feat: add preset test for vllm (#694)
  • c25e7e9 feat: RAG service health check (#704)
  • f3ef4c8 feat: RAG engine validation (#691)
  • 1c6eb2e feat: support adaptive max_model_len (#657)
  • cafb947 feat: RAG engine deployment creation (#660)
  • 2ecfdf1 feat: RAG engine controller revision (#682)
  • 79494a2 feat: Dockerfile for Kaito RAG Service (#680)
  • 9f5632a feat: Migrate E2E to Self-Hosted Runner (#641)
  • 1676c0d feat: Runner Setup Script (#676)
  • 71ddc55 feat: Introduce Abstract Class for Integration Testing (#674)
  • ad0dde9 feat: Update VectorStore Base class (#673)
  • 7bea782 feat: run e2e test in parallel (#667)
  • 1709ba0 feat: package vllm runtime into image (#655)
  • 6b216fc feat: Add delete and finalizer to RAGEngine (#646)
  • 1d09da0 feat: implement inference server by using vllm (#624)
  • 8906190 feat: Part 4 (Final) - Introduce Main RAG Service API and its tests (#603)
  • 791c175 feat: add printcolumn to RAG Engine (#623)
  • 544df3f feat: add Nodeclaim & Machine provision to RAG Engine controller (#622)
  • 941170b feat: Part 3 - Introduce Vector Store Manager and Vector Store Class (#633)
  • 65b844a Revert "feat: Migrate E2E pipeline to using Self-Hosted Runner" (#642)
  • b6694c2 feat: Migrate E2E pipeline to using Self-Hosted Runner (#638)
  • 314a80e feat: Revert the refactoring of RAGEngineStatus and WorkspaceStatus (#636)
  • 870a93d feat: Part 2 - Add custom LLM inference class (#630)
  • 1d99028 feat: Part 1 - Add RAG Embedding Interface (#628)
  • 152e683 feat: refact updateStatusConditionIfNotMatch for both RAG and workspace (#626)
  • 920ada5 feat: refactor updateObjStatus for both RAG and workspace (#625)
  • 1818551 feat: Update RAG Status (#621)
  • f613bb4 feat: update of functions related to nodeclaim and machine for RAG engine (#620)
  • 38656dd feat: Clusterrole and Webhook update for RAG Engine (#619)
  • cccb1cb feat: add WorkerNodes to RAGEngineStatus (#612)
  • ba1a62d feat: Add ragengine controller scaffolding code and chart (#600)
  • a06cf97 feat: [SKU modularization] remove sku_config from v1alpha1 and implement skuHandler interface (#602)
  • 2cdc682 feat: Add RAGEngine CRD (#597)
  • f3d6e09 feat: Options for Building and Running Private/Custom Models (#598)

Bug Fixes 🐞

Code Refactoring πŸ’Ž

  • 2d57916 refactor: Move RAG Service under Presets (#715)

Documentation πŸ“˜

  • 692a7da docs: update for multi-runtime support (#754)
  • 711c858 docs: [SKU modularizastion] Add AWS installation documentation (#711)
  • 00056b5 docs: Update installation.md (#736)
  • f889920 docs: Add guide for running Kaito on BYO GPU nodes (#732)
  • 2139dfe docs: Update helm list command in installation guide to use new namespaces. (#730)
  • 64c8ffb docs: update docs with 0.3.2 release (#700)
  • 58894ba docs: fix terraform and update readme (#637)
  • 6481b76 docs: quick deploy using terraform (#634)
  • 6b8bc80 docs: Update README with the new release (#592)

Maintenance πŸ”§

  • 0e80023 chore: switch buildkit image to mcr registry
  • 1e4e699 chore: Mark ragengine as WIP for helm installation (#758)
  • 3bc450b chore: bump actions/dependency-review-action from 4.3.4 to 4.5.0 (#714)
  • 69986b0 chore: bump codecov/codecov-action from 4.6.0 to 5.0.7 (#716)
  • 3a68fe8 chore: bump actions/setup-go from 5.0.2 to 5.1.0 (#687)
  • ff41d1c chore: add zhuangqh to codeowners (#701)
  • fcd5d1c chore: restruct workspace controller code - part 4 (#685)
  • a057d70 chore: restruct workspace controller code - part 3 (#684)
  • 3c873ec chore: restruct workspace controller code - part 2 (#683)
  • e886346 chore: restruct workspace controller code - part 1 (#675)
  • 38fae09 chore: bump step-security/harden-runner from 2.9.1 to 2.10.1 (#596)
  • 79e425c chore: bump github.com/Azure/karpenter-provider-azure from 0.5.1 to 0.5.4 (#599)
  • 1fb9989 chore: bump azure/CLI from 2.0.0 to 2.1.0 (#588)
  • b97ab11 chore: refactor to move ragengine to a central package (#671)
  • 2ca998e chore: removed Microsoft trademark, updated contributing guidelines, CoC in readme (#672)
  • 40d1321 chore: Updated to CNCF CoC, Maintainers file (#670)
  • 1248109 chore: clean up build cmds for workspace (#668)
  • 8f894bb chore: bump codecov/codecov-action from 4.5.0 to 4.6.0 (#613)
  • 00ad1f6 chore: bump azure/login from 2.1.1 to 2.2.0 (#627)
  • bf12222 chore: bump actions/checkout from 4.1.7 to 4.2.2 (#647)
  • 5f2f649 chore: Renaming to reflect updated repo (#663)
  • f35ca31 chore: bump azure/login from 2.1.1 to 2.2.0 (#604)
  • bcc0276 chore: Update Phi README.md (#593)