feat: [SKU modularization] remove sku_config from v1alpha1 and implement skuHandler interface #601

Closed
wants to merge 7 commits

Conversation

smritidahal653 (Contributor)

Reason for Change:

  • sku_config is Azure cloud specific and is replaced by the skuHandler interface as part of the effort to modularize GPU SKUs; it is deleted from the v1alpha1 package.
  • Implement skuHandler to get GPU configs wherever SupportedGPUConfigs from the sku_config file was previously used (see the sketch below).
  • Fix test cases to use the updated SKUs after the skuHandler implementation.
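
A minimal sketch of what such a SKU handler abstraction could look like, assuming a single GPU-config lookup method. All type, method, and constructor names below are hypothetical and may not match the actual code in this PR:

```go
// Hypothetical sketch of the skuHandler idea; names are illustrative only.
package main

import "fmt"

// GPUConfig captures the per-SKU GPU properties that callers previously
// looked up in the Azure-specific SupportedGPUConfigs map.
type GPUConfig struct {
	SKU      string
	GPUCount int
	GPUMemGB int
	GPUModel string
}

// SKUHandler hides SKU lookups behind an interface so each cloud provider
// can supply its own implementation.
type SKUHandler interface {
	// GetGPUConfigs returns all GPU SKU configurations known to this provider.
	GetGPUConfigs() map[string]GPUConfig
}

// azureSKUHandler is an illustrative Azure-backed implementation using a
// static map (a stand-in for the data that used to live in sku_config).
type azureSKUHandler struct {
	configs map[string]GPUConfig
}

func NewAzureSKUHandler() SKUHandler {
	return &azureSKUHandler{
		configs: map[string]GPUConfig{
			// One well-known Azure GPU SKU as an example entry.
			"Standard_NC6s_v3": {SKU: "Standard_NC6s_v3", GPUCount: 1, GPUMemGB: 16, GPUModel: "NVIDIA V100"},
		},
	}
}

func (a *azureSKUHandler) GetGPUConfigs() map[string]GPUConfig {
	return a.configs
}

func main() {
	handler := NewAzureSKUHandler()
	if cfg, ok := handler.GetGPUConfigs()["Standard_NC6s_v3"]; ok {
		fmt.Printf("%s: %dx %s, %d GiB GPU memory\n", cfg.SKU, cfg.GPUCount, cfg.GPUModel, cfg.GPUMemGB)
	}
}
```

With an interface like this, the cloud-specific SKU data stays inside the provider implementation, and code that previously read SupportedGPUConfigs from sku_config only depends on the interface.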

Requirements

  • added unit tests and e2e tests (if applicable).

Notes for Reviewers:

smritidahal653 and others added 7 commits September 19, 2024 13:34
This PR adds the initial draft for the RAGEngine CRD in Kaito.

A RAGEngine CRD defines all resources needed to run RAG on top of an LLM inference service. Upon creating a RAGEngine CR, a new controller will create a deployment which runs a RAG engine instance. The instance provides HTTP endpoints for both `index` and `query` services. The instance can optionally use a public model embedding service or run a local embedding model on a GPU to convert the input index data to vectors. The instance can also connect to a vector DB instance to persist the vectors, or it uses an in-memory vector DB by default. The instance uses the `llamaIndex` library to orchestrate the workflow. When the RAGEngine instance is up and running, users should send questions to the `query` endpoint of the RAG instance instead of the normal `chat` endpoint in the inference service.

The RAGEngine is intended to be "standalone". It can use any public inference service or inference services hosted by a Kaito workspace.

The RAG engine instance is designed to help retrieve prompts from unstructured data (arbitrary index data provided by the users). Retrieving from structured data or a search engine is out of scope for now.
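
To make the pieces described above concrete, here is a rough sketch of how a RAGEngine spec could be modeled as Go types. This is not taken from the PR; the package, type, and field names are assumptions for illustration only, and the actual CRD schema may differ:

```go
// Hypothetical Go types for a RAGEngine spec, derived only from the commit
// message above; all package, type, and field names are assumptions.
package api

// RemoteEmbedding points the engine at a public model embedding service.
type RemoteEmbedding struct {
	Endpoint string `json:"endpoint"`
}

// LocalEmbedding runs an embedding model on a GPU inside the RAG instance.
type LocalEmbedding struct {
	Model string `json:"model"`
}

// EmbeddingSpec selects either a remote embedding service or a local model.
type EmbeddingSpec struct {
	Remote *RemoteEmbedding `json:"remote,omitempty"`
	Local  *LocalEmbedding  `json:"local,omitempty"`
}

// RAGEngineSpec describes what is needed to run RAG on top of an LLM
// inference service. An empty VectorDBEndpoint means the instance falls
// back to its in-memory vector DB.
type RAGEngineSpec struct {
	// InferenceServiceURL is the LLM inference service the engine queries.
	InferenceServiceURL string `json:"inferenceServiceURL"`
	// Embedding configures how index data is converted to vectors.
	Embedding EmbeddingSpec `json:"embedding"`
	// VectorDBEndpoint optionally persists vectors in an external vector DB.
	VectorDBEndpoint string `json:"vectorDBEndpoint,omitempty"`
}
```

A RAGEngine CR would then pick exactly one embedding option and, optionally, an external vector DB endpoint; omitting the endpoint keeps the default in-memory store.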