ubicloud

Files

Junhao Li 1fb9add186 Create control plane for inference router

Creates the Clover control plane that manages the lifecycle
of inference router replicas, which will handle inference requests
across all models and route them to an appropriate inference
endpoint based on priority, capacity, and cache characteristics.

2025-04-24 11:51:46 -04:00

ai_models.yml

Update ai_models.yml for vLLM 0.8

2025-03-20 11:28:39 +01:00

billing_rates.yml

Adjust Billing Rates for k8s

2025-04-03 12:50:38 +03:00

default_quotas.yml

Use vCPU in quotas instead of cores

2025-01-03 13:26:43 +01:00

e2e_test_cases.yml

Update postgres image version for E2E