ubicloud

Files

Benjamin Satzger 09cec4e2bc Set max_concurrency of inference gateway

In a previous commit we added max_concurrency to inference endpoints. In
this commit we set inference gateway's corresponding setting
IG_MAX_REQUESTS to that value during replica setup.

2025-02-11 17:47:12 +01:00

download-lb-cert

Rhizome for inference endpoints

2024-10-01 15:36:57 +02:00

setup-replica

Set max_concurrency of inference gateway

2025-02-11 17:47:12 +01:00