This commit introduces a new `InferenceTokens` billing resource type and adds billing rates that define how much a user is charged per inference token. The charge depends on the number of tokens used and on the type of model used for inference.
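
As a rough illustration of the calculation described above, the sketch below multiplies a per-token rate by the token count, with the rate looked up by model type. The model type names, the rate values, and the function name `inference_token_charge` are all hypothetical and are not taken from this commit.

```python
from decimal import Decimal

# Hypothetical per-token rates keyed by model type. The real rates and
# model type names introduced by this commit are not shown here.
RATES_PER_TOKEN = {
    "small": Decimal("0.000001"),
    "large": Decimal("0.000010"),
}


def inference_token_charge(model_type: str, tokens: int) -> Decimal:
    """Return the charge for `tokens` inference tokens on `model_type`."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    try:
        rate = RATES_PER_TOKEN[model_type]
    except KeyError:
        raise ValueError(f"no billing rate for model type {model_type!r}") from None
    # Decimal avoids the rounding drift that binary floats would introduce
    # when many small per-token charges are summed.
    return rate * tokens
```

With these assumed rates, the same token count costs ten times more on the "large" model than on the "small" one.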