Service Certificate – STACKIT Model Serving

Service Name

STACKIT Model Serving

High level service description

STACKIT Model Serving (“Model Serving”) provides open-source Large-Language-Models (“LLM”) and other GenAI-Models as shared instances. Customers can use shared instances via an OpenAI-compatible REST API. Chat and embedding models are provided. An API key is used for authentication. When using the Model Serving Service, STACKIT does not collect or evaluate any customer data other than billing-relevant data.

Key Features

Service Plans

Each model provided is assigned to a service plan. The service plans are assigned to the categories Base, Plus or Premium according to ascending model size. The assignment is described in the STACKIT portal and in the STACKIT documentation.

Metric

Billing for Model Serving is token-based based on the type of model:

SLA Specifics

In deviation from the availability specifications in the general STACKIT Service Description, an availability of 99.5% per calendar month is agreed (measured by the external availability of the LLM API).

Backup

Customer requests are not backed up.

Additional Terms

Version and start of validity

Version 1.0, valid from 04.02.2025