GetDeploymentResponse

AI deployment

Properties

| Name | Type | Description | Notes |
| --- | --- | --- | --- |
| gpuCount | Long | Number of GPUs | [optional] |
| updatedAt | OffsetDateTime | Update time | [optional] [readonly] |
| deploymentUrl | String | Deployment URL (nullable) | [optional] |
| serviceLevel | String | Service level | [optional] |
| inferenceEngineVersion | InferenceEngineVersion | | [optional] |
| name | String | Deployment name | [optional] |
| state | StateEnum | Deployment state | [optional] |
| gpuType | String | GPU type family | [optional] |
| id | UUID | Deployment ID | [optional] [readonly] |
| replicas | Long | Number of replicas (>=0) | [optional] |
| stateDetails | String | Deployment state details | [optional] |
| createdAt | OffsetDateTime | Creation time | [optional] [readonly] |
| inferenceEngineParameters | List<String> | Optional extra inference engine server CLI args | [optional] |
| model | ModelRef | | [optional] |
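
Because every property above is optional and deploymentUrl is also nullable, calling code usually needs to guard against missing values. The following is a minimal sketch of that handling, not the generated class: `DeploymentSketch`, `usableUrl`, and `replicasOrZero` are hypothetical names introduced for illustration, and the state is kept as a plain string to keep the example self-contained.

```java
import java.util.Optional;

// Hypothetical stand-in for GetDeploymentResponse. Only four of the documented
// properties are modelled here; the real generated class may look different.
final class DeploymentSketch {
    final String name;
    final String state;          // one of "ready", "creating", "error", "deploying"
    final Long replicas;         // optional, so it may be null
    final String deploymentUrl;  // documented as nullable

    DeploymentSketch(String name, String state, Long replicas, String deploymentUrl) {
        this.name = name;
        this.state = state;
        this.replicas = replicas;
        this.deploymentUrl = deploymentUrl;
    }

    // Returns the URL only when the deployment is ready and a URL is present.
    Optional<String> usableUrl() {
        if (!"ready".equals(state)) {
            return Optional.empty();
        }
        return Optional.ofNullable(deploymentUrl);
    }

    // Treats a missing replica count as zero.
    long replicasOrZero() {
        return replicas == null ? 0L : replicas;
    }

    public static void main(String[] args) {
        DeploymentSketch d =
            new DeploymentSketch("demo", "ready", 2L, "https://example.invalid/v1");
        System.out.println(d.usableUrl().orElse("(no URL yet)"));
        System.out.println(d.replicasOrZero());
    }
}
```

Wrapping the URL in `Optional` is one design choice among several; a null check at the call site would work equally well.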

Enum: StateEnum

| Name | Value |
| --- | --- |
| READY | "ready" |
| CREATING | "creating" |
| ERROR | "error" |
| DEPLOYING | "deploying" |
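
The name/value pairs above map naturally onto a Java enum that carries its wire value. The sketch below is a standalone illustration under that assumption: `DeploymentStateSketch` and its `getValue`/`fromValue` helpers are hypothetical names, and the generated `StateEnum` may expose a different surface.

```java
// Hypothetical mirror of StateEnum: each constant carries the string value
// listed in the table above.
enum DeploymentStateSketch {
    READY("ready"),
    CREATING("creating"),
    ERROR("error"),
    DEPLOYING("deploying");

    private final String value;

    DeploymentStateSketch(String value) {
        this.value = value;
    }

    // The string as it appears in the API payload.
    public String getValue() {
        return value;
    }

    // Looks up a constant by its wire value; rejects anything not in the table.
    public static DeploymentStateSketch fromValue(String value) {
        for (DeploymentStateSketch s : values()) {
            if (s.value.equals(value)) {
                return s;
            }
        }
        throw new IllegalArgumentException("Unexpected value '" + value + "'");
    }
}
```

For example, `DeploymentStateSketch.fromValue("deploying")` yields `DEPLOYING`, while an unlisted string such as `"stopped"` raises `IllegalArgumentException`.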