Scalable Inference Serving · Schema
RequestOutput
Specifies which output tensor to include in the response.
AICNCFDeploymentInferenceKubernetesLLMMachine LearningModel ServingMLOpsScalability
Specifies which output tensor to include in the response.