Letta · Schema

SGLangModelSettings

SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing).

AIAgentsStateful AgentsMemoryMemGPTContinual LearningMCPMulti-AgentRAGOpen Source

Properties

Name Type Description
max_output_tokens integer The maximum number of tokens the model can generate.
parallel_tool_calls boolean Whether to enable parallel tool calling.
provider_type string The type of the provider.
temperature number The temperature of the model.
reasoning object The reasoning configuration for the model.
response_format object The response format for the model.
strict boolean Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.
tool_call_parser object SGLang tool call parser name (for example 'glm47', 'qwen25', or 'hermes').
View JSON Schema on GitHub

JSON Schema

letta-sglangmodelsettings-schema.json Raw ↑
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "$id": "#/components/schemas/SGLangModelSettings",
  "title": "SGLangModelSettings",
  "properties": {
    "max_output_tokens": {
      "type": "integer",
      "title": "Max Output Tokens",
      "description": "The maximum number of tokens the model can generate.",
      "default": 4096
    },
    "parallel_tool_calls": {
      "type": "boolean",
      "title": "Parallel Tool Calls",
      "description": "Whether to enable parallel tool calling.",
      "default": true
    },
    "provider_type": {
      "type": "string",
      "const": "sglang",
      "title": "Provider Type",
      "description": "The type of the provider.",
      "default": "sglang"
    },
    "temperature": {
      "type": "number",
      "title": "Temperature",
      "description": "The temperature of the model.",
      "default": 0.7
    },
    "reasoning": {
      "$ref": "#/components/schemas/OpenAIReasoning",
      "description": "The reasoning configuration for the model.",
      "default": {
        "reasoning_effort": "high"
      }
    },
    "response_format": {
      "anyOf": [
        {
          "oneOf": [
            {
              "$ref": "#/components/schemas/TextResponseFormat"
            },
            {
              "$ref": "#/components/schemas/JsonSchemaResponseFormat"
            },
            {
              "$ref": "#/components/schemas/JsonObjectResponseFormat"
            }
          ],
          "discriminator": {
            "propertyName": "type",
            "mapping": {
              "json_object": "#/components/schemas/JsonObjectResponseFormat",
              "json_schema": "#/components/schemas/JsonSchemaResponseFormat",
              "text": "#/components/schemas/TextResponseFormat"
            }
          }
        },
        {
          "type": "null"
        }
      ],
      "title": "Response Format",
      "description": "The response format for the model."
    },
    "strict": {
      "type": "boolean",
      "title": "Strict",
      "description": "Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.",
      "default": true
    },
    "tool_call_parser": {
      "anyOf": [
        {
          "type": "string"
        },
        {
          "type": "null"
        }
      ],
      "title": "Tool Call Parser",
      "description": "SGLang tool call parser name (for example 'glm47', 'qwen25', or 'hermes')."
    }
  },
  "type": "object",
  "description": "SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing)."
}