Reducto · Schema
ExtractSettings
Reducto ExtractSettings schema
Document ParsingPDFOCRData ExtractionAIMachine LearningDocument IntelligenceStructured Data
Properties
| Name | Type | Description |
|---|---|---|
| include_images | boolean | If True, include images in the extraction. |
| optimize_for_latency | boolean | If True, jobs will be processed with a higher throughput and priority at a higher cost. Defaults to False. |
| array_extract | boolean | If True, use array extraction. |
| deep_extract | boolean | If True, use Deep Extract, an agentic extraction mode that iteratively refines its output to achieve near-perfect accuracy. Best for complex documents where accuracy is critical. |
| citations | object | The citations to use for the extraction. |
JSON Schema
{
"$schema": "https://json-schema.org/draft/2020-12/schema",
"$id": "https://raw.githubusercontent.com/api-evangelist/reducto/refs/heads/main/json-schema/reducto-extractsettings.json",
"title": "ExtractSettings",
"description": "Reducto ExtractSettings schema",
"properties": {
"include_images": {
"type": "boolean",
"title": "Include Images",
"description": "If True, include images in the extraction.",
"default": false
},
"optimize_for_latency": {
"type": "boolean",
"title": "Optimize For Latency",
"description": "If True, jobs will be processed with a higher throughput and priority at a higher cost. Defaults to False.",
"default": false
},
"array_extract": {
"type": "boolean",
"title": "Array Extract",
"description": "If True, use array extraction.",
"default": false
},
"deep_extract": {
"type": "boolean",
"title": "Deep Extract",
"description": "If True, use Deep Extract, an agentic extraction mode that iteratively refines its output to achieve near-perfect accuracy. Best for complex documents where accuracy is critical.",
"default": false
},
"citations": {
"$ref": "#/components/schemas/Citations",
"description": "The citations to use for the extraction.",
"default": {
"enabled": false,
"numerical_confidence": true
}
}
},
"type": "object"
}