Weaviate · Schema

TextAnalyzerConfig

Text analysis options for a property. These settings are immutable after the property is created. Applies only to text and text[] data types that use an inverted index (searchable or filterable).

Properties

| Name | Type | Description |
| --- | --- | --- |
| asciiFold | boolean | If true, accent/diacritic marks are folded to their base characters during indexing and search. For example, 'école' matches 'ecole'. Defaults to false. |
| asciiFoldIgnore | array of string | If provided, specifies a list of characters that should be excluded from ascii folding. For example, if ['é'] is provided, then 'é' will not be folded to 'e' during indexing and search. This list is immutable after the property is created. |
| stopwordPreset | string | Stopword preset name. Overrides the collection-level invertedIndexConfig.stopwords for this property. Only applies to properties using 'word' tokenization. Can be a built-in preset ('en', 'none') or a user-defined preset from invertedIndexConfig.stopwordPresets. |
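
The interaction between asciiFold and asciiFoldIgnore can be illustrated with a small Python sketch. This is not Weaviate's implementation: the helper name `ascii_fold` is hypothetical, and the folding here is approximated by Unicode decomposition (NFD) followed by dropping combining marks, which handles accented letters such as 'é' but not characters without a decomposition (for example 'ø' or 'ß').

```python
import unicodedata


def ascii_fold(text, ignore=()):
    """Approximate accent folding, skipping characters listed in `ignore`.

    Hypothetical helper for illustration only; it mimics the documented
    behaviour of asciiFold / asciiFoldIgnore, not Weaviate's actual code.
    """
    keep = set(ignore)
    folded = []
    # Normalise to NFC first so each accented letter is a single code point
    # and can be compared against the ignore list.
    for ch in unicodedata.normalize("NFC", text):
        if ch in keep:
            folded.append(ch)  # excluded from folding, kept as-is
            continue
        # Decompose into base character + combining marks, drop the marks.
        decomposed = unicodedata.normalize("NFD", ch)
        folded.append("".join(c for c in decomposed if not unicodedata.combining(c)))
    return "".join(folded)


# asciiFold: true, no ignore list
print(ascii_fold("école"))                 # ecole
# asciiFold: true, asciiFoldIgnore: ["é"]
print(ascii_fold("école", ignore=["é"]))   # école
```

Under these assumptions, 'école' and 'ecole' produce the same folded term when no ignore list is set, while listing 'é' in the ignore list keeps the two terms distinct.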

JSON Schema

weaviate-textanalyzerconfig-schema.json
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "$id": "#/components/schemas/TextAnalyzerConfig",
  "title": "TextAnalyzerConfig",
  "type": "object",
  "description": "Text analysis options for a property. These settings are immutable after the property is created. Applies only to text and text[] data types that use an inverted index (searchable or filterable).",
  "properties": {
    "asciiFold": {
      "type": "boolean",
      "description": "If true, accent/diacritic marks are folded to their base characters during indexing and search. For example, '\u00e9cole' matches 'ecole'. Defaults to false."
    },
    "asciiFoldIgnore": {
      "type": "array",
      "description": "If provided, specifies a list of characters that should be excluded from ascii folding. For example, if ['\u00e9'] is provided, then '\u00e9' will not be folded to 'e' during indexing and search. This list is immutable after the property is created.",
      "items": {
        "type": "string"
      }
    },
    "stopwordPreset": {
      "type": "string",
      "description": "Stopword preset name. Overrides the collection-level invertedIndexConfig.stopwords for this property. Only applies to properties using 'word' tokenization. Can be a built-in preset ('en', 'none') or a user-defined preset from invertedIndexConfig.stopwordPresets."
    }
  }
}
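
As a rough illustration of what the schema above accepts, the following Python sketch checks a candidate configuration against the three declared property types using only the standard library. The function name `check_text_analyzer_config` and the sample values are assumptions made for this example; a full JSON Schema validator would be the more robust choice in practice.

```python
import json

# Expected Python types for the properties declared in the schema above.
SCHEMA_TYPES = {
    "asciiFold": bool,
    "asciiFoldIgnore": list,
    "stopwordPreset": str,
}


def check_text_analyzer_config(config):
    """Return a list of type errors for a TextAnalyzerConfig-like dict.

    Hypothetical helper: it only mirrors the type constraints visible in the
    schema and performs no Weaviate-specific checks (such as tokenization).
    """
    errors = []
    for name, value in config.items():
        expected = SCHEMA_TYPES.get(name)
        if expected is None:
            # The schema does not forbid additional properties, so skip them.
            continue
        if not isinstance(value, expected):
            errors.append(f"{name}: expected {expected.__name__}")
        elif name == "asciiFoldIgnore" and not all(isinstance(v, str) for v in value):
            errors.append("asciiFoldIgnore: every item must be a string")
    return errors


# Sample instance (illustrative values only).
sample = json.loads(
    '{"asciiFold": true, "asciiFoldIgnore": ["\\u00e9"], "stopwordPreset": "en"}'
)
print(check_text_analyzer_config(sample))                 # []
print(check_text_analyzer_config({"asciiFold": "yes"}))   # one error reported
```

None of the three properties is marked as required in the schema, so an empty object also passes this check.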