Apache Nutch · Schema

NodeSchema

Schema describing the fields in a node reader response.

Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

Properties

Name Type Description
key_url string
num_inlinks string
num_outlinks string
inlink_score string
outlink_score string
metadata string
View JSON Schema on GitHub

JSON Schema

apache-nutch-node-schema-schema.json Raw ↑
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-schema/apache-nutch-node-schema-schema.json",
  "title": "NodeSchema",
  "description": "Schema describing the fields in a node reader response.",
  "type": "object",
  "properties": {
    "key_url": {
      "type": "string",
      "example": "string"
    },
    "num_inlinks": {
      "type": "string",
      "example": "int"
    },
    "num_outlinks": {
      "type": "string",
      "example": "int"
    },
    "inlink_score": {
      "type": "string",
      "example": "float"
    },
    "outlink_score": {
      "type": "string",
      "example": "float"
    },
    "metadata": {
      "type": "string",
      "example": "string"
    }
  }
}