Apache Nutch · Schema

ChildNode

A child (outlink) of a fetched node.

Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

Properties

Name Type Description
childUrl string The URL of the child node.
anchorText string The anchor text of the link.
View JSON Schema on GitHub

JSON Schema

apache-nutch-child-node-schema.json Raw ↑
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-schema/apache-nutch-child-node-schema.json",
  "title": "ChildNode",
  "description": "A child (outlink) of a fetched node.",
  "type": "object",
  "properties": {
    "childUrl": {
      "type": "string",
      "description": "The URL of the child node."
    },
    "anchorText": {
      "type": "string",
      "description": "The anchor text of the link."
    }
  }
}