Apache Nutch · Schema

FetchNodeDbInfo

Information about a fetched node in the FetchDB.

Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

Properties

Name Type Description
url string The URL of the fetched node.
status integer The HTTP status code of the fetch.
numOfOutlinks integer The number of outgoing links discovered.
children array The outgoing links from this node.
View JSON Schema on GitHub