Apache Nutch · Schema

SeedList

A named list of seed URLs.

Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

Properties

Name Type Description
id integer The seed list identifier.
name string A human-readable name for this seed list.
seedFilePath string The HDFS path where the seed file is stored. Populated after creation.
seedUrls array The collection of seed URLs in this list.
View JSON Schema on GitHub