Apache Nutch · Schema

JobConfig

Configuration for creating a new crawl job.

Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

Properties

Name Type Description
crawlId string The crawl identifier.
type string The type of Nutch crawl job.
confId string The configuration ID to use for this job. Defaults to "default" if not specified.
jobClassName string Fully qualified class name when type is CLASS.
args object Additional arguments for the job.
View JSON Schema on GitHub