JobSample

A sample configuration for profile jobs only, which determines the number of rows on which the profile job is run. If a JobSample value isn't provided, the default is used. The default value is CUSTOM_ROWS for the mode parameter and 20,000 for the size parameter.

Data Analytics, Data Preparation, ETL, Machine Learning

Properties

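Name    Type      Description
Mode    object    Determines whether the profile job is run on the entire dataset (FULL_DATASET) or on a specified number of rows (CUSTOM_ROWS).
Size    object    The number of rows on which the profile job is run. Only required when the mode is CUSTOM_ROWS. The maximum value is Long.MAX_VALUE (9223372036854775807).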

JSON Schema

glue-databrew-job-sample-schema.json
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "$id": "https://raw.githubusercontent.com/api-evangelist/amazon-glue-databrew/refs/heads/main/json-schema/glue-databrew-job-sample-schema.json",
  "title": "JobSample",
  "description": "A sample configuration for profile jobs only, which determines the number of rows on which the profile job is run. If a <code>JobSample</code> value isn't provided, the default is used. The default value is CUSTOM_ROWS for the mode parameter and 20,000 for the size parameter.",
  "type": "object",
  "properties": {
    "Mode": {
      "allOf": [
        {
          "$ref": "#/components/schemas/SampleMode"
        },
        {
          "description": "<p>A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:</p> <ul> <li> <p>FULL_DATASET - The profile job is run on the entire dataset.</p> </li> <li> <p>CUSTOM_ROWS - The profile job is run on the number of rows specified in the <code>Size</code> parameter.</p> </li> </ul>"
        }
      ]
    },
    "Size": {
      "allOf": [
        {
          "$ref": "#/components/schemas/JobSize"
        },
        {
          "description": "<p>The <code>Size</code> parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.</p> <p>Long.MAX_VALUE = 9223372036854775807</p>"
        }
      ]
    }
  }
}
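
The rules stated in the descriptions above (allowed modes, the documented defaults, and the Size ceiling) can be sketched as a small Python helper. This is a minimal illustration, not part of the DataBrew API: the function name `make_job_sample` and the constant names are hypothetical, the lower bound of 1 on Size is an assumption the schema does not state, and omitting Size for FULL_DATASET is a choice made here because Size is only required for CUSTOM_ROWS.

```python
# Long.MAX_VALUE, as given in the Size description.
LONG_MAX = 9223372036854775807

# The two mode values listed in the Mode description.
VALID_MODES = {"FULL_DATASET", "CUSTOM_ROWS"}

# Documented defaults used when no JobSample value is provided.
DEFAULT_MODE = "CUSTOM_ROWS"
DEFAULT_SIZE = 20000


def make_job_sample(mode=DEFAULT_MODE, size=None):
    """Hypothetical helper: build a JobSample-shaped dict.

    Returns a dict with a "Mode" key and, for CUSTOM_ROWS, a "Size" key.
    Raises ValueError when a value falls outside the documented rules.
    """
    if mode not in VALID_MODES:
        raise ValueError(f"Mode must be one of {sorted(VALID_MODES)}, got {mode!r}")

    if mode == "FULL_DATASET":
        # Size is only required for CUSTOM_ROWS, so this sketch leaves it out.
        return {"Mode": mode}

    if size is None:
        size = DEFAULT_SIZE

    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(size, bool) or not isinstance(size, int):
        raise ValueError("Size must be an integer")

    # Assumption: at least one row. The upper bound comes from the schema text.
    if not 1 <= size <= LONG_MAX:
        raise ValueError(f"Size must be between 1 and {LONG_MAX}")

    return {"Mode": mode, "Size": size}


if __name__ == "__main__":
    print(make_job_sample())                     # documented defaults
    print(make_job_sample("FULL_DATASET"))       # whole dataset, no Size
    print(make_job_sample("CUSTOM_ROWS", 5000))  # explicit row count
```

The resulting dict has the same shape as the JobSample object described by the schema, so it could be serialized to JSON as-is.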