create-aws-datalake

subtitle

Creates an AWS datalake.

version

0.9.128

Description

Creates an AWS datalake.

Synopsis

  create-aws-datalake
--datalake-name <value>
--environment-name <value>
--cloud-provider-configuration <value>
[--scale <value>]
[--tags <value>]
[--runtime <value>]
[--image <value>]
[--enable-ranger-raz | --no-enable-ranger-raz]
[--enable-ranger-rms | --no-enable-ranger-rms]
[--multi-az | --no-multi-az]
[--recipes <value>]
[--custom-instance-groups <value>]
[--java-version <value>]
[--cli-input-json <value>]
[--generate-cli-skeleton]

Options

--datalake-name (string)

The datalake name. This name must be unique, must have between 5 and 40 characters, and must contain only lowercase letters, numbers and hyphens. Names are case-sensitive.

--environment-name (string)

The environment name or CRN.

--cloud-provider-configuration (object)

Request object for AWS configuration.

instanceProfile -> (string)

The ARN of an IAM instance profile.

storageBucketLocation -> (string)

The location of the S3 bucket to be used as storage. The location has to start with s3a:// followed by the bucket name.

Shorthand Syntax:

instanceProfile=string,storageBucketLocation=string

JSON Syntax:

{
  "instanceProfile": "string",
  "storageBucketLocation": "string"
}

--scale (string)

Represents the available datalake scales. Defaults to LIGHT_DUTY if not set.

Possible values:

  • LIGHT_DUTY

  • MEDIUM_DUTY_HA

  • ENTERPRISE

--tags (array)

Tags to be added to Data Lake related resources.

Shorthand Syntax:

key=string,value=string ... (separate items with spaces)

JSON Syntax:

[
  {
    "key": "string",
    "value": "string"
  }
  ...
]

--runtime (string)

Cloudera Runtime version.

--image (object)

The image request for the datalake. When the ‘runtime’ parameter is set, only the ‘os’ parameter can be provided. Otherwise, you can use ‘catalog name’ and/or ‘id’ for selecting an image.

id -> (string)

The image ID from the catalog. The corresponding image will be used for the created cluster machines.

catalogName -> (string)

The name of the custom image catalog to use, defaulting to ‘cdp-default’ if not present.

os -> (string)

The OS of the image used for cluster instances.

Shorthand Syntax:

id=string,catalogName=string,os=string

JSON Syntax:

{
  "id": "string",
  "catalogName": "string",
  "os": "string"
}

--enable-ranger-raz | --no-enable-ranger-raz (boolean)

Whether to enable Ranger RAZ for the datalake. Defaults to not being enabled.

--enable-ranger-rms | --no-enable-ranger-rms (boolean)

Whether to enable Ranger RMS for the datalake. Defaults to not being enabled.

--multi-az | --no-multi-az (boolean)

Controls if the datalake is deployed in a multi-availability zone way.

--recipes (array)

Additional recipes that will be attached on the datalake instances (by instance groups, most common ones are like ‘master’ or ‘idbroker’).

Shorthand Syntax:

instanceGroupName=string,recipeNames=string,string ... (separate items with spaces)

JSON Syntax:

[
  {
    "instanceGroupName": "string",
    "recipeNames": ["string", ...]
  }
  ...
]

--custom-instance-groups (array)

Configure custom properties on an instance group level.

Shorthand Syntax:

name=string,instanceType=string ... (separate items with spaces)

JSON Syntax:

[
  {
    "name": "string",
    "instanceType": "string"
  }
  ...
]

--java-version (integer)

Configure the major version of Java on the cluster.

--cli-input-json (string)

Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values.

--generate-cli-skeleton (boolean)

Prints a sample input JSON to standard output. Note the specified operation is not run if this argument is specified. The sample input can be used as an argument for --cli-input-json.

Output

datalake -> (object)

Information about a datalake.

datalakeName -> (string)

The name of the datalake.

crn -> (string)

The CRN of the datalake.

status -> (string)

The status of the datalake.

environmentCrn -> (string)

The CRN of the environment.

creationDate -> (datetime)

The date when the datalake was created.

statusReason -> (string)

The reason for the status of the datalake.

enableRangerRaz -> (boolean)

Whether Ranger RAZ is enabled for the datalake.

certificateExpirationState -> (string)

Indicates the certificate status on the cluster.

multiAz -> (boolean)

Flag which marks that the datalake is deployed in a multi-availability zone way or not.

tags -> (array)

Datalake tags object containing the tag values defined for the datalake.

item -> (object)

Tag for a datalake resource.

key -> (string)

The key of tag.

value -> (string)

The value of the tag.

Form Factors

public