Creates a GCP Data Lake.






--datalake-name <value>
--environment-name <value>
--cloud-provider-configuration <value>
[--tags <value>]
[--scale <value>]
[--runtime <value>]
[--image <value>]
[--recipes <value>]
[--java-version <value>]
[--cli-input-json <value>]
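As an illustration of how the required and optional arguments fit together, the following Python sketch assembles an argument list without running anything. The command path `cdp datalake create-gcp-datalake` is an assumption (this reference does not state it), and `build_create_args` is a hypothetical helper name.

```python
import json


def build_create_args(datalake_name, environment_name, cloud_config,
                      scale=None, runtime=None):
    """Return an argv list for the create command (nothing is executed)."""
    # ASSUMPTION: the command path below is not stated in this reference;
    # adjust it to match your CLI installation.
    args = [
        "cdp", "datalake", "create-gcp-datalake",
        "--datalake-name", datalake_name,
        "--environment-name", environment_name,
        # Object-valued options are passed here using the JSON syntax.
        "--cloud-provider-configuration", json.dumps(cloud_config),
    ]
    # Optional arguments are appended only when a value was supplied.
    if scale is not None:
        args += ["--scale", scale]
    if runtime is not None:
        args += ["--runtime", runtime]
    return args
```

The resulting list could be handed to `subprocess.run` once the command path has been confirmed for your environment.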


--datalake-name (string)

The datalake name. This name must be unique, must have between 5 and 100 characters, and must contain only lowercase letters, numbers and hyphens. Names are case-sensitive.
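The naming rules above can be checked locally before calling the CLI. This is a minimal sketch: `is_valid_datalake_name` is a hypothetical helper, and it covers only the length and character rules, since uniqueness can only be verified by the service.

```python
import re

# Lowercase letters, digits and hyphens only; total length 5 to 100.
_NAME_RE = re.compile(r"[a-z0-9-]{5,100}")


def is_valid_datalake_name(name: str) -> bool:
    """Check the documented length and character rules for a datalake name."""
    return _NAME_RE.fullmatch(name) is not None
```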

--environment-name (string)

The environment name or CRN.

--cloud-provider-configuration (object)

Request object for GCP configuration.

serviceAccountEmail -> (string)

Email ID of the service account to be associated with the datalake IdBroker instance. This service account should have the “token.creator” role for one or more storage accounts that have access to storage.

storageLocation -> (string)

The location of the GCS bucket to be used as storage. The location has to start with gs:// followed by the bucket name.

Shorthand Syntax:

serviceAccountEmail=string,storageLocation=string

JSON Syntax:

{
  "serviceAccountEmail": "string",
  "storageLocation": "string"
}
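A small Python sketch of building this object while enforcing the `gs://` rule described for storageLocation. `make_gcp_config` is a hypothetical helper, and it checks only the prefix and that something follows it, not whether the bucket exists.

```python
def make_gcp_config(service_account_email: str, storage_location: str) -> dict:
    """Build the --cloud-provider-configuration object as a dict."""
    prefix = "gs://"
    # The location must start with gs:// and be followed by a bucket name.
    if (not storage_location.startswith(prefix)
            or len(storage_location) == len(prefix)):
        raise ValueError(
            "storageLocation must start with gs:// followed by the bucket name"
        )
    return {
        "serviceAccountEmail": service_account_email,
        "storageLocation": storage_location,
    }
```

Serializing the returned dict with `json.dumps` yields a value in the JSON syntax shown above.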

--tags (array)

Tags that can be attached to GCP Data Lake resources. Refer to the Google documentation for the rules.

Shorthand Syntax:

key=string,value=string ... (separate items with spaces)

JSON Syntax:

[
  {
    "key": "string",
    "value": "string"
  }
  ...
]
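To show how the shorthand and JSON forms relate, this sketch converts a shorthand tag string into the equivalent list of objects. `parse_tags` is a hypothetical helper and deliberately simplified: it assumes keys and values contain no spaces, commas, or quoting, which the real CLI parser may handle differently.

```python
def parse_tags(shorthand: str) -> list:
    """Convert 'key=K,value=V key=K2,value=V2' into a list of tag dicts."""
    tags = []
    # Items are separated by spaces; fields within an item by commas.
    for item in shorthand.split():
        fields = dict(pair.split("=", 1) for pair in item.split(","))
        if set(fields) != {"key", "value"}:
            raise ValueError(f"expected key=...,value=... but got {item!r}")
        tags.append({"key": fields["key"], "value": fields["value"]})
    return tags
```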

--scale (string)

Represents the available datalake scales. Defaults to LIGHT_DUTY if not set.

Possible values:




--runtime (string)

Cloudera Runtime version.

--image (object)

The image request for the datalake. When the ‘runtime’ parameter is set, only the ‘os’ parameter can be provided. Otherwise, you can use ‘catalog name’ and/or ‘id’ for selecting an image.

id -> (string)

The image ID from the catalog. The corresponding image will be used for the created cluster machines.

catalogName -> (string)

The name of the custom image catalog to use, defaulting to ‘cdp-default’ if not present.

os -> (string)

The OS of the image used for cluster instances.

Shorthand Syntax:

id=string,catalogName=string,os=string

JSON Syntax:

{
  "id": "string",
  "catalogName": "string",
  "os": "string"
}
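The interaction between ‘runtime’ and the image fields can be expressed as a pre-flight check. This is a sketch of the rule as described above, not the CLI's own validation; `check_image_request` is a hypothetical helper.

```python
def check_image_request(image: dict, runtime=None) -> None:
    """Raise ValueError if the image request breaks the documented rule."""
    allowed = {"id", "catalogName", "os"}
    unknown = set(image) - allowed
    if unknown:
        raise ValueError(f"unknown image fields: {sorted(unknown)}")
    # When a runtime is given, only 'os' may accompany it.
    if runtime is not None:
        extra = set(image) - {"os"}
        if extra:
            raise ValueError(
                f"with runtime set, only 'os' is allowed; got {sorted(extra)}"
            )
```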

--recipes (array)

Additional recipes to attach to the datalake instances, grouped by instance group (the most common groups are ‘master’ and ‘idbroker’).

Shorthand Syntax:

instanceGroupName=string,recipeNames=string,string ... (separate items with spaces)

JSON Syntax:

[
  {
    "instanceGroupName": "string",
    "recipeNames": ["string", ...]
  }
  ...
]
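A brief sketch of producing the JSON form from a mapping of instance group to recipe names. `make_recipes` is a hypothetical helper, and any recipe names passed to it are placeholders for recipes that already exist in your account.

```python
def make_recipes(by_group: dict) -> list:
    """Turn {group: [recipe, ...]} into the list expected by --recipes."""
    return [
        {"instanceGroupName": group, "recipeNames": list(names)}
        for group, names in by_group.items()
    ]
```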

--java-version (integer)

Configure the major version of Java on the cluster.

--cli-input-json (string)

Performs the service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values.

--generate-cli-skeleton (boolean)

Prints a sample input JSON to standard output. Note that the specified operation is not run if this argument is specified. The sample input can be used as an argument for --cli-input-json.
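The override behavior of --cli-input-json can be pictured as a dictionary merge in which command-line values win. This is a sketch under two assumptions that this reference does not confirm: that the skeleton uses camelCase keys such as `datalakeName`, and that overrides replace whole top-level values. `effective_input` is a hypothetical helper.

```python
import json


def effective_input(cli_input_json: str, cli_values: dict) -> dict:
    """Merge JSON-provided values with command-line values (CLI wins)."""
    merged = json.loads(cli_input_json)
    # Only arguments actually given on the command line override the JSON.
    merged.update({k: v for k, v in cli_values.items() if v is not None})
    return merged
```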


datalake -> (object)

Information about a datalake.

datalakeName -> (string)

The name of the datalake.

crn -> (string)

The CRN of the datalake.

status -> (string)

The status of the datalake.

environmentCrn -> (string)

The CRN of the environment.

creationDate -> (datetime)

The date when the datalake was created.

statusReason -> (string)

The reason for the status of the datalake.

enableRangerRaz -> (boolean)

Whether Ranger RAZ is enabled for the datalake.

certificateExpirationState -> (string)

Indicates the certificate status on the cluster.

multiAz -> (boolean)

Indicates whether the datalake is deployed across multiple availability zones.
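As a sketch of consuming the output fields listed above, the following helper condenses a parsed response into one line. `summarize_datalake` is a hypothetical name, and any status or reason strings supplied to it are illustrative only, since this reference does not enumerate the possible status values.

```python
def summarize_datalake(response: dict) -> str:
    """Return a one-line summary from a parsed response dict."""
    dl = response["datalake"]
    line = f'{dl["datalakeName"]}: {dl["status"]}'
    # statusReason and multiAz are treated as optional here.
    reason = dl.get("statusReason")
    if reason:
        line += f" ({reason})"
    if dl.get("multiAz"):
        line += " [multi-AZ]"
    return line
```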

Form Factors