Creates an Azure datalake.




Creates an Azure datalake.


--datalake-name <value>
--environment-name <value>
--cloud-provider-configuration <value>
[--scale <value>]
[--tags <value>]
[--runtime <value>]
[--image <value>]
[--load-balancer-sku <value>]
[--enable-ranger-raz | --no-enable-ranger-raz]
[--recipes <value>]
[--custom-instance-groups <value>]
[--cli-input-json <value>]


--datalake-name (string)

The datalake name. This name must be unique, must have between 5 and 100 characters, and must contain only lowercase letters, numbers and hyphens. Names are case-sensitive.

--environment-name (string)

The environment name or CRN.

--cloud-provider-configuration (object)

Request object for Azure configuration.

managedIdentity -> (string)

The managed identity to use. The assumer should have Virtual Machine Contributor and Managed Identity Operator roles on subscription level.

storageLocation -> (string)

The storage location to use. The location has to be in the following format abfs:// The filesystem must already exist and the storage account must be StorageV2.

Shorthand Syntax:


JSON Syntax:

  "managedIdentity": "string",
  "storageLocation": "string"

--scale (string)

Represents the available datalake scales. Defaults to LIGHT_DUTY if not set.

Possible values:



--tags (array)

Tags to be added to Data Lake related resources.

Shorthand Syntax:

key=string,value=string ... (separate items with spaces)

JSON Syntax:

    "key": "string",
    "value": "string"

--runtime (string)

Cloudera Runtime version.

--image (object)

The image request for the datalake. This must not be set if the runtime parameter is provided. The image ID parameter is required if this is present, but the image catalog name is optional, defaulting to ‘cdp-default’ if not present.

id -> (string)

The image ID from the catalog. The corresponding image will be used for the created cluster machines.

catalogName -> (string)

The name of the custom image catalog to use.

Shorthand Syntax:


JSON Syntax:

  "id": "string",
  "catalogName": "string"

--load-balancer-sku (string)

Represents the Azure load balancer SKU type. The current default is BASIC. To disable the load balancer, use type NONE.

Possible values:



  • NONE

--enable-ranger-raz | --no-enable-ranger-raz (boolean)

Whether to enable Ranger RAZ for the datalake. Defaults to not being enabled.

--recipes (array)

Additional recipes that will be attached on the datalake instances (by instance groups, most common ones are like ‘master’ or ‘idbroker’).

Shorthand Syntax:

instanceGroupName=string,recipeNames=string,string ... (separate items with spaces)

JSON Syntax:

    "instanceGroupName": "string",
    "recipeNames": ["string", ...]

--custom-instance-groups (array)

Configure custom properties on an instance group level.

Shorthand Syntax:

name=string,instanceType=string ... (separate items with spaces)

JSON Syntax:

    "name": "string",
    "instanceType": "string"

--cli-input-json (string)

Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values.

--generate-cli-skeleton (boolean)

Prints a sample input JSON to standard output. Note the specified operation is not run if this argument is specified. The sample input can be used as an argument for --cli-input-json.


datalake -> (object)

Information about a datalake.

datalakeName -> (string)

The name of the datalake.

crn -> (string)

The CRN of the datalake.

status -> (string)

The status of the datalake.

environmentCrn -> (string)

The CRN of the environment.

creationDate -> (datetime)

The date when the datalake was created.

statusReason -> (string)

The reason for the status of the datalake.

enableRangerRaz -> (boolean)

Whether Ranger RAZ is enabled for the datalake.

certificateExpirationState -> (string)

Indicates the certificate status on the cluster.

multiAz -> (boolean)

Flag which marks that the datalake is deployed in a multi-availability zone way or not.

Form Factors