create-ml-serving-app

subtitle

Deploys Cloudera AI Inference Service into an existing Kubernetes cluster.

version

0.9.126

Description

Deploys Cloudera AI Inference Service into an existing Kubernetes cluster.

Synopsis

  create-ml-serving-app
--app-name <value>
--environment-crn <value>
--cluster-crn <value>
[--provision-k8s-request <value>]
[--use-public-load-balancer | --no-use-public-load-balancer]
[--skip-validation | --no-skip-validation]
[--load-balancer-ip-whitelists <value>]
[--subnets-for-load-balancers <value>]
[--static-subdomain <value>]
[--cli-input-json <value>]
[--generate-cli-skeleton]

Options

--app-name (string)

The name of the Cloudera AI Inference Service to be created.

--environment-crn (string)

The environment CRN.

--cluster-crn (string)

The cluster CRN of an existing cluster where Cloudera AI Inference Service instance will be deployed.

--provision-k8s-request (object)

Request object for provisioning Cloudera AI Inference Service instance groups.

instanceGroups -> (array)

The instance groups.

item -> (object)

Contains the necessary info for an instance group.

instanceType -> (string)

The cloud provider instance type for the node instance.

instanceTier -> (string)

The tier of the instance i.e. on-demand/spot.

instanceCount -> (integer)

The initial number of instance node.

name -> (string)

The unique name of the instance group.

ingressRules -> (array)

The networking rules for the ingress.

item -> (string)

rootVolume -> (object)

Configuration for instance root device volume.

size -> (integer)

The volume size in GB.

autoscaling -> (object)

Configuration for instance auto scaling.

minInstances -> (integer)

The minimum number of instance for auto scaling.

maxInstances -> (integer)

The maximum number of instance for auto scaling.

enabled -> (boolean)

The boolean flag to enable the auto scaling.

environmentCrn -> (string)

The CRN of the environment for the cluster to create.

tags -> (array)

Tags to add to the cloud provider resources created. This is in addition to any tags added by Cloudera.

item -> (object)

Contains a single tag entry that will be configured on cloud resources associated with a workbench.

key -> (string)

The name for the tag.

value -> (string)

The value for the tag.

network -> (object)

Contains the information about overlay network.

plugin -> (string)

The plugin specifies specific cni vendor, ex: calico, weave etc.

topology -> (object)

Contains the information about topology.

subnets -> (array)

The options for subnets.

item -> (string)

JSON Syntax:

{
  "instanceGroups": [
    {
      "instanceType": "string",
      "instanceTier": "string",
      "instanceCount": integer,
      "name": "string",
      "ingressRules": ["string", ...],
      "rootVolume": {
        "size": integer
      },
      "autoscaling": {
        "minInstances": integer,
        "maxInstances": integer,
        "enabled": true|false
      }
    }
    ...
  ],
  "environmentCrn": "string",
  "tags": [
    {
      "key": "string",
      "value": "string"
    }
    ...
  ],
  "network": {
    "plugin": "string",
    "topology": {
      "subnets": ["string", ...]
    }
  }
}

--use-public-load-balancer | --no-use-public-load-balancer (boolean)

The boolean flag to request a public load balancer. By default, a private load balancer is used.

--skip-validation | --no-skip-validation (boolean)

Skip pre-flight validations if requested.

--load-balancer-ip-whitelists (array)

The whitelist of IPs for load balancer.

Syntax:

"string" "string" ...

--subnets-for-load-balancers (array)

The list of subnets to be used for the load balancer.

Syntax:

"string" "string" ...

--static-subdomain (string)

The static subdomain to be used for the Cloudera AI Inference Service instance.

--cli-input-json (string)

Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values.

--generate-cli-skeleton (boolean)

Prints a sample input JSON to standard output. Note the specified operation is not run if this argument is specified. The sample input can be used as an argument for --cli-input-json.

Output

appCrn -> (string)

The CRN of the Cloudera AI Inference Service created.

Form Factors

public, private