create-ml-serving-app¶
- subtitle
Deploys Cloudera AI Inference Service into an existing Kubernetes cluster.
- version
0.9.126
Description¶
Deploys Cloudera AI Inference Service into an existing Kubernetes cluster.
Synopsis¶
create-ml-serving-app
--app-name <value>
--environment-crn <value>
--cluster-crn <value>
[--provision-k8s-request <value>]
[--use-public-load-balancer | --no-use-public-load-balancer]
[--skip-validation | --no-skip-validation]
[--load-balancer-ip-whitelists <value>]
[--subnets-for-load-balancers <value>]
[--static-subdomain <value>]
[--cli-input-json <value>]
[--generate-cli-skeleton]
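A minimal sketch of a call that sets only the three options shown without brackets in the synopsis. The application name and the CRN placeholders are hypothetical, and the `cdp ml` prefix is an assumption about how the CLI groups this command. The command is printed for review rather than executed.

```shell
# Placeholder values (assumptions): replace with a real name and real CRNs.
APP_NAME="my-inference-app"
ENVIRONMENT_CRN="<environment-crn>"
CLUSTER_CRN="<cluster-crn>"

# Assemble the call as positional parameters so each value stays one argument.
# The "cdp ml" prefix is an assumption, not taken from this page.
set -- cdp ml create-ml-serving-app \
  --app-name "$APP_NAME" \
  --environment-crn "$ENVIRONMENT_CRN" \
  --cluster-crn "$CLUSTER_CRN"

# Print the assembled command for review; run "$@" instead to execute it.
CMD_PREVIEW="$*"
printf '%s\n' "$CMD_PREVIEW"
```

Replacing the final `printf` with `"$@"` would run the command with exactly the arguments shown.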
Options¶
--app-name
(string)
The name of the Cloudera AI Inference Service to be created.
--environment-crn
(string)
The environment CRN.
--cluster-crn
(string)
The CRN of the existing cluster where the Cloudera AI Inference Service instance will be deployed.
--provision-k8s-request
(object)
Request object for provisioning Cloudera AI Inference Service instance groups.
instanceGroups -> (array)
The instance groups.
  item -> (object)
  Contains the necessary information for an instance group.
    instanceType -> (string)
    The cloud provider instance type for the node instance.
    instanceTier -> (string)
    The tier of the instance, i.e. on-demand or spot.
    instanceCount -> (integer)
    The initial number of instance nodes.
    name -> (string)
    The unique name of the instance group.
    ingressRules -> (array)
    The networking rules for the ingress.
      item -> (string)
    rootVolume -> (object)
    Configuration for the instance root device volume.
      size -> (integer)
      The volume size in GB.
    autoscaling -> (object)
    Configuration for instance auto scaling.
      minInstances -> (integer)
      The minimum number of instances for auto scaling.
      maxInstances -> (integer)
      The maximum number of instances for auto scaling.
      enabled -> (boolean)
      The boolean flag to enable auto scaling.
environmentCrn -> (string)
The CRN of the environment for the cluster to be created.
tags -> (array)
Tags to add to the cloud provider resources created. This is in addition to any tags added by Cloudera.
  item -> (object)
  Contains a single tag entry that will be configured on cloud resources associated with a workbench.
    key -> (string)
    The name for the tag.
    value -> (string)
    The value for the tag.
network -> (object)
Contains information about the overlay network.
  plugin -> (string)
  The plugin specifies the CNI vendor, e.g. calico or weave.
  topology -> (object)
  Contains information about the topology.
    subnets -> (array)
    The options for subnets.
      item -> (string)
JSON Syntax:
{
  "instanceGroups": [
    {
      "instanceType": "string",
      "instanceTier": "string",
      "instanceCount": integer,
      "name": "string",
      "ingressRules": ["string", ...],
      "rootVolume": {
        "size": integer
      },
      "autoscaling": {
        "minInstances": integer,
        "maxInstances": integer,
        "enabled": true|false
      }
    }
    ...
  ],
  "environmentCrn": "string",
  "tags": [
    {
      "key": "string",
      "value": "string"
    }
    ...
  ],
  "network": {
    "plugin": "string",
    "topology": {
      "subnets": ["string", ...]
    }
  }
}
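As an illustration of the JSON syntax above, the following sketch fills in one instance group and one tag. All values are hypothetical, the exact accepted spelling of `instanceTier` is an assumption, and this page does not state which fields are required. The check that the minimum does not exceed the maximum is a local sanity check, not a documented rule.

```shell
# Hypothetical autoscaling bounds for a single instance group.
MIN_INSTANCES=1
MAX_INSTANCES=3

# Local sanity check (an assumption, not a documented constraint).
if [ "$MIN_INSTANCES" -gt "$MAX_INSTANCES" ]; then
  echo "minInstances must not exceed maxInstances" >&2
  exit 1
fi

# Build the request object as a JSON string; the shell expands the two bounds.
PROVISION_REQUEST=$(cat <<EOF
{
  "instanceGroups": [
    {
      "instanceType": "<instance-type>",
      "instanceTier": "on-demand",
      "instanceCount": 1,
      "name": "inference-group-1",
      "rootVolume": { "size": 128 },
      "autoscaling": {
        "minInstances": $MIN_INSTANCES,
        "maxInstances": $MAX_INSTANCES,
        "enabled": true
      }
    }
  ],
  "tags": [
    { "key": "team", "value": "ml-platform" }
  ]
}
EOF
)

# Show the JSON that would be passed to the option.
printf '%s\n' "$PROVISION_REQUEST"
```

The resulting string can then be supplied as `--provision-k8s-request "$PROVISION_REQUEST"`; the double quotes keep the whole object as a single argument.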
--use-public-load-balancer | --no-use-public-load-balancer
(boolean)
The boolean flag to request a public load balancer. By default, a private load balancer is used.
--skip-validation | --no-skip-validation
(boolean)
Skip pre-flight validations if requested.
--load-balancer-ip-whitelists
(array)
The whitelist of IPs for the load balancer.
Syntax:
"string" "string" ...
--subnets-for-load-balancers
(array)
The list of subnets to be used for the load balancer.
Syntax:
"string" "string" ...
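The array options take their values as separate quoted arguments, as the `"string" "string" ...` syntax indicates. The sketch below combines the load balancer options in one call. The addresses come from ranges reserved for documentation, the use of CIDR notation is an assumption (this page only says "IPs"), the subnet placeholders are hypothetical, and the `cdp ml` prefix is likewise assumed.

```shell
# Each list element is its own argument after the option name.
# CIDR notation and the "cdp ml" prefix are assumptions, not taken from this page.
set -- cdp ml create-ml-serving-app \
  --app-name "my-inference-app" \
  --environment-crn "<environment-crn>" \
  --cluster-crn "<cluster-crn>" \
  --use-public-load-balancer \
  --load-balancer-ip-whitelists "203.0.113.0/24" "198.51.100.7/32" \
  --subnets-for-load-balancers "<subnet-id-1>" "<subnet-id-2>"

# Print one argument per line to make the argument boundaries visible.
ARG_COUNT=$#
for arg in "$@"; do
  printf '%s\n' "$arg"
done
```

Running `"$@"` in place of the loop would execute the call with these arguments.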
--static-subdomain
(string)
The static subdomain to be used for the Cloudera AI Inference Service instance.
--cli-input-json
(string)
Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values.
--generate-cli-skeleton
(boolean)
Prints a sample input JSON to standard output. Note the specified operation is not run if this argument is specified. The sample input can be used as an argument for --cli-input-json.
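The two options above suggest a two-step workflow: save the skeleton, edit it, then feed it back. The sketch below only prints the two commands so they can be reviewed first. The file name is hypothetical and the `cdp ml` prefix is an assumption.

```shell
# Hypothetical file that holds the edited input JSON.
INPUT_FILE="serving-app-input.json"

# Step 1: save the sample input JSON. The operation itself is not run.
SKELETON_CMD="cdp ml create-ml-serving-app --generate-cli-skeleton > $INPUT_FILE"

# Step 2: after editing the file, pass its contents as a JSON string.
# Options given on the command line, such as --app-name here, override
# the corresponding values stored in the JSON.
CREATE_CMD="cdp ml create-ml-serving-app --cli-input-json \"\$(cat $INPUT_FILE)\" --app-name my-inference-app"

# Print both steps in order.
printf '%s\n%s\n' "$SKELETON_CMD" "$CREATE_CMD"
```

Keeping the edited JSON in a file makes the request easy to review and reuse, while command-line options cover one-off changes.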
Form Factors¶
public, private