Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
built-in-runtimes.md		built-in-runtimes.md

README.md

Configuration

System-wide configuration parameters can be set by creating a ConfigMap with name model-serving-config. It should contain a single key named config.yaml, whose value is a yaml doc containing the configuration. All parameters have defaults and are optional. If the ConfigMap does not exist, all parameters will take their defaults.

The configuration can be updated at runtime and will take effect immediately. Be aware however that certain changes could cause temporary disruption to the service - in particular changing the service name, port, TLS configuration and/or headlessness.

Example:

apiVersion: v1
kind: ConfigMap
metadata:
  name: model-serving-config
data:
  config.yaml: |
    #Sample config overrides
    inferenceServiceName: "modelmesh-serving"
    inferenceServicePort: 8033
    podsPerRuntime: 2
    metrics:
      enabled: true

The following parameters are currently supported. Note the keys are expressed here in camel case but are in fact case-insensitive.

Variable	Description	Default
`inferenceServiceName`	The service name which is used for communication with the serving server	`modelmesh-serving`
`inferenceServicePort`	The port number for communication with the inferencing service	`8033`
`storageSecretName`	The secret containing entries for each storage backend from which models can be loaded (* see below)	`storage-config`
`podsPerRuntime`	Number of server Pods to run per enabled Serving Runtime (** see below)	`2`
`tls.secretName`	Kubernetes TLS type secret to use for securing the Service; no TLS if empty (*** see below)
`tls.clientAuth`	Enables mutual TLS authentication. Supported values are `required` and `optional`, disabled if empty
`headlessService`	Whether the Service should be headless (recommended)	`true`
`enableAccessLogging`	Enables logging of each request to the model server	`false`
`serviceAccountName`	The service account to use for runtime Pods	`modelmesh`
`metrics.enabled`	Enables serving of Prometheus metrics	`true`
`metrics.port`	Port on which to serve metrics via the `/metrics` endpoint	`2112`
`metrics.scheme`	Scheme to use for the `/metrics` endpoint (`http` or `https`)	`https`
`metrics.disablePrometheusOperatorSupport`	Disable the support of Prometheus operator for metrics only if `metrics.enabled` is true	`false`
`scaleToZero.enabled`	Whether to scale down Serving Runtimes that have no Predictors	`true`
`scaleToZero.gracePeriodSeconds`	The number of seconds to wait after Predictors are deleted before scaling to zero	`60`
`grpcMaxMessageSizeBytes`	The max number of bytes for the gRPC request payloads (**** see below)	`16777216` (16MiB)
`restProxy.enabled`	Enables the provided REST proxy container being deployed in each ServingRuntime deployment	`true`
`restProxy.port`	Port on which the REST proxy to serve REST requests	`8008`

(*) Currently requires a controller restart to take effect

(**) This parameter will likely be removed in a future release; the Pod replica counts will become more dynamic.

(***) The TLS configuration secret allows for keys:

tls.crt - path to TLS secret certificate
tls.key - path to TLS secret key
ca.crt (optional) - single path or comma-separated list of paths to trusted certificates

(****) The max gRPC request payload size depends on both this setting and adjusting the model serving runtimes' max message limit. See inference docs for details.

Logging

By default, the internal logging of the controller component is set to log stacktraces on errors and sampling, which is the Zap production configuration. To enable the development mode for logging (stacktraces on warnings, no sampling, prettier log outputs), set the environment variable DEV_MODE_LOGGING=true on the ModelMesh Serving controller:

kubectl set env deploy/modelmesh-controller DEV_MODE_LOGGING=true

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configuration

configuration

README.md

Configuration

Logging

Files

configuration

Directory actions

More options

Directory actions

More options

Latest commit

History

configuration

Folders and files

parent directory

README.md

Configuration

Logging