Releases: kserve/modelmesh-serving

v0.9.0

21 Jul 00:56
deeaad7

⚠️ What's Changed

  • ModelMesh Serving now directly imports KServe types for ServingRuntimes and InferenceServices. (#140, #146)
  • InferenceService CRD now copied from KServe and included as part of standalone ModelMesh Serving installation by default.
  • Renamed role/rolebinding names to include the modelmesh prefix. (#181)
  • ModelMesh now uses Java 17 (kserve/modelmesh#33) and G1 garbage collector. (kserve/modelmesh#41)
  • ModelMesh logging improvements. (kserve/modelmesh#41)
  • InferenceService CRD now included in default standalone mm-serving installation. (#166)
  • Many dependencies including etcd (updated to v3.5.3) were bumped. (#145)

🌈 What's New?

  • Added support for OpenVINO Model Server ServingRuntime. (#141)
  • OpenVINO Model Server adapter implemented. (kserve/modelmesh-runtime-adapter#18)
  • TotalCopies is now available in the Predictor and InferenceService statuses. (#142)
  • Users can now set labels and annotations for ServingRuntime pods via the model-serving-config ConfigMap. (#144)
  • Users can override adapter environment variables added by the controller. (#149)
  • ServingRuntime matching based on protocolVersion is now supported. (#154)
  • ModelMetadata endpoint now enabled for Triton and MLServer ServingRuntimes. (#164)
  • Azure Blob Storage now added as a supported storage provider. (#174, kserve/modelmesh-runtime-adapter#23)
  • Added ModelMesh metrics for inference request/response payload sizes. (kserve/modelmesh#37)
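
The protocolVersion matching from #154 can be pictured with a small Python sketch. This is only an illustration, not the controller's actual Go code: the dictionary layout, the `supports_protocol` helper, the runtime names, and the rule that a request naming no protocol version matches any runtime are all assumptions made for the example.

```python
def supports_protocol(runtime, requested_version):
    """Return True if the runtime lists the requested protocol version.

    Assumption (not taken from the release notes): a request that names
    no protocol version is not filtered at all.
    """
    if requested_version is None:
        return True
    return requested_version in runtime.get("protocolVersions", [])


# Hypothetical runtime descriptions, loosely modelled on a ServingRuntime spec.
runtimes = [
    {"name": "runtime-a", "protocolVersions": ["grpc-v2"]},
    {"name": "runtime-b", "protocolVersions": ["grpc-v1"]},
]

# Keep only the runtimes that advertise the requested protocol version.
matches = [r["name"] for r in runtimes if supports_protocol(r, "grpc-v2")]
print(matches)
```

Under these assumptions only `runtime-a` is kept for a `grpc-v2` request.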

🐞 Fixes

  • Fixed possible nil pointer dereferences and minor log improvements. (#160)
  • Fixed potential eviction deadlock in ModelMesh. (kserve/modelmesh#25)
  • Disabled FIPS for Java in ModelMesh. (kserve/modelmesh#35)
  • Repaired invalid ModelRecord lastUsed values in the registry. (kserve/modelmesh#36)
  • Quickstart minio and etcd pods were converted to Deployment resources. (#157)

📄 Documentation

  • OpenVINO ServingRuntime documentation added. (#167)
  • Rest proxy documentation added. (#177)
  • Monitoring and metrics documentation added. (#175)
  • TLS configuration documentation added. (#176)
  • InferenceService CRD now documented as the primary interface for interacting with ModelMesh. (#190)

🦦 Other

  • Upgraded tests to use Ginkgo v2. (#133)
  • Added a performance test to the E2E toolchain. (#139)
  • Quickstart etcd version updated to v3.5.4. (#151)

Full Changelog: v0.8.0...v0.9.0

v0.9.0-rc0

01 Jul 00:40
d2541e5
Pre-release

Release candidate for v0.9.0.

v0.8.0

12 Feb 00:22
41476d5

⚠️ What's Changed

  • Removed support for KServe TrainedModel CRD (#54)
  • MLServer ServingRuntime updated to use 0.5.2 (#61)
  • Go version updated to 1.17 along with other tooling updates (5355eb7)
  • MLServer ServingRuntime now has an increased gRPC max message size (#85)
  • In the ServingRuntime CRD, SupportedModelTypes has been renamed to SupportedModelFormats (#100)
  • The max gRPC response message size via the REST-proxy has been increased to 16MiB (kserve/rest-proxy#11)

🌈 What's New?

  • Multi-namespace support for the ModelMesh controller was introduced (#84)
    • Kube resolver can now work with multiple namespaces for multi-namespace capability (#73)
    • ModelMeshEventStream component can now support multiple namespaces (#76)
    • ServingRuntime controller now works across multiple namespaces (#77)
    • Service Controller is now namespace-aware (#82)
  • Default RBAC is now cluster-scoped instead of namespace-scoped (#88)
  • Users can now configure environment variables for the model-mesh containers in ServingRuntime deployments (98eea55)
  • Reconciliation logic added for new storage spec in InferenceServices and Predictors (#56, #83)
  • A multiModel field was added to the ServingRuntime spec to indicate whether a ServingRuntime is compatible with ModelMesh (#89)
  • The controller can now reconcile InferenceServices using the new Model Spec in the predictor (#101)
  • autoSelect field introduced to ServingRuntime CRD supportedModelTypes spec (#100)
  • When finding compatible runtimes, ModelMesh now considers only ServingRuntimes whose model format has autoSelect set to true (#108)
  • Install script now allows passing in a URL to a config archive (#118)
  • Models hosted using GCS or HTTP(S) can now be used with ModelMesh through InferenceServices (#121)
  • REST input payloads through the REST-proxy can now be multi-dimensional (kserve/rest-proxy#6)
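
The runtime selection rules described in this list (the multiModel field from #89 and the autoSelect behaviour from #100 and #108) can be sketched as follows. This is a simplified Python illustration, not the controller's Go implementation: the `compatible_runtimes` helper, the dictionary layout, and the runtime names are hypothetical, and only the field names multiModel, supportedModelFormats, and autoSelect come from the notes above.

```python
def compatible_runtimes(runtimes, model_format):
    """Return the names of runtimes eligible for automatic selection.

    A runtime qualifies when it is marked multiModel and lists the
    requested format with autoSelect set to true.
    """
    selected = []
    for rt in runtimes:
        # Runtimes not marked multiModel are treated as not ModelMesh-compatible.
        if not rt.get("multiModel", False):
            continue
        for fmt in rt.get("supportedModelFormats", []):
            if fmt.get("name") == model_format and fmt.get("autoSelect", False):
                selected.append(rt["name"])
                break
    return selected


# Hypothetical runtime descriptions for the example.
example_runtimes = [
    {"name": "rt-auto", "multiModel": True,
     "supportedModelFormats": [{"name": "sklearn", "autoSelect": True}]},
    {"name": "rt-manual", "multiModel": True,
     "supportedModelFormats": [{"name": "sklearn"}]},
    {"name": "rt-single", "multiModel": False,
     "supportedModelFormats": [{"name": "sklearn", "autoSelect": True}]},
]

print(compatible_runtimes(example_runtimes, "sklearn"))
```

In this sketch `rt-manual` is skipped because its format entry does not set autoSelect, and `rt-single` is skipped because it is not marked multiModel.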

🐞 Fixes

  • Fixed code errors reported by golangci-lint (#57)
  • Fixed a bug where invalid vModel specs led to a nil pointer dereference (1bea198)
  • Fixed a bug where ServingRuntime controller would loop over empty reconcile events (2063f73)
  • Events from plugged-in Predictor sources are now transformed properly when setting up ServingRuntime controller (d6f5c5d)
  • Fixed install issues on Mac (#114, #119)

📄 Documentation

  • Added developer documentation (#59)
  • Added notes about debug flags in custom MLServer runtimes (314761f)
  • Added Keras docs and example (54311bb, #109)
  • Changed install instructions to install from a release branch (#117)

🦦 Other

  • Some controller code was cleaned up and optimized (f380a27)
  • Script for setting up a user namespace for ModelMesh was added (#112)

Full Changelog: v0.7.0...v0.8.0

v0.8.0-rc0

22 Jan 00:04
5d16e7b
Pre-release

Release candidate for v0.8.0.

v0.7.0

12 Oct 17:39
1ba136a

The initial release of ModelMesh Serving. Learn more in the announcement blog post.

v0.7.0-rc1

07 Oct 01:07
18cc3e0
Pre-release

Initial release candidate 1