Releases: kserve/modelmesh-serving
Releases · kserve/modelmesh-serving
v0.9.0
⚠️ What's Changed
- ModelMesh Serving now directly imports KServe types for ServingRuntimes and InferenceServices. (#140, #146)
- InferenceService CRD now copied from KServe and included as part of standalone ModelMesh Serving installation by default.
- Renamed role/rolebinding names to incllude
modelmesh
prefix. (#181) - ModelMesh now uses Java 17 (kserve/modelmesh#33) and G1 garbage collector. (kserve/modelmesh#41)
- ModelMesh logging improvements. (kserve/modelmesh#41)
- InferenceService CRD now included in default standalone mm-serving installation. (#166)
- Many dependencies including etcd (updated to
v3.5.3
) were bumped. (#145)
🌈 What's New?
- Added support for OpenVINO Model Server ServingRuntime. (#141)
- OpenVINO Model Server adapter implemented. (#kserve/modelmesh-runtime-adapter#18)
TotalCopies
is now available in thePredictor
andInferenceService
statuses. (#142)- Users can now set labels and annotations for ServingRuntime pods via the
model-serving-config
ConfigMap. (#144) - Users can override adapter environment variables added by the controller. (#149)
- ServingRuntime matching based on protocolVersion is now supported. (#154)
ModelMetadata
endpoint now enabled for Triton and MLServer ServingRuntimes. (#164)- Azure Blob Storage now added as a supported storage provider. (#174, kserve/modelmesh-runtime-adapter#23)
- Add ModelMesh metrics for inference request/response payload sizes. (kserve/modelmesh#37)
🐞 Fixes
- Fixed possible nil pointer dereferences and minor log improvements. (#160)
- Fixed potential eviction deadlock in ModelMesh. (kserve/modelmesh#25)
- Disabled FIPS for Java in ModelMesh. (kserve/modelmesh#35)
- Repair invalid ModelRecord lastUsed values in registry. (kserve/modelmesh#36)
- Quickstart
minio
andetcd
pods were converted to Deployment resources. (#157)
📄 Documentation
- OpenVINO ServingRuntime documentation added. (#167)
- Rest proxy documentation added. (#177)
- Monitoring and metrics documentation added. (#175)
- TLS configuration documentation added. (#176)
- InferenceService CRD now documented as the primary interface for interacting with ModelMesh. (#190)
🦦 Other
- Upgrade tests to use to Ginkgo V2. (#133)
- Add performance test to E2E toolchain. (#139)
- Quickstart etcd version updated to v3.5.4. (#151)
Full Changelog: v0.8.0...v0.9.0
v0.9.0-rc0
Release candidate for v0.9.0
.
v0.8.0
⚠️ What's Changed
- Removed support for KServe
TrainedModel
CRD (#54) - MLServer
ServingRuntime
updated to use 0.5.2 (#61) - Go version updated to 1.17 along with other tooling updates (5355eb7)
- MLServer
ServingRuntime
now has an increased gRPC max message size (#85) - In the
ServingRuntime
CRD,SupportedModelTypes
now goes bySupportedModelFormats
(#100) - The max gRPC response message size via the REST-proxy has been increased to 16MiB (kserve/rest-proxy#11)
🌈 What's New?
- Multi-namespace support for the ModelMesh controller was introduced (#84)
- Default RBAC is now cluster-scoped instead of namespace-scoped (#88)
- Users can now configure environment variables for the model-mesh containers in
ServingRuntime
deployments (98eea55) - Reconciliation logic added for new storage spec in
InferenceServices
andPredictors
(#56, #83) - A
multiModel
field added to theServingRuntime
spec for denoting if aServingRuntime
is compatible with ModelMesh or not (#89) - The controller can now reconcile
InferenceServices
using the new Model Spec in the predictor (#101) autoSelect
field introduced toServingRuntime
CRDsupportedModelTypes
spec (#100)- Logic was added to have MM only consider SRs with model format containing
autoSelect
as true when finding compatible runtimes (#108) - Install script now allows passing in a URL to a config archive (#118)
- Models hosted using GCS or HTTP(S) can now be used with ModelMesh through
InferenceServices
(#121) - REST input payloads through the REST-proxy can now be multi-dimensional (kserve/rest-proxy#6)
🐞 Fixes
- Fix code errors reported by golangci-lint (#57)
- Fixed a bug where invalid vModel specs led to a nil pointer dereference (1bea198)
- Fixed a bug where
ServingRuntime
controller would loop over empty reconcile events (2063f73) - Events from plugged-in
Predictor
sources are now transformed properly when setting upServingRuntime
controller (d6f5c5d) - Fixed install issues on Mac (#114, #119)
📄 Documentation
- Added developer documentation (#59)
- Added notes about debug flags in custom MLServer runtimes (314761f)
- Added Keras docs and example (54311bb, #109)
- Change install instructions to install from a release branch (#117)
🦦 Other
- Some controller code was cleaned up and optimized (f380a27)
- Script for setting up a user namespace for ModelMesh was added (#112)
Full Changelog: v0.7.0...v0.8.0
v0.8.0-rc0
Release candidate for v0.8.0
.
v0.7.0
v0.7.0-rc1
Initial release candidate 1