v0.9.0
⚠️ What's Changed
- ModelMesh Serving now directly imports KServe types for ServingRuntimes and InferenceServices. (#140, #146)
- InferenceService CRD now copied from KServe and included as part of standalone ModelMesh Serving installation by default.
- Renamed role/rolebinding names to include the `modelmesh` prefix. (#181)
- ModelMesh now uses Java 17 (kserve/modelmesh#33) and the G1 garbage collector. (kserve/modelmesh#41)
- ModelMesh logging improvements. (kserve/modelmesh#41)
- InferenceService CRD now included in default standalone mm-serving installation. (#166)
- Many dependencies including etcd (updated to `v3.5.3`) were bumped. (#145)
🌈 What's New?
- Added support for OpenVINO Model Server ServingRuntime. (#141)
- OpenVINO Model Server adapter implemented. (kserve/modelmesh-runtime-adapter#18)
- `TotalCopies` is now available in the `Predictor` and `InferenceService` statuses. (#142)
- Users can now set labels and annotations for ServingRuntime pods via the `model-serving-config` ConfigMap. (#144)
- Users can override adapter environment variables added by the controller. (#149)
- ServingRuntime matching based on protocolVersion is now supported. (#154)
- `ModelMetadata` endpoint now enabled for Triton and MLServer ServingRuntimes. (#164)
- Azure Blob Storage now added as a supported storage provider. (#174, kserve/modelmesh-runtime-adapter#23)
- Add ModelMesh metrics for inference request/response payload sizes. (kserve/modelmesh#37)
🐞 Fixes
- Fixed possible nil pointer dereferences and minor log improvements. (#160)
- Fixed potential eviction deadlock in ModelMesh. (kserve/modelmesh#25)
- Disabled FIPS for Java in ModelMesh. (kserve/modelmesh#35)
- Repair invalid ModelRecord lastUsed values in registry. (kserve/modelmesh#36)
- Quickstart `minio` and `etcd` pods were converted to Deployment resources. (#157)
📄 Documentation
- OpenVINO ServingRuntime documentation added. (#167)
- REST proxy documentation added. (#177)
- Monitoring and metrics documentation added. (#175)
- TLS configuration documentation added. (#176)
- InferenceService CRD now documented as the primary interface for interacting with ModelMesh. (#190)
🦦 Other
- Upgrade tests to use Ginkgo V2. (#133)
- Add performance test to E2E toolchain. (#139)
- Quickstart etcd version updated to v3.5.4. (#151)
Full Changelog: v0.8.0...v0.9.0