v0.8.0

@pvaneck pvaneck released this 12 Feb 00:22
· 1 commit to release-0.8 since this release
41476d5

⚠️ What's Changed

  • Removed support for KServe TrainedModel CRD (#54)
  • MLServer ServingRuntime updated to use 0.5.2 (#61)
  • Go version updated to 1.17 along with other tooling updates (5355eb7)
  • MLServer ServingRuntime now has an increased gRPC max message size (#85)
  • In the ServingRuntime CRD, SupportedModelTypes was renamed to SupportedModelFormats (#100)
  • The max gRPC response message size via the REST-proxy has been increased to 16MiB (kserve/rest-proxy#11)

🌈 What's New?

  • Multi-namespace support for the ModelMesh controller was introduced (#84)
    • Kube resolver now works with multiple namespaces (#73)
    • ModelMeshEventStream component can now support multiple namespaces (#76)
    • ServingRuntime controller now works across multiple namespaces (#77)
    • Service Controller is now namespace-aware (#82)
  • Default RBAC is now cluster-scoped instead of namespace-scoped (#88)
  • Users can now configure environment variables for the model-mesh containers in ServingRuntime deployments (98eea55)
  • Reconciliation logic was added for the new storage spec in InferenceServices and Predictors (#56, #83)
  • A multiModel field was added to the ServingRuntime spec to indicate whether a ServingRuntime is compatible with ModelMesh (#89)
  • The controller can now reconcile InferenceServices using the new Model Spec in the predictor (#101)
  • An autoSelect field was introduced in the ServingRuntime CRD supportedModelFormats spec (#100)
  • When finding compatible runtimes, ModelMesh now only considers ServingRuntimes whose matching model format has autoSelect set to true (#108)
  • Install script now allows passing in a URL to a config archive (#118)
  • Models hosted using GCS or HTTP(S) can now be used with ModelMesh through InferenceServices (#121)
  • REST input payloads through the REST-proxy can now be multi-dimensional (kserve/rest-proxy#6)

🐞 Fixes

  • Fixed code errors reported by golangci-lint (#57)
  • Fixed a bug where invalid vModel specs led to a nil pointer dereference (1bea198)
  • Fixed a bug where ServingRuntime controller would loop over empty reconcile events (2063f73)
  • Events from plugged-in Predictor sources are now transformed properly when setting up ServingRuntime controller (d6f5c5d)
  • Fixed install issues on Mac (#114, #119)

📄 Documentation

  • Added developer documentation (#59)
  • Added notes about debug flags in custom MLServer runtimes (314761f)
  • Added Keras docs and example (54311bb, #109)
  • Changed install instructions to install from a release branch (#117)

🦦 Other

  • Some controller code was cleaned up and optimized (f380a27)
  • Script for setting up a user namespace for ModelMesh was added (#112)

Full Changelog: v0.7.0...v0.8.0