Skip to content

Releases: bentoml/BentoML

v1.2.0

02 Feb 17:46
9169155
Compare
Choose a tag to compare

🍱 We are excited to share with you that we have released BentoML v1.2, the biggest release since the launch of v1.0. This release includes improvements from all the learning and feedback from our community over the past year. We invite you to read our release blog post for a comprehensive overview of the new features and the motivations behind their development.

Here are a few key points to note before we delve into the new features:

  • v1.2 ensures complete backward compatibility, meaning that Bentos built with v1.1 will continue to function seamlessly with this release.
  • We remain committed to supporting v1.1. Critical bug fixes and security updates will be backported to the v1.1 branch.
  • BentoML documentation has been updated with examples and guides for v1.2. More guides are being added every week.
  • BentoCloud is fully equipped to handle deployments from both v1.1 and v1.2 releases of BentoML.

⛏️ Introduced a simplified service SDK to empower developers with greater control and flexibility.

  • Simplified the service and API interfaces as Python classes, allowing developers to add custom logic and use third party libraries flexibly with ease.
  • Introduced @bentoml.service and @bentoml.api decorators to customize the behaviors of services and APIs.
  • Moved configuration from YAML files to the service decorator @bentoml.service next to the class definition.
  • See this example demonstrating the flexibility of the service API by initializing a vLLM AsyncEngine in the service constructor and run inference with continuous batching in the service API.

🔭 Revamped IO descriptors with more familiar input and output types.

  • Enable use of Pythonic types directly, without the need for additional IO descriptor definitions or decorations.
  • Integrated with Pydantic to leverage its robust validation capabilities and wide array of supported types.
  • Expanded support to ML and Generative AI specific IO types.

📦 Updated model saving and loading API to be more generic to enable integration with more ML frameworks.

  • Allow flexible saving and loading models using the bentoml.models.create API instead of framework specific APIs, e.g. bentoml.pytorch.save_model, bentoml.tensorflow.save_model.

🚚 Streamlined the deployment workflow to allow more rapid development iterations and a faster time to production.

  • Enabled direct deployment to production through CLI and Python API from Git projects.

🎨 Improved API development experience with generated web UI and rich Python client.
image

  • All bentos are now accompanied by a custom-generated UI in the BentoCloud Playground, tailored to their API definitions.
  • BentoClient offers a Pythonic way to invoke the service endpoint, allowing parameters to be supplied in native Python format, letting the client efficiently handles the necessary serialization while ensuring compatibility and performance.

🎭 We've learned that the best way to showcase what BentoML can do is not through dry, conceptual documentation but through real-world examples. Check out our current list of examples, and we'll continue to publish new ones to the gallery as exciting new models are released.

🙏 Thank you for your continued support!

What's Changed

New Contributors

Full Changelog: v1.2.0rc1...v1.2.0

v1.2.0rc1

31 Jan 13:46
b23185b
Compare
Choose a tag to compare
v1.2.0rc1 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a7...v1.2.0rc1

v1.2.0a7

30 Jan 04:52
5713dab
Compare
Choose a tag to compare
v1.2.0a7 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a6...v1.2.0a7

v1.2.0a6

26 Jan 03:42
a658e64
Compare
Choose a tag to compare
v1.2.0a6 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a5...v1.2.0a6

v1.2.0a5

20 Jan 14:06
2d73651
Compare
Choose a tag to compare
v1.2.0a5 Pre-release
Pre-release

What's Changed

  • fix: image encoding issue when the format is not specified by @frostming in #4435

Full Changelog: v1.2.0a4...v1.2.0a5

v1.2.0a4

19 Jan 10:02
813ef71
Compare
Choose a tag to compare
v1.2.0a4 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a3...v1.2.0a4

v1.2.0a3

19 Jan 03:26
e481e65
Compare
Choose a tag to compare
v1.2.0a3 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a2...v1.2.0a3

v1.2.0a2

18 Jan 11:55
95e20f8
Compare
Choose a tag to compare
v1.2.0a2 Pre-release
Pre-release

What's Changed

Full Changelog: v1.2.0a1...v1.2.0a2

v1.2.0a1

17 Jan 06:32
81751f4
Compare
Choose a tag to compare
v1.2.0a1 Pre-release
Pre-release

What's Changed

  • fix(docs): remove eager type evaluation for python 3.8 on openllm examples by @aarnphm in #4393
  • docs: fix typo in tritonserver_type parameter and code example by @ilyahabr in #4394

New Contributors

Full Changelog: v1.2.0a0...v1.2.0a1

v1.2.0a0

09 Jan 07:01
777301d
Compare
Choose a tag to compare
v1.2.0a0 Pre-release
Pre-release

What's Changed

Full Changelog: v1.1.11...v1.2.0a0