Releases: bentoml/BentoML
v1.4.3
What's Changed
- docs: Update examples to use new APIs by @Sherlock113 in #5252
- Add alt text to all images in documentation by @devin-ai-integration in #5253
- ci: pre-commit autoupdate [skip ci] by @pre-commit-ci in #5254
- docs: update deprecated links by @aarnphm in #5256
- docs: Add root input usage by @Sherlock113 in #5257
- fix: reformat the code by @frostming in #5258
- fix: forbid requests with pickle encoding at the entry service
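The last fix above matters because unpickling untrusted bytes can execute arbitrary code. A minimal sketch of the idea follows, not BentoML's actual implementation: the media type string and the function name are assumptions made for illustration.

```python
from typing import Optional

# Assumption: media type used for pickle-encoded payloads (illustrative only).
PICKLE_MEDIA_TYPE = "application/vnd.bentoml+pickle"


def is_forbidden_at_entry(content_type: Optional[str]) -> bool:
    """Return True when a request's Content-Type declares a pickle payload."""
    if not content_type:
        return False
    # Drop parameters such as "; charset=utf-8" and normalize case.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type == PICKLE_MEDIA_TYPE
```

An entry service built this way would reject such requests before any deserialization happens, while ordinary JSON or multipart requests pass through.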
Full Changelog: v1.4.2...v1.4.3
v1.4.2
What's Changed
- Fix RunPod unhashable model error by converting svc.models to hashable ids by @rivaon in #5244
- fix(io): OpenAPI schema for multipart form request body by @frostming in #5249
- fix: drop uv as a hard dependency by @frostming in #5238
- docs: Update examples to use new HF API by @Sherlock113 in #5242
- refactor: drop deepmerge dependency by @frostming in #5250
- refactor: unify logic of loading service by @frostming in #5232
- fix: collect requirements from image spec for codespaces by @frostming in #5251
Full Changelog: v1.4.1...v1.4.2
v1.4.1
What's Changed
- docs: use diff for runtime image by @parano in #5236
- docs: Update runtime explanations by @Sherlock113 in #5240
- fix(regression): call error when context parameter is present by @frostming in #5247
Full Changelog: v1.4.0...v1.4.1
v1.4.0
We are thrilled to announce the release of BentoML 1.4! This version introduces several new features and improvements to accelerate your iteration cycle and enhance the overall developer experience.
Below are the key highlights of 1.4, and you can find more details in the release blog post.
🚀 20x faster iteration with Codespaces
- Introduced BentoML Codespaces, a development platform built on BentoCloud
- Added the `bentoml code` command for creating a Codespace
- Auto-sync of local changes to the cloud environment
- Access to a variety of powerful cloud GPUs
- Real-time logs and debugging through the cloud dashboard
- Eliminate dependency headaches and ensure consistency between dev and prod environments
🐍 New Python SDK for runtime configurations
- Added `bentoml.images.PythonImage` for defining the Bento runtime environment in Python instead of using `bentofile.yaml` or `pyproject.toml`
- Support customizing runtime configurations (e.g., Python version, system packages, and dependencies) directly in the `service.py` file
- Introduced context-sensitive `run()` method for running custom build commands
- Backward compatible with existing `bentofile.yaml` and `pyproject.toml` configurations
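As a rough illustration of the Python SDK described above, a `service.py` might define its runtime as sketched below. The Python version, package names, and the `echo` build command are placeholders. The `system_packages(...)` method and the `image=` argument are written from memory of the BentoML docs and should be checked against the documentation for your version.

```python
import bentoml

# Runtime definition that would otherwise live in bentofile.yaml.
# All values below are placeholders for this sketch.
image = (
    bentoml.images.PythonImage(python_version="3.11")
    .system_packages("git")
    .python_packages("torch", "transformers")
    # run() is context-sensitive: placed here, the command runs after
    # the Python packages above have been installed.
    .run("echo 'custom build step'")
)


@bentoml.service(image=image)
class MyService:
    @bentoml.api
    def ping(self) -> str:
        return "ok"
```

Keeping the image definition next to the Service code means the runtime and the code that depends on it change together in one file.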
⚡ Accelerated model loading with safetensors
- Implemented build-time model downloads and parallel loading of model weights using safetensors to reduce cold start time and improve scaling performance. See the documentation to learn more.
- Added `bentoml.models.HuggingFaceModel` for loading models from HF. It supports private model repositories and custom endpoints
- Added `bentoml.models.BentoModel` for loading models from BentoCloud and the Model Store
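A minimal sketch of how the two model classes above might be declared on a Service. The class name, attribute names, and the `my_model:latest` tag are placeholders, and the exact runtime value of each attribute should be confirmed in the BentoML documentation.

```python
import bentoml
from bentoml.models import BentoModel, HuggingFaceModel


@bentoml.service
class Classifier:
    # Declared as class attributes so the models can be resolved at build time.
    # Both identifiers are placeholders for this sketch.
    hf_model = HuggingFaceModel("google-bert/bert-base-uncased")
    store_model = BentoModel("my_model:latest")

    def __init__(self) -> None:
        # Assumption: at runtime the Hugging Face attribute resolves to the
        # local location of the downloaded files; load the weights from it
        # with the framework of your choice.
        self.model_location = self.hf_model
```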
🌍 External deployment dependencies
- Extended `bentoml.depends()` to support external deployments
- Added support for calling BentoCloud Deployments via name or URL
- Added support for calling self-hosted HTTP AI services outside BentoCloud
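The sketch below shows one way the external dependencies above could be declared. The `deployment=` and `url=` keyword names are written from memory of the BentoML docs and should be verified. The Deployment name, the URL, and the remote `summarize` API are hypothetical.

```python
import bentoml


@bentoml.service
class Gateway:
    # A BentoCloud Deployment referenced by name (placeholder name).
    by_name = bentoml.depends(deployment="my-summarizer")
    # A BentoCloud Deployment or self-hosted HTTP service referenced by URL
    # (placeholder address).
    by_url = bentoml.depends(url="http://192.0.2.10:3000")

    @bentoml.api
    def summarize(self, text: str) -> str:
        # `summarize` is assumed to be an API exposed by the remote service.
        return self.by_name.summarize(text)
```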
⚠️ Legacy Service API deprecation
- The legacy `bentoml.Service` API (with runners) is now officially deprecated and is scheduled for removal in a future release. We recommend you use the `@bentoml.service` decorator.
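For reference, a minimal Service written with the recommended decorator could look like the sketch below. The class and method names are placeholders.

```python
import bentoml


@bentoml.service
class Echo:
    # Each method decorated with @bentoml.api is exposed as an endpoint.
    @bentoml.api
    def echo(self, text: str) -> str:
        return text
```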
Note that:
- `1.4` remains fully compatible with Bentos created by `1.3`.
- The BentoML documentation has been updated with examples and guides for `1.4`.
🙏 As always, we appreciate your continued support!
What's Changed
- feat: support bentoml serve without service name by @frostming in #5208
- feat(service): expose service-level labels definition by @aarnphm in #5211
- fix: restore path after import by @frostming in #5214
- fix: compile bytecode when installing python packages by @frostming in #5212
- fix: IO descriptor honor validators by @frostming in #5213
- feat(image): add support for chaining `.pyproject.toml` by @aarnphm in #5218
- feat: support root input spec using positional-only argument by @frostming in #5217
- fix: gradio error when uploading file by @frostming in #5220
- fix: input data validation for root input by @frostming in #5221
- fix: don't restore model store after importing service by @frostming in #5223
- feat(metrics): extend histogram buckets to support LLM latencies by @devin-ai-integration in #5222
- fix: always add bentoml req unless it is specified as a url dependency by @frostming in #5225
- docs: update links to examples by @aarnphm in #5224
- docs: add environment variable authentication documentation by @devin-ai-integration in #5231
- docs: Update docs to use new runtime API by @Sherlock113 in #5177
- fix: add files under env/docker by @frostming in #5234
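To see why the histogram bucket change in the list above (#5222) helps with LLM workloads, consider the sketch below. The bucket boundaries are hypothetical and are not BentoML's actual defaults. The point is that a latency above the largest finite boundary can only be counted in the `+Inf` bucket, which hides how slow the request really was.

```python
import math
from bisect import bisect_left

# Hypothetical upper bounds in seconds (not BentoML's actual defaults).
SHORT_BUCKETS = (0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1.0, 2.5, 5.0, 10.0, math.inf)
EXTENDED_BUCKETS = SHORT_BUCKETS[:-1] + (30.0, 60.0, 120.0, 180.0, math.inf)


def bucket_for(latency_s: float, buckets: tuple) -> float:
    """Return the upper bound of the first bucket whose bound is >= latency_s."""
    return buckets[bisect_left(buckets, latency_s)]


# A 45-second generation request is only resolvable with the extended buckets.
print(bucket_for(45.0, SHORT_BUCKETS))
print(bucket_for(45.0, EXTENDED_BUCKETS))
```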
Full Changelog: v1.3.22...v1.4.0
v1.4.0a2
What's Changed
- fix: restore path after import by @frostming in #5214
Full Changelog: v1.4.0a1...v1.4.0a2
v1.4.0a1
What's Changed
- feat: support bentoml serve without service name by @frostming in #5208
- feat(service): expose service-level labels definition by @aarnphm in #5211
Full Changelog: v1.3.22...v1.4.0a1
v1.3.22
What's Changed
- fix: add line endings for python_packages(...) by @frostming in #5203
- fix: correct the log for locking platform by @frostming in #5205
- fix: uv installation in container by @frostming in #5206
- fix: add cluster option for secrets command by @xianml in #5209
- fix(build): setattr frozen sets by @aarnphm in #5207
- fix: correct context path when serving from bento by @frostming in #5210
Full Changelog: v1.3.21...v1.3.22
v1.3.21
What's Changed
- docs: correct types for example by @NitzanShwartz in #5188
- fix: make all top-level attributes lazy by @frostming in #5184
- fix: mounted apps don't eagerly match path prefix by @frostming in #5190
- chore: add spec into bento manifest schema by @xianml in #5192
- fix: get commit id from requested revision when it is missing by @frostming in #5193
- feat: add @bentoml.on_startup decorator by @devin-ai-integration in #5194
- fix: conditionally include endpoint_urls in deployment dictionary by @Haivilo in #5195
- fix: enable image spec by default by @frostming in #5197
- ci: pre-commit autoupdate [skip ci] by @pre-commit-ci in #5200
- docs: update AWS byoc cloudformation template by @sauyon in #5201
- feat: support running scripts in new image spec by @frostming in #5196
New Contributors
- @NitzanShwartz made their first contribution in #5188
Full Changelog: v1.3.20...v1.3.21
v1.3.20
What's Changed
- Add logging for service cleanup hooks in new-style BentoML services by @devin-ai-integration in #5171
- ci: pre-commit autoupdate [skip ci] by @pre-commit-ci in #5170
- fix: set the arbiter thread to be daemon thread by @frostming in #5172
- chore(analytics): use utils `is_jupyter` to determine notebook state by @aarnphm in #5173
- docs: Add comfyUI example doc by @Sherlock113 in #5148
- fix: allow building bento without build config file by @frostming in #5175
- fix: Remove distutils dependency by @jaume-ferrarons in #5179
- docs: Add streaming and websocket docs by @Sherlock113 in #5180
- docs: Update ASGI decorator by @Sherlock113 in #5182
- fix: allow usage like bentoml deploy service:MyService by @frostming in #5181
- docs: Update Gradio example link by @Sherlock113 in #5183
- fix: use a specific manylinux version as build platform for better compatibility by @frostming in #5185
- feat(metrics): init zero for request count of apis by @Haivilo in #5186
New Contributors
- @devin-ai-integration made their first contribution in #5171
- @jaume-ferrarons made their first contribution in #5179
Full Changelog: v1.3.19...v1.3.20
v1.3.19
What's Changed
- fix: turn off build log stream in jupyter notebook by @frostming in #5165
- docs: Add notes for loading HF private models by @Sherlock113 in #5166
- fix: patch annotations inside a container type by @frostming in #5167
Full Changelog: v1.3.18...v1.3.19