v1.0.6
Summary
This new release tag to mark the latest stable version of the codebase. Key updates included in this release:
- New Feature: Introduced the /v1/models endpoint for monitoring model serving status.
- Updates: Synced with the latest versions of mlx_vlm and mlx_lm for up-to-date performance and compatibility.
- Bug Fix: Fixed a text extraction issue when processing chunks.
- Enhancement: Refined the resource cleanup logic for improved efficiency and stability.