v1.0.2
Changes in this release:
- Refactored API schemas and response formats for improved consistency and maintainability.
- Updated chat history handling logic for better performance and reliability.
- Exposed the /v1/embeddings endpoint to support MLX-LM models (text-only).
- Added a new notebook, embeddings_examples, demonstrating how to use the embeddings endpoint via the OpenAI-compatible API.