v1.0.2

@cubist38 released this 20 Apr 13:45
d34536f

Changes in this release:

  • Refactored API schemas and response formats for improved consistency and maintainability.
  • Updated chat history handling logic for better performance and reliability.
  • Exposed the /v1/embeddings endpoint to support MLX-LM models (text-only).
  • Added a new notebook, embeddings_examples, demonstrating how to use the embeddings endpoint via the OpenAI-compatible API.
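For readers who want to try the embeddings endpoint without opening the notebook, here is a minimal sketch of a request against an OpenAI-style `/v1/embeddings` route using only the Python standard library. The base URL, port, and model name are placeholders chosen for illustration and are not taken from these release notes; the response layout (`data[i].embedding`, `data[i].index`) is assumed to follow the OpenAI embeddings format, since the endpoint is described as OpenAI-compatible.

```python
import json
import urllib.request

# Assumptions (not from the release notes): adjust to your own server and model.
BASE_URL = "http://localhost:8000/v1"
MODEL = "local-embedding-model"


def build_embeddings_request(texts, model=MODEL, base_url=BASE_URL):
    """Build a POST request for an OpenAI-style /v1/embeddings endpoint."""
    body = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embeddings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(response_json):
    """Return the embedding vectors, ordered by their 'index' field."""
    items = sorted(response_json["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def embed(texts):
    """Send the texts to the server and return one vector per input text."""
    request = build_embeddings_request(texts)
    with urllib.request.urlopen(request) as response:
        return parse_embeddings(json.load(response))


# Example call (requires a running server at BASE_URL):
# vectors = embed(["hello world", "a second sentence"])
# print(len(vectors), len(vectors[0]))
```

Because the endpoint is text-only, `input` here is a list of plain strings. The same request can also be made with the official `openai` Python client by pointing its `base_url` at the local server.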