MLX INFERENCE is an OpenAI API-compatible inference service built on MLX-LM and MLX-VLM. It provides the following endpoints:
- `/v1/chat/completions` - Chat completions endpoint
- `/v1/responses` - Responses endpoint
- `/v1/models` - List available models
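Because the service is OpenAI API-compatible, any OpenAI client can talk to it once it is running (see the startup command below). A minimal sketch using the official `openai` Python package; the base URL matches the example port used later, and the model id is a hypothetical placeholder for whichever mlx-community model you serve:

```python
from openai import OpenAI

# Point the client at the local MLX INFERENCE service instead of api.openai.com.
# Base URL and model id below are illustrative assumptions, not fixed values.
client = OpenAI(
    base_url="http://localhost:8002/v1",
    api_key="not-needed",  # local service; the key is typically ignored
)

response = client.chat.completions.create(
    model="mlx-community/Meta-Llama-3-8B-Instruct-4bit",  # hypothetical model id
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```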
Install dependencies and set up the environment file:

```bash
pip install -r requirements.txt

# Copy environment file
cp .env.example .env
```
Execute in the project root directory:

```bash
uvicorn mlx_Inference:app --workers 1 --port 8002
```
Parameters:

- `--workers`: number of worker processes
- `--port`: service port number
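To confirm the service came up, you can query the model list. A quick sketch with `requests`, assuming the port above and an OpenAI-style list payload with a `data` array:

```python
import requests

# Hit the /v1/models endpoint of the local service
# (adjust the port if you changed --port at startup).
resp = requests.get("http://localhost:8002/v1/models")
resp.raise_for_status()
for model in resp.json()["data"]:  # OpenAI-style model list
    print(model["id"])
```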
- Compatible with the OpenAI API specification
- Backend inference runs on MLX-LM and MLX-VLM and supports mlx-community models
- Easy to deploy and use
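Since the service follows the OpenAI API specification, standard client features such as streaming should work unchanged. A hedged sketch under the same host, port, and model-id assumptions as above, assuming the service implements OpenAI-style streaming for chat completions:

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8002/v1", api_key="not-needed")

# Request a streamed response; tokens arrive as incremental deltas.
stream = client.chat.completions.create(
    model="mlx-community/Meta-Llama-3-8B-Instruct-4bit",  # hypothetical model id
    messages=[{"role": "user", "content": "Write a haiku about Apple silicon."}],
    stream=True,
)
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
print()
```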