Local-Infer is a Rust-based local inference gateway that lets you run open-source AI models completely offline. It provides a simple API and CLI for interacting with models such as LLaMA and Whisper without relying on cloud services.
The goal is to provide a unified local backend for text and speech inference that is lightweight, modular, and privacy-preserving, and to expose adapters that make it easy to plug in open-source models and get a working setup quickly.
- Common trait interface for model engines
- HTTP API for inference and transcription
- CLI for running local tasks
- Adapter system for engines like llama.cpp and whisper.cpp (see the trait sketch after this list)
- Optional SQLite persistence for model registry and job history
- Async runtime with Axum and Tokio
- Extensible architecture for adding new adapters
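
A minimal sketch of what the common engine trait and a llama.cpp adapter could look like, assuming the async-trait and anyhow crates; the names used here (Engine, InferenceRequest, LlamaAdapter) are illustrative, not the crate's actual API.

```rust
use async_trait::async_trait;

/// Hypothetical request/response types; the real crate may model these differently.
pub struct InferenceRequest {
    pub prompt: String,
    pub max_tokens: usize,
}

pub struct InferenceResponse {
    pub text: String,
}

/// Common trait that adapters such as a llama.cpp or whisper.cpp backend could implement.
#[async_trait]
pub trait Engine: Send + Sync {
    /// Human-readable engine name, e.g. "llama.cpp".
    fn name(&self) -> &str;

    /// Run a single inference request against the loaded model.
    async fn infer(&self, req: InferenceRequest) -> anyhow::Result<InferenceResponse>;
}

/// Example adapter wrapping a llama.cpp binding (names are illustrative).
pub struct LlamaAdapter {
    pub model_path: std::path::PathBuf,
}

#[async_trait]
impl Engine for LlamaAdapter {
    fn name(&self) -> &str {
        "llama.cpp"
    }

    async fn infer(&self, req: InferenceRequest) -> anyhow::Result<InferenceResponse> {
        // A real adapter would call into the llama.cpp bindings here;
        // this stub just echoes the prompt to keep the sketch self-contained.
        Ok(InferenceResponse {
            text: format!("[max {} tokens] {}", req.max_tokens, req.prompt),
        })
    }
}
```

New backends plug in by implementing the same trait, so the HTTP API and CLI can stay agnostic about which engine serves a request.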
- Core + API workspace setup
- Engine trait definition and LLaMA adapter
- Basic inference endpoint (see the handler sketch after the roadmap)
- Persistent storage integration
- CLI tool
- Streaming support
- Additional adapters in the future (Whisper, OCR, etc.)
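
A minimal sketch of the basic inference endpoint, assuming Axum 0.7, Tokio, and Serde; the route path (/v1/infer) and payload fields are assumptions for illustration, not the project's actual API.

```rust
use axum::{routing::post, Json, Router};
use serde::{Deserialize, Serialize};

/// Hypothetical request/response payloads for the inference endpoint.
#[derive(Deserialize)]
struct InferRequest {
    prompt: String,
    #[serde(default = "default_max_tokens")]
    max_tokens: usize,
}

fn default_max_tokens() -> usize {
    256
}

#[derive(Serialize)]
struct InferResponse {
    text: String,
}

/// Handler stub: a real implementation would dispatch to the configured engine.
async fn infer(Json(req): Json<InferRequest>) -> Json<InferResponse> {
    Json(InferResponse {
        text: format!("(echo, max {} tokens) {}", req.max_tokens, req.prompt),
    })
}

#[tokio::main]
async fn main() {
    // Mount the inference route on a local-only listener.
    let app = Router::new().route("/v1/infer", post(infer));
    let listener = tokio::net::TcpListener::bind("127.0.0.1:8080").await.unwrap();
    axum::serve(listener, app).await.unwrap();
}
```

With this running, a request such as `curl -X POST localhost:8080/v1/infer -H 'Content-Type: application/json' -d '{"prompt":"hello"}'` would return the echoed text as JSON.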
MIT License © 2025