inference-api

Here are 84 public repositories matching this topic...

roboflow / inference

Turn any computer or edge device into a command center for your computer vision projects.

Updated Sep 11, 2025
Python

basetenlabs / truss

The simplest way to serve AI/ML models in production

open-source machine-learning packaging artificial-intelligence falcon easy-to-use whisper inference-server model-serving inference-api stable-diffusion wizardlm

Updated Sep 10, 2025
Python

quic / ai-hub-models

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

machine-learning inference pytorch machinelearning deeplearning demos inference-engine onnx tensorflow-lite qnn inference-api

Updated Aug 28, 2025
Python

quic / ai-hub-apps

The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

machine-learning inference pytorch machinelearning deeplearning demos inference-engine onnx tensorflow-lite qnn inference-api

Updated Aug 29, 2025
Java

SearchSavior / OpenArc

Lightweight inference server for OpenVINO

transformers inference-engine fastapi inference-api openvino-toolkit optimum-intel agentic-ai

Updated Sep 11, 2025
Python

Michael-OvO / Yolov7-Flask

A Beautiful Flask Web API for Yolov7 (and custom) models

python flask pytorch object-detection flask-web pretrained-weights model-deployment torchhub inference-api yolov7

Updated Sep 20, 2022
Python

mustafamerttunali / deep-learning-training-gui

Train and predict with pre-trained deep learning models through a GUI (web app). No more juggling many parameters, no more data preprocessing.

python api flask gui computer-vision deep-learning tensorflow image-classification tensorboard tensorflow-training mobilenetv2 inference-api tensorflow-predict

Updated Mar 24, 2024
Python

pszemraj / textsum

CLI & Python API to easily summarize text-based files with transformers

pipeline text transformers inference transformer summarization summary batch-processing inference-api text-to-text-transformer

Updated Nov 2, 2024
Python

BMW-InnovationLab / BMW-Classification-Training-GUI

This repository lets you start training a state-of-the-art deep learning model with little to no configuration. Provide your labeled dataset and you can start training right away. You can also test your model with the built-in inference REST API. Training classification models with GluonCV has never been so easy.

training computer-vision deep-learning classification gluoncv inference-api

Updated May 11, 2022
Python

intelligencedev / eternal

Eternal is an experimental platform for machine learning models and workflows.

go ai ml inference-api htmx gpt-4 stable-diffusion llamacpp comfyui claude-ai gemini-pro

Updated Mar 9, 2025
Go

Kardbord / hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

Updated May 6, 2025
Go

inference-gateway / inference-gateway

An open-source, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare and DeepSeek.

Updated Sep 4, 2025
Go

hupe1980 / go-huggingface

🤗 Hugging Face Inference Client written in Go

golang huggingface inference-api

Updated Jan 7, 2024
Go

BMW-InnovationLab / BMW-Classification-Inference-GPU-CPU

This is a repository for an image classification inference API using the GluonCV framework. The inference REST API works on CPU/GPU and is supported on Windows and Linux operating systems. Models trained using our GluonCV classification training repository can be deployed in this API. Several models can be loaded and used at the same time.

computer-vision deep-learning inference classification gluoncv inference-api

Updated May 4, 2022
Python

Prismadic / magnet

the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly

Updated Oct 19, 2024
Python

TimMikeladze / huggingface

TypeScript wrapper for the Hugging Face Inference API.

typescript inference huggingface inference-api hugging-face

Updated Mar 2, 2023
TypeScript

stephanj / Llama3JavaChatCompletionService

Llama3.java inference engine with an OpenAI Chat Completion REST API.

java inference-api openai-api llama3

Updated Feb 3, 2025
Java

TommyLemon / CVAuto

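👁 Zero-code, zero-annotation automated testing tool for CV AI 🚀 Eliminates large amounts of manual box drawing and labeling, enabling fast, zero-code automated testing of computer vision (CV) image recognition algorithms: pedestrian detection, animal and plant classification, face recognition, OCR license plate recognition, rotation correction, dance pose estimation, matting and segmentation, and more. It also supports one-click download of test reports and export of training and test datasets.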

Updated Sep 4, 2025
JavaScript

decisionfacts / semantic-ai

An open source framework for Retrieval-Augmented Systems (RAG) that uses semantic search to retrieve the expected results and generates human-readable conversational responses with the help of an LLM (Large Language Model).

pdf machine-learning ocr deep-neural-networks openai docx approximate-nearest-neighbor-search semantic-search document-parser rag fastapi vector-database inference-api openai-api llm retrieval-augmented-generation llama2

Updated Jul 19, 2024
Python

yas-sim / openvino-ep-enabled-onnxruntime

Describing How to Enable OpenVINO Execution Provider for ONNX Runtime

deep-learning intel inference inference-engine inference-library onnx onnx-format onnx-backend openvino onnxruntime inference-api openvino-toolkit

Updated Jun 29, 2020
C++
