One API is an open-source API gateway that provides a unified interface for various AI models, allowing users to manage and utilize multiple AI services through a single platform. It supports token management, quota management, and usage statistics, making it easier for developers to integrate and manage different AI models in their applications.
With one-api, you can aggregate various AI APIs behind one endpoint and call them using the ChatCompletion, Response, or Claude Messages API formats as needed.
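For example, once a token is issued, a single ChatCompletion-format request can reach any configured upstream. A minimal sketch, assuming the gateway runs at https://oneapi.laisky.com, `sk-xxxxxxx` is a valid token, and the model name is a placeholder for whatever channel you have configured:

```bash
# Call an Anthropic model through the gateway using the OpenAI ChatCompletion format.
curl -s https://oneapi.laisky.com/v1/chat/completions \
  -H "Authorization: Bearer sk-xxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "claude-3-5-haiku-20241022", "messages": [{"role": "user", "content": "hello"}]}'
```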
The original author of one-api has been inactive for a long time, leaving a backlog of unmerged PRs. I therefore forked the code and merged the PRs I consider important. Everyone is welcome to submit PRs; I will review and handle them actively and quickly.
Fully compatible with the upstream version; you can switch over simply by replacing the container image. Docker images:

- `ppcelery/one-api:latest`
- `ppcelery/one-api:arm64-latest`
You are also welcome to register for and use my deployed one-api gateway, which supports various mainstream models. For usage instructions, please refer to https://wiki.laisky.com/projects/gpt/pay/cn/#page_gpt_pay_cn.
- One API
- Tutorial
- Contributors
- New Features
- Universal Features
- OpenAI Features
- (Merged) Support gpt-vision
- Support openai images edits
- Support OpenAI o1/o1-mini/o1-preview
- Support gpt-4o-audio
- Support OpenAI web search models
- Support gpt-image-1's image generation & edits
- Support o3-mini
- Support o3 & o4-mini & gpt-4.1
- Support o3-pro & reasoning content
- Support OpenAI Response API
- Anthropic (Claude) Features
- Google (Gemini & Vertex) Features
- Support gemini-2.0-flash-exp
- Support gemini-2.0-flash
- Support gemini-2.0-flash-thinking-exp-01-21
- Support Vertex Imagen3
- Support gemini multimodal output #2197
- Support gemini-2.5-pro
- Support GCP Vertex global region and gemini-2.5-pro-preview-06-05
- Support gemini-2.5-flash-image-preview & imagen-4 series
- AWS Features
- Replicate Features
- DeepSeek Features
- OpenRouter Features
- Coze Features
- XAI / Grok Features
- Black Forest Labs Features
- Bug fix
Run one-api using docker-compose:
```yaml
oneapi:
  image: ppcelery/one-api:latest
  restart: unless-stopped
  logging:
    driver: "json-file"
    options:
      max-size: "10m"
  environment:
    # (optional) SESSION_SECRET sets a fixed session secret so that user sessions won't be invalidated after server restart
    SESSION_SECRET: xxxxxxx
    # (optional) If you access one-api using a non-HTTPS address, you need to set DISABLE_COOKIE_SECURE to true
    DISABLE_COOKIE_SECURE: "true"
    # (optional) DEBUG enables debug mode
    DEBUG: "true"
    # (optional) DEBUG_SQL displays SQL logs
    DEBUG_SQL: "true"
    # (optional) REDIS_CONN_STRING sets the Redis cache connection
    REDIS_CONN_STRING: redis://100.122.41.16:6379/1
    # (optional) SQL_DSN sets the SQL database connection;
    # default is sqlite3, supports mysql, postgresql, sqlite3
    SQL_DSN: "postgres://laisky:xxxxxxx@1.2.3.4/oneapi"
    # (optional) ENFORCE_INCLUDE_USAGE requires upstream API responses to include the usage field
    ENFORCE_INCLUDE_USAGE: "true"
    # (optional) MAX_ITEMS_PER_PAGE maximum items per page, default is 10
    MAX_ITEMS_PER_PAGE: 10
    # (optional) GLOBAL_API_RATE_LIMIT maximum API requests per IP within three minutes, default is 1000
    GLOBAL_API_RATE_LIMIT: 1000
    # (optional) GLOBAL_WEB_RATE_LIMIT maximum web page requests per IP within three minutes, default is 1000
    GLOBAL_WEB_RATE_LIMIT: 1000
    # (optional) GLOBAL_RELAY_RATE_LIMIT /v1 API rate limit for each token
    GLOBAL_RELAY_RATE_LIMIT: 1000
    # (optional) GLOBAL_CHANNEL_RATE_LIMIT whether to rate limit per channel; 0 is unlimited, 1 enables the rate limit
    GLOBAL_CHANNEL_RATE_LIMIT: 1
    # (optional) FRONTEND_BASE_URL redirects page requests to the specified address, server-side setting only
    FRONTEND_BASE_URL: https://oneapi.laisky.com
    # (optional) OPENROUTER_PROVIDER_SORT sets the sorting method for OpenRouter providers, default is throughput
    OPENROUTER_PROVIDER_SORT: throughput
    # (optional) CHANNEL_SUSPEND_SECONDS_FOR_429 sets the duration of channel suspension after receiving a 429 error, default is 60 seconds
    CHANNEL_SUSPEND_SECONDS_FOR_429: 60
    # (optional) DEFAULT_MAX_TOKEN sets the default maximum number of tokens for requests, default is 2048
    DEFAULT_MAX_TOKEN: 2048
    # (optional) MAX_INLINE_IMAGE_SIZE_MB sets the maximum allowed image size (in MB) for inlining images as base64, default is 30
    MAX_INLINE_IMAGE_SIZE_MB: 30
    # (optional) LOG_PUSH_API sets the API address for pushing error logs to Telegram.
    # More information about log push can be found at: https://github.com/Laisky/laisky-blog-graphql/tree/master/internal/web/telegram
    LOG_PUSH_API: "https://gq.laisky.com/query/"
    LOG_PUSH_TYPE: "oneapi"
    LOG_PUSH_TOKEN: "xxxxxxx"
  volumes:
    - /var/lib/oneapi:/data
  ports:
    - 3000:3000
```
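Then bring the service up (assuming the snippet above is placed under the `services:` key of a `docker-compose.yml`):

```bash
# Start one-api in the background; the web console listens on port 3000.
docker compose up -d
```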
The initial default account is `root` with password `123456`.
You can update the used quota using the API key of any token, allowing consumption from other services to be aggregated into one-api for centralized management.
Each chat completion response includes an `X-Oneapi-Request-Id` header. You can use this request id to call `GET /api/cost/request/:request_id` to retrieve the cost of that request.
The returned structure is:
```go
type UserRequestCost struct {
	Id          int     `json:"id"`
	CreatedTime int64   `json:"created_time" gorm:"bigint"`
	UserID      int     `json:"user_id"`
	RequestID   string  `json:"request_id"`
	Quota       int64   `json:"quota"`
	CostUSD     float64 `json:"cost_usd" gorm:"-"`
}
```
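A minimal sketch of the round trip, assuming the gateway runs at https://oneapi.laisky.com and `sk-xxxxxxx` is a valid token:

```bash
# Make a chat completion request and capture the response headers.
curl -sD headers.txt https://oneapi.laisky.com/v1/chat/completions \
  -H "Authorization: Bearer sk-xxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "hi"}]}' > /dev/null

# Extract the request id from the X-Oneapi-Request-Id header.
request_id=$(grep -i '^x-oneapi-request-id:' headers.txt | tr -d '\r' | awk '{print $2}')

# Query the cost of that request (assumes the same token is authorized to read it).
curl -s "https://oneapi.laisky.com/api/cost/request/${request_id}" \
  -H "Authorization: Bearer sk-xxxxxxx"
```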
Now supports cached input, which can significantly reduce the cost.
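When cached input is hit, the savings show up in the usage statistics. A sketch, assuming the upstream returns the OpenAI-style `usage.prompt_tokens_details.cached_tokens` field:

```bash
# Inspect how many prompt tokens were served from cache.
curl -s https://oneapi.laisky.com/v1/chat/completions \
  -H "Authorization: Bearer sk-xxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "hi"}]}' \
  | jq '.usage.prompt_tokens_details.cached_tokens'
```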
Supports two URL parameters: `thinking` and `reasoning_format`.

- `thinking`: whether to enable thinking mode, disabled by default.
- `reasoning_format`: specifies the format of the returned reasoning.
  - `reasoning_content`: DeepSeek official API format, returned in the `reasoning_content` field.
  - `reasoning`: OpenRouter format, returned in the `reasoning` field.
  - `thinking`: Claude format, returned in the `thinking` field.
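For example, a sketch that enables thinking and requests the reasoning in DeepSeek's format (base URL, token, and the `thinking=true` value are assumptions based on the parameter descriptions above):

```bash
# Enable thinking mode and return reasoning in the reasoning_content field.
curl -s "https://oneapi.laisky.com/v1/chat/completions?thinking=true&reasoning_format=reasoning_content" \
  -H "Authorization: Bearer sk-xxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{"model": "deepseek-r1", "messages": [{"role": "user", "content": "why is the sky blue?"}]}'
```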
Supports `gpt-4o-search-preview` & `gpt-4o-mini-search-preview`.
Partially supported, still in development.
By default, the thinking mode is not enabled. You need to manually pass the `thinking` field in the request body to enable it.
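A minimal sketch against the Claude Messages endpoint; the `thinking` object follows Anthropic's documented shape, while the base URL, model name, and budget value are placeholders:

```bash
curl -s https://oneapi.laisky.com/v1/messages \
  -H "Authorization: Bearer sk-xxxxxxx" \
  -H "anthropic-version: 2023-06-01" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-20250514",
    "max_tokens": 2048,
    "thinking": {"type": "enabled", "budget_tokens": 1024},
    "messages": [{"role": "user", "content": "why is the sky blue?"}]
  }'
```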
```bash
export ANTHROPIC_MODEL="openai/gpt-oss-120b"
export ANTHROPIC_BASE_URL="https://oneapi.laisky.com/"
export ANTHROPIC_AUTH_TOKEN="sk-xxxxxxx"
```
You can use any model you like for Claude Code, even if the model doesn’t natively support the Claude Messages API.
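With those variables exported, launching Claude Code in your project directory should route all of its requests through the gateway (assuming the `claude` CLI is installed):

```bash
# Claude Code picks up ANTHROPIC_BASE_URL and talks to one-api instead of Anthropic directly.
claude
```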
By default, the thinking mode is automatically enabled for the deepseek-r1 model, and the response is returned in the OpenRouter format.
- BUGFIX: Several issues when updating tokens #1933
- feat(audio): count whisper-1 quota by audio duration #2022
- fix: Fix issue where high-quota users using low-quota tokens aren't pre-charged, causing large token deficits under high concurrency #25
- fix: channel test false negative #2065
- fix: resolve "bufio.Scanner: token too long" error by increasing buffer size #2128
- feat: Enhance VolcEngine channel support with bot model #2131
- fix: models API returns models in deactivated channels #2150
- fix: Automatically close channel when connection fails
- fix: update EmailDomainWhitelist submission logic #33
- fix: send ByAll
- fix: oidc token endpoint request body #2106 #36