A simple server with two API endpoints. It performs two functions:
- counting tokens in strings or chat completion objects; and
- limiting the length of strings/chat completion objects so they fit under a given token count (e.g. keeping strings under the embeddings max length, ensuring chat completion message history is under the max token length, or counting usage tokens for streaming chat completions).

If you are only after the functionality, the logic is contained within the `token_service.py` file.
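The core idea is roughly the following (a minimal sketch using `tiktoken` directly; the function names here are illustrative and not necessarily the actual `token_service.py` API):

```python
import tiktoken

# Illustrative only: the real helpers live in token_service.py and may differ.
ENCODING = tiktoken.get_encoding("cl100k_base")  # encoding used by gpt-4 family models

def count_tokens(text: str) -> int:
    """Return the number of tokens in a plain string."""
    return len(ENCODING.encode(text))

def truncate_to_max_tokens(text: str, max_tokens: int) -> str:
    """Trim a string so it fits under max_tokens (e.g. 8192 for embeddings)."""
    tokens = ENCODING.encode(text)
    if len(tokens) <= max_tokens:
        return text
    return ENCODING.decode(tokens[:max_tokens])
```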
The first endpoint takes text and a number. The number is the maximum number of tokens you want the string to be. For example, if you are using the OpenAI/Azure OpenAI service for embeddings, you can make a call with the string and a limit of 8192 tokens to keep it under the embeddings max input length. `model` is optional and will default to `gpt-4o`.
{ "text": "the string to be counted", "number": 8192, "model": "gpt-4o" }
The second endpoint takes an array of chat message history objects, as per the OpenAI/Azure OpenAI service chat message format. It will count image tokens as part of the request, as well as `function_call` messages and all other message and content types currently supported by the OpenAI specification. The request takes the body below and returns the list of `chatMessages` that fit within the `numberOfMaxTokens` you specified, along with the final `token_count` of the messages. `model` is optional and will default to `gpt-4o`.
{ "messages": listOfOpenAIChatMessages, "number": numberOfMaxTokens , "model": "gpt-4o" }
To get this code up and running, simply clone this repository and run:

```bash
pip install -r requirements.txt
```

Then run the server with (update the port as necessary):

```bash
uvicorn main:app --host 0.0.0.0 --port 8000
```
The code is set up to handle basic API key auth. All you need to do is set the `API_KEY` environment variable to a key of your choice and pass that value when calling the API by setting the `X-Api-Key` header to the same value.
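With auth enabled, a call would look something like this (a sketch; the `/truncate` route name is again an assumption):

```python
import os
import requests

# The server reads API_KEY from its environment; the client sends the same
# value in the X-Api-Key header.
headers = {"X-Api-Key": os.environ["API_KEY"]}

payload = {"text": "the string to be counted", "number": 8192}

# Hypothetical route name; see main.py for the actual path.
resp = requests.post("http://localhost:8000/truncate", json=payload, headers=headers)
print(resp.json())
```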
- This code has only been tested for accuracy on the `cl100k_base` encoding using `gpt-4` models. How tokens are counted by the OpenAI API, and how you are charged, may change; as such, it is recommended to test thoroughly before using this in any production capacity.