Vision Token Calculator

A Python tool for calculating the number of tokens generated when processing images with various Vision Language Models (VLMs).

Features

Calculate image tokens for different VLMs
Support for both existing images and dummy images
Detailed token analysis including image size and token count
Easy-to-use command line interface

Installation

Option 1: Install from PyPI (recommended)

pip install vt-calc

Option 2: Install as editable package (for development)

pip install -e .

Option 3: Install dependencies only

pip install -r requirements.txt

Usage

Method 1: Using the vt-calc command (after pip install -e .)

After installing with pip install -e ., you can use the vt-calc command directly:

# Using an existing image
vt-calc --image path/to/your/image.jpg

# Creating a dummy image with specific dimensions
vt-calc --size 1920 1080

# Specifying a different model
vt-calc --image path/to/your/image.jpg --model-path "model/path"

Method 2: Direct python execution

# Using an existing image
python calculate.py --image path/to/your/image.jpg

# Creating a dummy image with specific dimensions
python calculate.py --size 1920 1080

# Specifying a different model
python calculate.py --image path/to/your/image.jpg --model-path "model/path"

Supported Models

Model	Model size
Qwen2.5-VL	3B / 7B / 32B / 72B
Gemma3	4B / 12B / 27B
InternVL3	1B / 2B / 8B / 14B / 38B / 78B

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
calculate.py		calculate.py
check_style.sh		check_style.sh
printer.py		printer.py
requirements.txt		requirements.txt
setup.py		setup.py
setup_env.py		setup_env.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vision Token Calculator

Features

Installation

Option 1: Install from PyPI (recommended)

Option 2: Install as editable package (for development)

Option 3: Install dependencies only

Usage

Method 1: Using the vt-calc command (after pip install -e .)

Method 2: Direct python execution

Supported Models

License

About

Uh oh!

Releases

Packages

Languages

License

thisisiron/vision-token-calculator

Folders and files

Latest commit

History

Repository files navigation

Vision Token Calculator

Features

Installation

Option 1: Install from PyPI (recommended)

Option 2: Install as editable package (for development)

Option 3: Install dependencies only

Usage

Method 1: Using the vt-calc command (after pip install -e .)

Method 2: Direct python execution

Supported Models

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages