This repository contains a benchmark that measures the token usage and response time impact of passing different numbers of tools to a large language model.
When working with LLMs that support function/tool calling, there's a tradeoff between:
- Providing more tools (giving the model more capabilities)
- Increasing token usage (which affects cost and latency)
This benchmark quantifies that relationship by measuring:
- Cost overhead vs. number of tools provided
- Response time vs. number of tools provided
Set up your API keys, using `.env.example` as a template.
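A minimal sketch of how those keys might be loaded in Python with `python-dotenv`; the variable names below are assumptions, so use whatever `.env.example` actually defines:

```python
import os

from dotenv import load_dotenv  # from the python-dotenv package

# Read a local .env file into the process environment.
load_dotenv()

# These key names are assumptions; check .env.example for the real ones.
openai_key = os.getenv("OPENAI_API_KEY")
unifai_key = os.getenv("UNIFAI_API_KEY")
```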
To run the benchmark:
`uv run main.py`
The script will:
- Fetch all available toolkits from UnifAI
- For each benchmark run (see the sketch after this list):
  - Randomly select n toolkits (where n ranges from 1 to the total number)
  - Get the tools from these toolkits
  - Make an LLM call with these static tools
  - Record token usage and response time
  - Save results to `benchmark_results.jsonl`
- Plot the results with linear regression
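As a rough illustration of that loop (not the repository's actual implementation), the sketch below uses hypothetical helpers `fetch_toolkits`, `get_tools`, and `call_llm`; only the timing and JSONL bookkeeping are meant literally:

```python
import json
import random
import time

def run_benchmark(fetch_toolkits, get_tools, call_llm, runs=20,
                  results_path="benchmark_results.jsonl"):
    """Hypothetical sketch: measure token usage and latency vs. tool count."""
    toolkits = fetch_toolkits()  # all available toolkits (e.g. from UnifAI)
    with open(results_path, "a") as f:  # append so repeated runs accumulate
        for _ in range(runs):
            n = random.randint(1, len(toolkits))   # how many toolkits to pass
            selected = random.sample(toolkits, n)
            tools = [tool for tk in selected for tool in get_tools(tk)]

            start = time.monotonic()
            response = call_llm(tools=tools)        # one static-tools LLM call
            elapsed = time.monotonic() - start

            record = {
                "num_toolkits": n,
                "num_tools": len(tools),
                # assumes call_llm reports prompt-token usage in its result
                "prompt_tokens": response["prompt_tokens"],
                "response_time_s": elapsed,
            }
            f.write(json.dumps(record) + "\n")
```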
After running the benchmark, you'll get:
- A scatter plot showing cost overhead vs. number of tools
- A scatter plot showing response time vs. number of tools
- Linear regression lines for both relationships
The plots will be saved as `benchmark_results.png`.
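For reference, a minimal sketch of producing such plots from the JSONL records with `matplotlib` and `numpy` (the record field names are assumptions):

```python
import json

import matplotlib.pyplot as plt
import numpy as np

# Load accumulated records; the field names are assumptions, not the script's schema.
with open("benchmark_results.jsonl") as f:
    records = [json.loads(line) for line in f]

x = np.array([r["num_tools"] for r in records])

fig, axes = plt.subplots(1, 2, figsize=(10, 4))
for ax, field, label in [
    (axes[0], "prompt_tokens", "Cost overhead (prompt tokens)"),
    (axes[1], "response_time_s", "Response time (s)"),
]:
    y = np.array([r[field] for r in records])
    slope, intercept = np.polyfit(x, y, 1)  # degree-1 fit = linear regression
    ax.scatter(x, y, alpha=0.6)
    ax.plot(x, slope * x + intercept, color="red")
    ax.set_xlabel("Number of tools")
    ax.set_ylabel(label)

fig.tight_layout()
fig.savefig("benchmark_results.png")
```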
Results are accumulated across runs in the `benchmark_results.jsonl` file, so you can run the script multiple times to gather more data points.