UnifAI Search Benchmark

This repository contains a benchmark for evaluating the search performance (recall) of UnifAI's service discovery functionality.

Overview

The benchmark simulates realistic service discovery scenarios by:

Generating queries that users might use when looking for specific services
Measuring whether the expected service appears in the search results, and at what position
Calculating recall@k metrics (the percentage of queries where the expected service appears in the top k results)

Results

The graph shows the recall rate at different values of k, where k represents the top k search results. A higher recall rate indicates better search accuracy.

Dataset

The benchmark uses generated search queries stored in search_queries.jsonl. Each query represents a realistic user request paired with the expected service that should be found.

The test data was generated by LLM (with access to UnifAI tools through unifai-mcp-server), and you can view the prompt and generation process here

Note

Results may vary based on the specific search queries used and the total number of available actions. At the time of this benchmark, there were 89 actions available in the UnifAI ecosystem.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
recall_at_k.csv		recall_at_k.csv
recall_at_k.png		recall_at_k.png
search_queries.jsonl		search_queries.jsonl
search_service_positions.jsonl		search_service_positions.jsonl
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

UnifAI Search Benchmark

Overview

Results

Dataset

Note

About

Uh oh!

Releases

Packages

Uh oh!

Languages

unifai-network/unifai-search-benchmark

Folders and files

Latest commit

History

Repository files navigation

UnifAI Search Benchmark

Overview

Results

Dataset

Note

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages