📖 ChapterhouseDB

A distributed SQL query engine built in Rust, designed for extensibility and developer-first workflows. While still in early development, its long-term vision is to provide a seamless environment where SQL and Rust work together to power data pipelines and backend systems. You will be able to write Rust to connect to APIs, transform complex or nested data, and define custom operators, and use SQL to orchestrate processing, move and clean data, and manage storage, all in a unified, scalable engine.

Note

This project has been renamed to ChapterhouseDB from ChapterhouseQE.

Running the Docker Container

Build the image

DOCKER_BUILDKIT=1 docker compose build chdb-debug-node

Start a container

DOCKER_BUILDKIT=1 docker compose up chdb-debug-node

At this point the system will be ready to accept requests. The image is built with a small set of example datasets that can be queried.

Running the TUI

You can run the TUI with a set of example queries using this command:

cargo run --bin client_tui -- --sql-file="sample_queries/simple.sql" --connect-to-address="127.0.0.1"

The TUI will send the queries to the worker and allow you to visualize the result data in a table.

🛢️ Supported SQL

🛠 Architecture

The system is built upon a set of distributed actors that communicate through messages. Each worker can communicate with all other workers connected to it and any worker can accept and manage queries. Queries create operators, a type of actor capable of performing the tasks necessary to compute a query result. For example, the query:

select * from read_files('simple/*.parquet')
  where value2 > 10.0;

will produce these operators:

[read files] -> [exchange] -> [filter] -> [exchange] -> [materialize] -> [exchange]
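The chain above places an exchange after every compute operator. A minimal sketch of that layout follows, assuming a hypothetical `Operator` enum and `with_exchanges` helper. Neither name comes from the ChapterhouseDB source, and the real planner may build its plan differently.

```rust
/// Hypothetical operator kinds; the names mirror the labels in the chain above.
#[derive(Debug, Clone, PartialEq)]
enum Operator {
    ReadFiles,
    Filter,
    Materialize,
    Exchange,
}

/// Follows every compute operator with an exchange, so each operator
/// pushes into an exchange and the next operator pulls from it.
fn with_exchanges(compute: &[Operator]) -> Vec<Operator> {
    let mut plan = Vec::with_capacity(compute.len() * 2);
    for op in compute {
        plan.push(op.clone());
        plan.push(Operator::Exchange);
    }
    plan
}

fn main() {
    let plan = with_exchanges(&[
        Operator::ReadFiles,
        Operator::Filter,
        Operator::Materialize,
    ]);
    // Render the plan in the same arrow notation used above.
    let labels: Vec<String> = plan.iter().map(|op| format!("[{:?}]", op)).collect();
    println!("{}", labels.join(" -> "));
}
```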

Each operator in this query can run as multiple instances so that its task is computed in parallel. Each operator performs some operation on an Apache Arrow record batch. The read files operator reads records from the Parquet files and pushes them to the exchange operator. The filter operator then pulls the next available record from that exchange operator and produces a record containing only the data matching the "where" expression. This continues until the DAG of operators has completed. Structuring the operators this way makes it relatively easy to create new ones, since each operator pulls data from either an exchange or an external source and pushes data to an exchange.
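The pull-from-exchange, push-to-exchange pattern can be sketched with standard library channels standing in for exchanges. This is an illustration under stated assumptions, not the project's implementation: `Batch` is a hypothetical stand-in for an Arrow record batch, `run_filter` and `run_pipeline` are made-up names, and the real operators are distributed actors rather than local threads.

```rust
use std::sync::mpsc::{channel, Receiver, Sender};
use std::thread;

/// Hypothetical stand-in for an Arrow record batch: a single `value2` column.
#[derive(Debug, Clone, PartialEq)]
struct Batch {
    value2: Vec<f64>,
}

/// Hypothetical filter operator: pulls batches from the inbound exchange,
/// keeps rows where `value2 > threshold`, and pushes the result downstream.
fn run_filter(inbound: Receiver<Batch>, outbound: Sender<Batch>, threshold: f64) {
    // `recv` returns Err once every upstream sender has been dropped,
    // which is how this sketch signals that no more data will arrive.
    while let Ok(batch) = inbound.recv() {
        let kept: Vec<f64> = batch.value2.into_iter().filter(|v| *v > threshold).collect();
        if outbound.send(Batch { value2: kept }).is_err() {
            break; // the downstream receiver has gone away
        }
    }
}

/// Wires [source] -> [exchange] -> [filter] -> [exchange] and collects the output.
fn run_pipeline(input: Vec<Batch>, threshold: f64) -> Vec<Batch> {
    let (to_filter, filter_in) = channel();
    let (filter_out, results) = channel();

    // The source plays the role of the read files operator.
    let source = thread::spawn(move || {
        for batch in input {
            to_filter.send(batch).expect("filter exchange closed");
        }
        // `to_filter` is dropped here, closing the first exchange.
    });
    let filter = thread::spawn(move || run_filter(filter_in, filter_out, threshold));

    // Iteration ends once the filter drops its sender.
    let collected: Vec<Batch> = results.iter().collect();
    source.join().unwrap();
    filter.join().unwrap();
    collected
}

fn main() {
    let input = vec![
        Batch { value2: vec![1.0, 12.5, 30.0] },
        Batch { value2: vec![9.9, 10.0, 10.1] },
    ];
    for batch in run_pipeline(input, 10.0) {
        println!("{:?}", batch);
    }
}
```

Because the filter only sees a receiver and a sender, a new operator in this sketch is just another function with the same two ends, which is the property the paragraph above describes.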

Future Functionality

Support common SQL operations such as those listed in the "Supported SQL" section.
Support UDFs and custom deployable operators that act as data sources.
Create a Kubernetes integration that allows the system to scale based on demand. The nodes will need to communicate their cluster IP through S3.


alekLukanen/ChapterhouseDB
