The Cheshire Cat is a framework to build custom AI agents:
- ⚡️ API first, to easily add a conversational layer to your app
- 💬 Chat via WebSocket and manage your agent with a customizable REST API
- 🐘 Built-in RAG with Qdrant
- 🚀 Extensible via plugins
- 🪛 Event callbacks, function calling (tools), conversational forms
- 🏛 Easy to use admin panel
- 🌍 Supports any language model via langchain
- 👥 Multiuser with granular permissions, compatible with any identity provider
- 💬 Multi-chatbots, with configurable (even different) LLM per chatbot, plus specific knowledge per chatbot
- 💬 Remembers conversations and documents and uses them in conversation
- ✂️ Customizable chunking and embedding
- ☁️ Cloud Ready, working even with horizontal autoscaling
- 🐋 100% dockerized
- 🦄 Active Discord community and easy to understand docs
The current version is a multi-tenant fork of the original Cheshire Cat. Here are the main differences:
- New features: this fork introduces some new features and improvements, such as:
  - The `Embedder` is centralized and can be shared by multiple RAGs and other language models.
  - A new `File Manager` that lets you store the files injected into the knowledge base of each RAG on a remote storage.
  - New chunking strategies, such as text splitting and semantic chunking.
  - New admin endpoints to configure the `Embedder`.
  - New endpoints to configure the `File Manager`, per chatbot.
  - New endpoints to configure the chunking strategy, per chatbot.
  - A new event system that gives you fine-grained control over the AI.
  - The ability to manage multiple RAGs and other language models at the same time.
- Customizable multi-chatbots: this version provides a platform where each chatbot is fully customizable in terms of
  plugins, settings, LLM, etc. The way of "injecting" the identification of the chatbot (RAG) is simple (see the sketch after this list):
  - for the HTTP API endpoints, pass the `agent_id` key in the request headers or as a querystring parameter;
  - for the WebSocket API, put the `agent_id` in the URL, e.g. `/ws/{agent_id}`.
- Cloud ready: this version can be deployed in a cluster environment. Whilst the original version stored the settings in
  JSON files, this version requires a Redis database to store the settings, the conversation histories, the plugins and so
  forth. You can configure the Redis database via environment variables; the provided `compose.yml` file serves as an example.
  The Cat is still stateless, so it can be easily scaled. In a cluster environment, we suggest mounting a shared storage in the
  `cat/plugins` folder to share the plugins across replicas. Hence, the current version is multi-tenant, meaning that you can
  manage multiple RAGs and other language models at the same time.
- Security: the original project is developed as a framework that can be used both for personal use and for single-tenant
  production; in the latter case, the original documentation clearly states that a secure environment must be set up by using
  an API Key. This version enforces that requirement: if an API Key is not configured, the Cat will not work at all.
  In this way, I tried to make the Cat more secure and production-ready.
- Additional implementations: the structure used for configuring the `Embedder`, the `LLMs`, the `Authorization Handler`
  and the `File Manager` differs from the original version: interfaces and factories have been used for this purpose.
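As a quick illustration of the multi-chatbot routing described above, here is a minimal Python sketch of an HTTP call. It is only a sketch: the POST /message route, the "my_chatbot" identifier and the payload shape are assumptions, not the definitive API; check the interactive docs exposed by your instance (localhost:1865/docs) for the actual routes and the required API Key headers.

import requests

# Assumption: the Cat is reachable on localhost:1865 (default port mapping).
BASE_URL = "http://localhost:1865"

response = requests.post(
    f"{BASE_URL}/message",  # hypothetical route: verify it in /docs
    headers={
        "agent_id": "my_chatbot",  # identifies the target chatbot (RAG)
    },
    json={"text": "Hello, Cat!"},  # payload shape is an assumption
)
print(response.json())

# The same identifier can travel as a querystring parameter instead:
# requests.post(f"{BASE_URL}/message?agent_id=my_chatbot", json={"text": "Hello, Cat!"})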
This new version is completely compatible with the original version, so you can easily migrate your existing plugins and settings to the new version. It is still in development, but you can already try it out by running the Docker image. New features will be added in the future. Please contact us if you want to contribute.
To make Cheshire Cat run on your machine, you just need docker installed:
docker run --rm -it -p 1865:80 ghcr.io/matteocacciola/cheshirecat-core:2.0.7
- Try out the Cheshire Cat API on localhost:1865/docs.
Since this version is intended as a microservice, the admin
panel is no longer available. You can still use widgets from
the original project to manage the Cat.
As a first thing, the Cat will ask you to configure your favourite language model. Since the admin panel is not available in this version, this can be done via the REST API (see the interactive docs at localhost:1865/docs) or through the widgets of the original project.
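Once a model is configured, you can already chat with the Cat over the WebSocket endpoint described earlier. A minimal sketch with the websockets library, assuming the default port mapping and a placeholder chatbot id "my_chatbot" (the message payload shape is an assumption; check your instance's docs):

# pip install websockets
import asyncio
import json

import websockets


async def chat():
    # /ws/{agent_id}: "my_chatbot" is a placeholder chatbot identifier.
    # The payload shape ({"text": ...}) is an assumption; verify it in /docs.
    uri = "ws://localhost:1865/ws/my_chatbot"
    async with websockets.connect(uri) as ws:
        await ws.send(json.dumps({"text": "Hello, Cat!"}))
        print(json.loads(await ws.recv()))


asyncio.run(chat())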
Enjoy the Cat!
Follow instructions on how to run it with docker compose and volumes.
Hooks (events)
from cat.mad_hatter.decorators import hook

# hooks are an event system to get fine-grained control over your assistant
@hook
def agent_prompt_prefix(prefix, cat):
    prefix = """You are Marvin the socks seller, a poetic vendor of socks.
You are an expert in socks, and you reply with exactly one rhyme.
"""
    return prefix
Tools
from cat.mad_hatter.decorators import tool

# langchain inspired tools (function calling)
@tool(return_direct=True)
def socks_prices(color, cat):
    """How much do socks cost? Input is the sock color."""
    prices = {
        "black": 5,
        "white": 10,
        "pink": 50,
    }
    price = prices.get(color, 0)
    return f"{price} bucks, meeeow!"
Conversational Forms
from pydantic import BaseModel
from cat.experimental.form import form, CatForm

# data structure to fill up
class PizzaOrder(BaseModel):
    pizza_type: str
    phone: int

# forms let you control goal oriented conversations
@form
class PizzaForm(CatForm):
    description = "Pizza Order"
    model_class = PizzaOrder
    start_examples = [
        "order a pizza!",
        "I want pizza",
    ]
    stop_examples = [
        "stop pizza order",
        "not hungry anymore",
    ]
    ask_confirm = True

    def submit(self, form_data):
        # do the actual order here!

        # return to convo
        return {
            "output": f"Pizza order on its way: {form_data}"
        }
For your PHP based projects, I developed a PHP SDK that allows you to easily interact with the Cat. Please refer to the SDK documentation for more information.
For your Node.js based projects, I developed a Node.js SDK that allows you to easily interact with the Cat. Please refer to the SDK documentation for more information.
List of resources:
- Official Documentation
- PHP SDK
- Typescript SDK
- Python SDK
- Discord Server
- Website
- Tutorial - Write your first plugin
- ⚡️ API first, so you get a microservice to easily add a conversational layer to your app
- 🐘 Remembers conversations and documents and uses them in conversation
- 🚀 Extensible via plugins (public plugin registry + private plugins allowed)
- 🎚 Event callbacks, function calling (tools), conversational forms
- 🏛 Easy to use admin panel (chat, visualize memory and plugins, adjust settings)
- 🌍 Supports any language model (works with OpenAI, Google, Ollama, HuggingFace, custom services)
- 💬 Multi-chatbots, with configurable (even different) LLM per chatbot, plus specific knowledge per chatbot
- ☁️ Cloud Ready, working even with horizontal autoscaling
- 🐋 Production ready - 100% dockerized
- 👩👧👦 Active Discord community and easy to understand docs
We are committed to openness, privacy and creativity, we want to bring AI to the long tail. If you want to know more about our vision and values, read the Code of Ethics.
All contributions are welcome! Fork the project, create a branch, and make your changes. Then, follow the contribution guidelines to submit your pull request.
If you like this project, give it a star ⭐! It is very important to have your support. Thanks again!🙏
Code is licensed under GPL3.
The Cheshire Cat AI logo and name are property of Piero Savastano (founder and maintainer). The current fork was created,
refactored and is maintained by Matteo Cacciola.
"Would you tell me, please, which way I ought to go from here?"
"That depends a good deal on where you want to get to," said the Cat.
"I don't much care where--" said Alice.
"Then it doesn't matter which way you go," said the Cat.
(Alice's Adventures in Wonderland - Lewis Carroll)