Mixture-of-Agents Demo Powered by Cerebras

This Streamlit application showcases the Mixture of Agents (MOA) architecture proposed by Together AI, powered by Cerebras LLMs. It allows users to interact with a configurable multi-agent system for enhanced AI-driven conversations.

Source: Adaptation of Together AI Blog - Mixture of Agents

Features

Interactive chat interface powered by MOA
Configurable main model and layer agents
Real-time streaming of responses
Visualization of intermediate layer outputs
Customizable agent parameters through the UI

Installation

Clone the repository:

git clone https://github.com/kevint-cerebras/cerebras-moa.git
cd cerebras-moa

Install the required dependencies:
```
pip install -r requirements.txt
```
Set up your environment variables: Create a .env file in the root directory and add your Cerebras API key:
```
CEREBRAS_API_KEY=your_api_key_here
```

Usage

Run the Streamlit app:
```
streamlit run app.py
```
Open your browser and navigate to http://localhost:8501
The application provides two main sections:
- Left sidebar: Configure the MOA system
- Main panel: Chat interface
You can adjust various parameters in real-time, including:
- Main model selection
- Number of cycles (layers)
- Temperature settings
- Layer agent configurations

How It Works

The Mixture of Agents (MOA) architecture involves:

Main Agent: The primary LLM that generates the final response
Layer Agents: Multiple LLMs that analyze the query and provide insights
Cycles: Iterative process where layer agents contribute to enhancing the response

When you submit a query, it goes through these steps:

Layer agents process the query in parallel
Their outputs are combined and formatted
This combined insight is passed to the main agent
The main agent generates the final response

Implementation Details

This implementation uses the Cerebras Cloud SDK directly without any LangChain dependencies, providing:

Direct API calls to Cerebras' high-performance inference endpoints
Custom conversation memory management
Efficient handling of parallel agent execution
Streamlined prompt formatting and response handling

Advanced Configuration

The application allows you to customize the system by editing JSON configurations directly in the UI. You can:

Modify system prompts
Change models for individual layer agents
Adjust temperature and other parameters
Save and load configurations

Models

This demo uses Cerebras' API to access various LLMs, including:

Llama-3.3-70B
Llama3.1-8B
Llama-4-scout-17b-16e-instruct
Qwen-3-32B

Cerebras Technology

Cerebras Systems has developed the world's largest and fastest AI processor, the Wafer-Scale Engine (WSE). This technology powers the inference API used in this application, providing:

Unprecedented speed for AI inference workloads
High throughput for commercial applications
Seamless scaling for complex AI tasks

Credits

MOA architecture: Together AI
LLM inference: Cerebras
Original research paper: arXiv:2406.04692

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions to this project are welcome! Please follow these steps to contribute:

Fork the repository
Create a new branch for your feature or bug fix
Make your changes and commit them with descriptive commit messages
Push your changes to your fork
Submit a pull request to the main repository

Please ensure that your code adheres to the project's coding standards and includes appropriate tests and documentation.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

Cerebras for providing the underlying language models
Together AI for proposing the Mixture of Agents architecture and providing the conceptual image
Streamlit for the web application framework

Citation

This project implements the Mixture-of-Agents architecture proposed in the following paper:

@article{wang2024mixture,
  title={Mixture-of-Agents Enhances Large Language Model Capabilities},
  author={Wang, Junlin and Wang, Jue and Athiwaratkun, Ben and Zhang, Ce and Zou, James},
  journal={arXiv preprint arXiv:2406.04692},
  year={2024}
}

For more information about the Mixture-of-Agents concept, please refer to the original research paper and the Together AI blog post.

Contact

For questions or support, please open an issue on the GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
config		config
moa		moa
static		static
.env		.env
.gitattributes		.gitattributes
COMPETITION_README.md		COMPETITION_README.md
DEVICE_TRACKING_POLICY.md		DEVICE_TRACKING_POLICY.md
Dockerfile		Dockerfile
LICENSE		LICENSE
NO_BLOCKING_SYSTEM.md		NO_BLOCKING_SYSTEM.md
QUICK_START.md		QUICK_START.md
README.md		README.md
README_LLM_DETERMINISM.md		README_LLM_DETERMINISM.md
SECURITY_UPDATES.md		SECURITY_UPDATES.md
app.py		app.py
app_backup.py		app_backup.py
competition.db		competition.db
competition_config.json		competition_config.json
competition_ui.py		competition_ui.py
competitive_programming.py		competitive_programming.py
demo_comment_stripping.py		demo_comment_stripping.py
grader.py		grader.py
launch_competition.py		launch_competition.py
llm_determinism_guide.py		llm_determinism_guide.py
requirements.txt		requirements.txt
sample_config.json		sample_config.json
simple_determinism_demo.py		simple_determinism_demo.py
test_dateutil.py		test_dateutil.py
test_device_tracking.py		test_device_tracking.py
test_expanded_imports.py		test_expanded_imports.py
test_expected_outputs.py		test_expected_outputs.py
test_fixes.py		test_fixes.py
test_injection.py		test_injection.py
test_judging_determinism.py		test_judging_determinism.py
test_no_blocking.py		test_no_blocking.py
test_simple_imports.py		test_simple_imports.py
test_zero_days_active.py		test_zero_days_active.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mixture-of-Agents Demo Powered by Cerebras

Features

Installation

Usage

How It Works

Implementation Details

Advanced Configuration

Models

Cerebras Technology

Credits

License

Contributing

License

Acknowledgements

Citation

Contact

About

Uh oh!

Releases

Packages

Languages

License

brozzay/cerebras-moa

Folders and files

Latest commit

History

Repository files navigation

Mixture-of-Agents Demo Powered by Cerebras

Features

Installation

Usage

How It Works

Implementation Details

Advanced Configuration

Models

Cerebras Technology

Credits

License

Contributing

License

Acknowledgements

Citation

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages