🤖 Lean 4 Coding Agent

This repository contains an AI-powered agent system that automatically generates and verifies Lean 4 code, turning natural language problem statements into formally proven Lean programs. It's part of a project developed during the LLM Agents Learning MOOC, focused on reasoning, theorem proving, and agent-based AI systems.

🧠 Overview

The Lean Coding Agent combines multi-agent reasoning with code generation and verification. Given:

A natural language description (description.txt)
A Lean code template (task.lean)

It outputs:

An implemented function
A formal proof of correctness

All in valid Lean syntax — ready to compile and check!

⚙️ Architecture Multi-Agent Collaboration

The workflow involves multiple distinct steps that illustrate the cooperation between the two agents:

Step 1: Reasoning Agent Analysis

The Reasoning Agent (o3-mini) begins by receiving the problem description and the Lean code template. It:

Breaks down the problem description into smaller components
Suggests a general approach for solving the problem
Prepares a structured analysis to guide the next step

Step 2: LLM Agent Code Generation

The LLM Agent (gpt-4o) uses the Reasoning Agent's analysis to:

Generate the Lean code required to solve the problem
Produce the corresponding proof that supports the solution
Insert the solution into the designated template sections

Step 3: Execution and Verification

The generated code is executed in the Lean environment:

If successful, the solution is considered complete
If errors occur, the system collects the error messages

Step 4: Iterative Refinement

When errors are detected, the system enters a refinement phase where the LLM Agent:

Reanalyzes the problem and code
Refines the implementation and proof to ensure correctness

full explanation is on Lean4 Ai agent Doc

🧪 Main Workflow

Implemented in src/main.py, the key function is:

def main_workflow(problem_description: str, task_lean_code: str = "") -> LeanCode:
    """
    Takes a problem description and a Lean template, and returns completed code and proof.

    """

The output is a LeanCode dictionary:

{
  "code": "<function implementation>",
  "proof": "<formal proof>"
}

To run the agent and setup code you should follow the guidelines in INSTRUCTIONS.md for task setup and Lean 4 implementation.

Run the automated test suite to verify the agent against all test tasks:

make test

This command executes the agent on predefined problems and verifies that the generated code passes Lean's verification.

📦 Requirements

Python 3.12+
Lean 4
OpenAI-compatible LLMs (for GPT-4o access)
Dependencies in requirements.txt

📚 Related Course

This project was developed as part of the LLM Agents MOOC offered by UC Berkeley, led by Professor Dawn Song, Xinyun Chen, and Kaiyu Yang. The course covers:

Inference-time techniques for reasoning
LLM post-training methods
Code generation and verification
Autoformalization and theorem proving

More details: llmagents-learning.org/sp25

🤝 Contributing

PRs are welcome! Whether it's bug fixes, improvements, or ideas — feel free to fork and open a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.devcontainer		.devcontainer
documents		documents
lean_playground		lean_playground
src		src
tasks		tasks
tests		tests
--location		--location
.gitignore		.gitignore
Dockerfile.dev		Dockerfile.dev
INSTRUCTIONS.md		INSTRUCTIONS.md
Lean4CodeGenerator.lean		Lean4CodeGenerator.lean
Makefile		Makefile
README.md		README.md
lake-manifest.json		lake-manifest.json
lakefile.toml		lakefile.toml
lean-toolchain		lean-toolchain
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤖 Lean 4 Coding Agent

🧠 Overview

⚙️ Architecture Multi-Agent Collaboration

Step 1: Reasoning Agent Analysis

Step 2: LLM Agent Code Generation

Step 3: Execution and Verification

Step 4: Iterative Refinement

🧪 Main Workflow

📦 Requirements

📚 Related Course

🤝 Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Erfan-ram/Lean4-LLM-Ai-Agent-Mooc

Folders and files

Latest commit

History

Repository files navigation

🤖 Lean 4 Coding Agent

🧠 Overview

⚙️ Architecture Multi-Agent Collaboration

Step 1: Reasoning Agent Analysis

Step 2: LLM Agent Code Generation

Step 3: Execution and Verification

Step 4: Iterative Refinement

🧪 Main Workflow

📦 Requirements

📚 Related Course

🤝 Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages