PPO Bunny 🐰

A real-time web demonstration of Proximal Policy Optimization (PPO) featuring cute bunnies navigating complex environments to find optimal rewards.

Overview

PPO Bunny is an interactive visualization of reinforcement learning in action. Watch as multiple AI-controlled bunnies learn to navigate grid-based environments with PPO, avoiding obstacles and finding the reward.

Features

Real-time AI Training: See PPO agents learn and adapt in your browser
Multiple Difficulty Levels: Two distinct environments with increasing complexity
Smooth 3D Visualization: Built with React Three Fiber for performant 3D graphics
Multi-Agent System: 10 agents learning simultaneously
Dynamic Environments: Level 2 features moving obstacles for added challenge

Tech Stack

Frontend: Next.js 14, React, TypeScript
3D Graphics: React Three Fiber, Three.js
AI/ML: ONNX Runtime Web for in-browser inference
Styling: Tailwind CSS, shadcn/ui components
State Management: Zustand
Animation: React Spring

Getting Started

Prerequisites

Node.js 18.17+ (the minimum version supported by Next.js 14)
npm or yarn

Installation

# Clone the repository
git clone https://github.com/noahgsolomon/PPO-Bunny.git

# Navigate to project directory
cd PPO-Bunny

# Install dependencies
npm install
# or
yarn install

# Run the development server
npm run dev
# or
yarn dev

Open http://localhost:3000 to see the application.

Build for Production

npm run build
npm start

How It Works

The Environment

Grid World: 25x25 tile-based environment
Agents: Bunny agents start from random positions
Goal: Find the pink reward tile while avoiding hologram tiles
Obstacles:
- Level 1: Static hologram tiles (instant failure)
- Level 2: Moving hologram tiles + vision-based navigation
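
The Level 1 rules above can be sketched in a few lines of Python, the language used by the training scripts. This is a minimal illustration and not the project's code: the GridWorld class, the action-index mapping, the outcome labels, and the choice to clamp movement at the grid edge are all assumptions made for the example.

```python
import random
from dataclasses import dataclass, field

GRID_SIZE = 25  # 25x25 grid, as described above

# Assumed mapping from action index to (dx, dy); the README only names the directions.
ACTIONS = {0: (0, -1), 1: (0, 1), 2: (-1, 0), 3: (1, 0)}  # up, down, left, right


@dataclass
class GridWorld:
    """Hypothetical Level 1 environment with static hologram tiles."""

    goal: tuple[int, int]
    holograms: set[tuple[int, int]]
    agent: tuple[int, int] = (0, 0)
    rng: random.Random = field(default_factory=random.Random)

    def reset(self) -> tuple[int, int]:
        """Place the agent on a random tile that is neither the goal nor a hologram."""
        free = [
            (x, y)
            for x in range(GRID_SIZE)
            for y in range(GRID_SIZE)
            if (x, y) != self.goal and (x, y) not in self.holograms
        ]
        self.agent = self.rng.choice(free)
        return self.agent

    def step(self, action: int) -> tuple[tuple[int, int], str]:
        """Move one tile and return the new position with an outcome label."""
        dx, dy = ACTIONS[action]
        # Clamp to the grid so the agent cannot leave it (assumed behaviour).
        x = min(max(self.agent[0] + dx, 0), GRID_SIZE - 1)
        y = min(max(self.agent[1] + dy, 0), GRID_SIZE - 1)
        self.agent = (x, y)
        if self.agent in self.holograms:
            return self.agent, "failed"  # hologram tiles end the episode
        if self.agent == self.goal:
            return self.agent, "reached_goal"
        return self.agent, "running"
```

A multi-agent run like the one in the demo could create one such object per bunny and call step on each of them every tick. Level 2 would additionally need to update the hologram positions between steps.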

The AI

The bunnies use PPO (Proximal Policy Optimization) to learn optimal policies:

State Space: Agent position, target position, distance to goal (+ vision in Level 2)
Action Space: 4 discrete actions (up, down, left, right)
Reward Structure: Positive reward for reaching the goal, negative for hitting obstacles
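
One way the state and reward described above might be encoded is sketched below. The function names, the normalisation to the 0..1 range, the 3x3 vision window, and the reward magnitudes (+1, -1, 0) are assumptions for illustration; the README only states which quantities are observed and the sign of each reward.

```python
import math

GRID_SIZE = 25  # 25x25 grid, as described in "The Environment"


def build_observation(agent, goal, holograms=None, vision_radius=1):
    """Return a flat list of floats: agent position, target position, distance to goal.

    If `holograms` is given, append a Level 2 style vision window: one value per
    neighbouring tile, 1.0 when the tile is a hologram or off the grid, else 0.0.
    """
    ax, ay = agent
    gx, gy = goal
    scale = GRID_SIZE - 1
    max_dist = math.hypot(scale, scale)  # longest possible distance on the grid
    obs = [
        ax / scale,
        ay / scale,
        gx / scale,
        gy / scale,
        math.hypot(gx - ax, gy - ay) / max_dist,
    ]
    if holograms is not None:
        for dy in range(-vision_radius, vision_radius + 1):
            for dx in range(-vision_radius, vision_radius + 1):
                if dx == 0 and dy == 0:
                    continue  # skip the agent's own tile
                x, y = ax + dx, ay + dy
                off_grid = not (0 <= x < GRID_SIZE and 0 <= y < GRID_SIZE)
                obs.append(1.0 if off_grid or (x, y) in holograms else 0.0)
    return obs


def reward(agent, goal, holograms):
    """Positive at the goal, negative on a hologram, zero otherwise (assumed values)."""
    if agent in holograms:
        return -1.0
    if agent == goal:
        return 1.0
    return 0.0
```

With these assumptions a Level 1 observation has 5 values and a Level 2 observation has 13 (5 plus the 8 neighbouring tiles). The real models may use a different layout, so check the training scripts before feeding such a vector to the exported ONNX files.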

Model Details

Architecture: Actor-Critic neural network
Training: Python implementation with stable-baselines3
Deployment: ONNX models running in-browser via ONNX Runtime Web
Hyperparameters: See in-app "Model Details" for complete configuration
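
For readers new to PPO, the core of the algorithm is its clipped surrogate objective, which limits how far a single update can move the policy away from the one that collected the data. The sketch below computes that objective in plain Python for a small batch. It is a teaching example rather than the project's training code (which uses stable-baselines3), and the clip value of 0.2 is a commonly used default, not necessarily the one used here.

```python
import math


def clipped_surrogate(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch (a quantity to maximise).

    Each sample contributes min(r * A, clip(r, 1 - eps, 1 + eps) * A), where
    r = exp(new_log_prob - old_log_prob) is the probability ratio between the
    updated policy and the policy that collected the data.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        ratio = math.exp(new_lp - old_lp)
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)
```

When the ratio is 2 and the advantage is positive, the sample's contribution is capped at 1.2 times the advantage, so there is no incentive to push the probability of that action further in one update. With a negative advantage the unclipped term is the smaller one and is kept, which preserves the penalty.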

Project Structure

├── app/
│   ├── (game)/
│   │   ├── page.tsx          # Main game page
│   │   ├── LevelOne.tsx      # Level 1 implementation
│   │   ├── LevelTwo.tsx      # Level 2 implementation
│   │   ├── Player.tsx        # Player bunny component
│   │   ├── runModel.ts       # ONNX inference logic
│   │   └── store/            # Zustand stores
│   └── components/           # UI components
├── public/
│   └── models/               # 3D models and ONNX files
└── train/                    # Python training scripts

Training Your Own Model

The train/ directory contains Python scripts for training new PPO models:

cd train
python ppo.py  # Train the model
python torch2onnx.py  # Convert to ONNX format

License

This project is licensed under the MIT License - see the LICENSE file for details.
