This project applies Reinforcement Learning (RL) to optimize transistor widths and topologies in an analog Low-Dropout (LDO) voltage regulator, focusing on minimizing power consumption and output voltage error.
The RL environment models LDO design as a Markov Decision Process (MDP) with discrete state and action spaces. Two popular policy-gradient methods, Proximal Policy Optimization (PPO) and Advantage Actor-Critic (A2C), are used for training and evaluation.
All SPICE simulations are precomputed and stored in a lookup table for fast training without invoking SPICE at runtime.
.
├── environ.py # Custom Gymnasium-compatible RL environment
├── train_a2c.py # Training script for the A2C agent
├── train_ppo.py # Training script for the PPO agent
├── test_a2c.py # Evaluation script for A2C-trained agent
├── test_ppo.py # Evaluation script for PPO-trained agent
├── merged_simulation_results_0_159999_full.csv # Precomputed SPICE results (160,000 configs)
├── best_rl_model_a2c.zip # Best Trained A2C Model
├── best_rl_model_ppo.zip # Best Trained PPO Model
├── README.md # This file
└── requirements.txt # Python dependencies
State Space (8-D):
- w1 to w4: Transistor widths (1–10 µm)
- c1 to c4: Topology (1 = single, 2 = cascoded)
Action Space (16 actions):
- Increment/decrement each of the 8 parameters (width or config)
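One plausible way to realize this action space is to map action `k` to parameter `k // 2`, with even actions incrementing and odd actions decrementing. This is an illustrative sketch only; the actual mapping and clipping rules live in `environ.py`.

```python
# Hypothetical decoding of the 16 discrete actions into 8-D state updates.
# State layout assumed: [w1, w2, w3, w4, c1, c2, c3, c4].
W_MIN, W_MAX = 1, 10   # transistor width bounds in um
C_MIN, C_MAX = 1, 2    # topology codes: 1 = single, 2 = cascoded

def apply_action(state, action):
    """Return a new 8-D state with one parameter nudged by +/-1, clipped to bounds."""
    idx = action // 2                      # which of the 8 parameters to change
    delta = 1 if action % 2 == 0 else -1   # even action = increment, odd = decrement
    lo, hi = (W_MIN, W_MAX) if idx < 4 else (C_MIN, C_MAX)
    new_state = list(state)
    new_state[idx] = max(lo, min(hi, new_state[idx] + delta))
    return new_state

# Example: action 4 increments w3 (parameter index 2)
print(apply_action([5, 5, 5, 5, 1, 1, 2, 2], 4))  # -> [5, 5, 6, 5, 1, 1, 2, 2]
```

Clipping at the bounds keeps every action legal in every state, so the agent never needs an explicit action mask.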
Reward Function:
`reward = -(error + power) * 1e6`
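The formula above translates directly into code. The units here are assumptions (error in volts, power in watts); the 1e6 scale factor simply brings typical rewards into a numerically convenient range for the RL libraries.

```python
def compute_reward(error, power):
    """Negative combined cost: lower error and lower power -> higher reward."""
    return -(error + power) * 1e6

print(compute_reward(0.5, 0.25))  # -> -750000.0
```

Because both terms enter with equal weight, the agent trades off voltage error against power consumption one-for-one in these (assumed) units.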
`merged_simulation_results_0_159999_full.csv` contains SPICE simulation results for all 160,000 configurations. During training, these results are indexed by an in-memory Python dictionary for O(1) lookup.
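The dictionary-based lookup could be built along these lines. The column names (`w1`..`w4`, `c1`..`c4`, `error`, `power`) are assumptions for illustration; the real CSV schema may differ, and the project loads the full file rather than this inline sample.

```python
import csv
import io

# Tiny inline stand-in for merged_simulation_results_0_159999_full.csv
sample = """w1,w2,w3,w4,c1,c2,c3,c4,error,power
5,5,5,5,1,1,2,2,0.012,0.0004
6,5,5,5,1,1,2,2,0.011,0.0005
"""

# Key each row by its 8-D configuration tuple for O(1) retrieval.
lookup = {}
for row in csv.DictReader(io.StringIO(sample)):
    key = tuple(int(row[k]) for k in ("w1", "w2", "w3", "w4", "c1", "c2", "c3", "c4"))
    lookup[key] = (float(row["error"]), float(row["power"]))

# A training step then fetches precomputed results without invoking SPICE:
error, power = lookup[(5, 5, 5, 5, 1, 1, 2, 2)]
```

A dict keyed on the configuration tuple avoids any per-step DataFrame filtering, which is what keeps training fast.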
```bash
pip install -r requirements.txt
```
Required libraries include:
- gymnasium
- stable-baselines3
- pandas
- numpy
- torch
- ltspice (if needed for simulation validation)
Note: Training is already done if the pretrained models (`best_rl_model_a2c.zip`, `best_rl_model_ppo.zip`) are provided.
```bash
python train_a2c.py   # Optional - for A2C
python train_ppo.py   # Optional - for PPO
```
```bash
python test_a2c.py    # Runs the A2C policy from a default or user-defined state
python test_ppo.py    # Runs the PPO policy and logs performance
```
- Fast training via SPICE result caching
- Discrete MDP modeling for analog design
- Reproducible results using fixed random seed
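Reproducibility hinges on seeding every source of randomness before training. A minimal illustration with Python's own RNG (the actual scripts presumably also seed numpy, torch, and the Stable Baselines3 algorithms; `SEED = 42` is an assumed value):

```python
import random

SEED = 42  # assumed seed value for illustration

random.seed(SEED)
run_a = [random.random() for _ in range(3)]

random.seed(SEED)
run_b = [random.random() for _ in range(3)]

print(run_a == run_b)  # -> True: the same seed reproduces the same draws
```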
- Schulman et al., "Proximal Policy Optimization Algorithms"
- Stable Baselines3
- LTspice Simulator
- Razavi, B. "Design of Analog CMOS Integrated Circuits"
- Komaragiri Sai Vishwanath Rohit
- Ghantasala Sai Narayana Sujit
- Akula Nishith