SAC Torch MLFLow Vibe Project

A vibe coded implementation of Soft Actor-Critic (SAC) using PyTorch and MLFlow. This project demonstrates how transformer architectures can be integrated with SAC to effectively capture long-term dependencies in reinforcement learning tasks.

Key Features

Torch-Powered: Every component leverages Torch's optimized workflows for data collection, preprocessing, and model training
Transformer-Enhanced RL: Novel integration of transformer architecture with SAC for superior temporal reasoning
MLflow Integration: Complete experiment tracking with parameter logging and model versioning
Modular Design: Clean separation of environment, model, and training components for easy extension

Technical Insights

Rapid Development: This implementation was developed in approximately 12 hours as a proof-of-concept, demonstrating rapid prototyping capability while maintaining a clean architecture. It showcases the ability to quickly deliver working machine learning systems.
Production Readiness: While built as a rapid prototype, the codebase follows a modular design with clear separation between environment, models, and training components. If continued, future iterations will focus on implementing proper logging with configurable verbosity levels and comprehensive exception handling.

Installation

Clone the repository:

gh repo clone MichaelsEngineering/sac-agent-demo
cd sac-agent-demo

Create an env and install the required dependencies:

python -m venv sac-env
source sac-env/bin/activate  # Linux/macOS
# Windows: sac-env\Scripts\activate  
pip install -e .
# Or, for development, include additional dev dependencies
pip install -e ".[dev]"

Track Experiments:

mlflow ui # Then open in browser

Run main:

   python src/main.py

Usage

Run the project
```
python src/main.py
```

Project Structure

├── .idea/
├── src/
│   ├── data/
│   │   ├── simple.json 
│   └── deployment/   
│   ├── environments/
│   │   ├── environment_setup.py
│   ├── models/
│   │   ├── actor.py  
│   │   └── critic.py
│   ├── training/
│   │   ├── replay_buffer.py               
│   │   ├── train_sac.py
│   ├── training/
|   |   └── load_json.py
│   └── main.py
├── tests/
│   └── test_model.py
├── .GITIGNORE
├── LICENSE.md
├── pip_requirements.txt
├── pyproject.toml
└── README.md

Roadmap for Contributions

Upgrade Data Loading for CI/CD engineering best practices
Add support for continuous action spaces
Enhance MLflow dashboards
Containerize with Docker for reproducible deployment

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Table of Contents

SAC Torch MLFLow Vibe Project

Key Features

Technical Insights

Installation

Usage

Project Structure

Roadmap for Contributions

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.idea		.idea
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

License

MichaelsEngineering/sac-agent-demo

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

SAC Torch MLFLow Vibe Project

Key Features

Technical Insights

Installation

Usage

Project Structure

Roadmap for Contributions

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages