GitHub - mduszyk/controlrl: Soft Actor-Critic (SAC) for Continuous Control Tasks

SAC

This project provides an implementation of the Soft Actor-Critic (SAC) algorithm for solving continuous control tasks using environments from OpenAI Gym (e.g., MuJoCo). SAC is an off-policy actor-critic algorithm that combines maximum entropy reinforcement learning with function approximation for efficient and stable training.

See: https://gymnasium.farama.org/environments/mujoco/

Run local mlflow

mlflow ui

Train

python sac_train.py
python sac_train.py --profile ant
python sac_train.py --profile humanoid

Test

python sac_eval.py
python sac_eval.py --profile ant
python sac_eval.py --profile humanoid
python sac_eval.py --profile ant --model_uri 'runs:/55db85ebf343496783f5f2b88389b604/policy_net_episode_1100'

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
docs		docs
videos		videos
.gitignore		.gitignore
README.md		README.md
environment.yaml		environment.yaml
sac.py		sac.py
sac_eval.py		sac_eval.py
sac_eval.toml		sac_eval.toml
sac_train.py		sac_train.py
sac_train.toml		sac_train.toml
stats.py		stats.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SAC

Run local mlflow

Train

Test

About

Uh oh!

Releases

Packages

Languages

mduszyk/controlrl

Folders and files

Latest commit

History

Repository files navigation

SAC

Run local mlflow

Train

Test

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages