GitHub - poudel-bibek/NFQ_Golf_Cart: Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning

Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning

Steering wheel oscillating between extreme left and extreme right

Green= Goal states, Blue= Initial states, Red= Forbidden states

1. Installation and Code Dependencies

Built on Python 3.9.16, Install NVIDIA Drivers + CUDA if you want to use GPU.

Intstall requirements:

pip install -r requirements.txt

2. Understanding the Code:

NFQ_main: Central file integrating all components.

NFQ_Env: Simulates environment; hardware execution separate from XXX.

NFQ_model: Defines the neural network (Q-function approximator).

NFQ_Agent: Manages NFQ algorithm functions, supervised data generation, and model training.

Steerbox_Env: Handles position initialization strategies and environment interaction.

Steerbox_NFQ: Covers additional NFQ functions, goal (hint-to-goal) pattern sets, experience collection, and reward definition.

Utils folder: Provides utilities for generating plots and exploration strategies.

Hardware_Data/ Simulation_Data folders: Stores session data for runs.

Hardware_Code folder: Code to interface with the Arduino hardware and train hardware controller.

3. Running the Code:

# run on default arguments
python NFQ_main.py 

# run on experiment related arguments
python NFQ_main.py --num_params [AAA] --hint_size [BBB] --exploration [CCC] --reset_freq [DDD] --pos_init [EEE]

# To save the files of the run
python NFQ_main.py --save_to_file

4. Experiments:

Five experiments are performed:

#	Experiment	Options
1	Parameter count of neural network	39, 61, 91, 121, 171

2	Size of Hint-to-goal transitions	1%, 2%, 5%, 10%, 20%

3	Exploration strategy	No exploration,
		ε-greedy constant 2%,
		ε-greedy constant 10%,
		Linearly decaying ε-greedy,
		Exponentially decaying ε-greedy

4	Neural network reset frequency	No reset, reset every: 1, 10, 50, 100 episodes

5	Steering wheel position	Gaussian: mean=0, variance=0.02,
	initialization	Gaussian: mean=0, variance=0.09,
		Uniform: range [-0.5, 0.5],
		Linearly expanding range,
		Exponentially expanding range

Choose experiments from the arguments in NFQ_main.py

Results of experiments (each averaged over 5 simulation runs)

Cite

@inproceedings{poudel2022learning,
  title={Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning},
  author={Poudel, Bibek and Watson, Thomas and Li, Weizi},
  booktitle={2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC)},
  pages={1248--1254},
  year={2022},
  organization={IEEE}
}

*The repo is inclusive of hardware and code contributions from: tpwrules

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
Hardware_Code		Hardware_Code
Hardware_Data		Hardware_Data
Plots		Plots
Simulation_Data		Simulation_Data
Utils		Utils
site_assets		site_assets
.gitignore		.gitignore
LICENSE		LICENSE
NFQ_Agent.py		NFQ_Agent.py
NFQ_main.py		NFQ_main.py
NFQ_model.py		NFQ_model.py
README.md		README.md
Steerbox_Env.py		Steerbox_Env.py
Steerbox_NFQ.py		Steerbox_NFQ.py
Vehicle_Env.py		Vehicle_Env.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning

1. Installation and Code Dependencies

2. Understanding the Code:

3. Running the Code:

4. Experiments:

Cite

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

poudel-bibek/NFQ_Golf_Cart

Folders and files

Latest commit

History

Repository files navigation

Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning

1. Installation and Code Dependencies

2. Understanding the Code:

3. Running the Code:

4. Experiments:

Cite

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages