Reinforcement Learning for F1TENTH Racing
The implementation has been tested with Python 3.9 under Ubuntu 20.04.
Installation:
- Clone this repo.
- Then initialize the submodules `f1tenth_gym` and `f1tenth_racetracks`:
  - `git submodule init`
  - `git submodule update`
- Install `f1tenth_gym`:
  - `cd f1tenth_gym`
  - `pip install --user -e gym`
- Install the requirements: `pip install -r requirements.txt`
- Run a sample training: `python train.py --track melbourne --algo ppo --reward min_action -include_velocity`
  This will run a default training configuration (20K steps, approx. 7 min) and save the logs.
- Offline evaluation of a trained model on a different track: `python test.py --track nuerburgring --checkpoint <path_to_logdir>/models/best_model.zip --n_episodes 1`
Observation space:
- 2D map from lidar scan + velocity
- 2D map from lidar scan + max frame-aggregation (as in Atari)
- 2D map from lidar scan + stack frame-aggregation (on the channel dimension)
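
To illustrate the stack frame-aggregation variant, here is a minimal sketch of a gym observation wrapper that keeps the last k occupancy maps stacked on the channel dimension. The wrapper name, the (H, W) map shape, the [0, 1] value range, and the pre-0.26 gym API are assumptions for this example, not the repository's actual implementation.

```python
import numpy as np
import gym
from gym import spaces


class StackMapFrames(gym.ObservationWrapper):
    """Illustrative sketch (hypothetical name): stack the last k lidar occupancy maps
    on the channel dimension. Assumes the wrapped env returns a single (H, W) map in
    [0, 1] and that reset() returns only the observation (pre-0.26 gym API)."""

    def __init__(self, env, k=4):
        super().__init__(env)
        self.k = k
        h, w = env.observation_space.shape  # assumed single-frame (H, W) map
        self.observation_space = spaces.Box(low=0.0, high=1.0, shape=(k, h, w), dtype=np.float32)
        self._frames = np.zeros((k, h, w), dtype=np.float32)

    def reset(self, **kwargs):
        obs = self.env.reset(**kwargs)
        self._frames[:] = obs  # fill the whole stack with the first map
        return self._frames.copy()

    def observation(self, obs):
        # drop the oldest map and append the newest one
        self._frames = np.roll(self._frames, shift=-1, axis=0)
        self._frames[-1] = obs
        return self._frames.copy()
```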
Action space (defined in `racing_rl/envs/single_agent_env.py:action_space`):
- only steering: steering +/-0.41 rad, fixed speed = 2 m/s
- both controls: steering +/-0.41 rad, speed in [0, 10] m/s
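
For illustration, the two variants above can be written as `gym.spaces.Box` definitions with the ranges listed here; this is a sketch, not necessarily how `single_agent_env.py` constructs them.

```python
import numpy as np
from gym import spaces

# "only steering": the agent commands steering only; speed is kept fixed at 2 m/s elsewhere.
steering_only_space = spaces.Box(low=-0.41, high=0.41, shape=(1,), dtype=np.float32)

# "both controls": steering in [-0.41, 0.41] rad and speed in [0, 10] m/s.
full_control_space = spaces.Box(
    low=np.array([-0.41, 0.0], dtype=np.float32),
    high=np.array([0.41, 10.0], dtype=np.float32),
    dtype=np.float32,
)
```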
Reward definitions:
- progress (`racing_rl/rewards/progress_based.py`): the reward is proportional to the agent's progress w.r.t. the centerline, optionally with a penalty for collisions
- min_action (`racing_rl/rewards/min_action.py`): the reward is inversely proportional to the actions' deviation from the mid-value of the action domain (steering = 0.0 rad, speed = 5.0 m/s)
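
One common way to realize the min_action idea is a reward that starts at 1 and decreases with the normalized deviation from the mid action. The sketch below is an illustration under that assumption (including the half-ranges used for normalization), not the code in `racing_rl/rewards/min_action.py`.

```python
import numpy as np

# Mid-values of the action domain quoted above; the half-ranges used for
# normalization are assumptions for this example.
MID_ACTION = np.array([0.0, 5.0])    # [steering (rad), speed (m/s)]
HALF_RANGE = np.array([0.41, 5.0])   # half-width of each action dimension


def min_action_reward(action):
    """Return a reward in [0, 1] that is highest for the mid action and decreases
    with the normalized deviation from it (illustrative, not the repo's code)."""
    deviation = np.abs(np.asarray(action, dtype=float) - MID_ACTION) / HALF_RANGE
    return float(1.0 - np.mean(np.clip(deviation, 0.0, 1.0)))
```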
TODO:
- add render-mode `rgb_array` to store video during the training process
- `track.get_progress` does not correctly manage the crossing of the starting line (from 0.99 to 1.99); see the wrap-handling sketch after this list
- find a stable problem configuration w.r.t. the following questions:
  - what is the minimal observation space? (ideally only lidar-based)
  - what is the least-restrictive action space? (ideally constrained only by action ranges)
  - what is a simple reward that enables good training?
- refactor the code structure, e.g., `make_base_env` is getting messy with a lot of wrappers
- tune the base algorithms (e.g., with Optuna)
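
Regarding the `track.get_progress` item above, a common fix is to compute progress differences modulo one lap, so crossing the starting line does not produce a large jump in the per-step progress. This is a hedged sketch under the assumption that progress is a fraction of a lap in [0, 1); the helper name is hypothetical and not part of the repository.

```python
def progress_delta(prev_progress, curr_progress):
    """Lap-aware progress difference in [-0.5, 0.5) (illustrative sketch).

    Both arguments are fractions of a lap in [0, 1). When the car crosses the
    starting line, curr_progress wraps from ~0.99 back to ~0.01; taking the
    difference modulo one lap keeps the step-wise delta small and positive
    instead of a ~-0.98 jump."""
    return (curr_progress - prev_progress + 0.5) % 1.0 - 0.5
```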