Reinforcement learning algorithms for spiking networks and artificial neural networks.
- Deep deterministic policy gradients with hindsight experience replay
- Stochastic policy gradient with hindsight experience replay
- Biased hindsight policy gradient
- Proximal Policy Optimization on GPU
- Covariance Matrix Adaptation Evolutionary Strategy
- Distributed proximal policy optimization