Paper notes Reinforcement Learning Policy Gradient Methods DQN Multi-agent Learning Guided Policy Search Imitation Learning Transfer Learning NLP Exploration Robot learning Memory Navigation Evolutionary Environments OpenAI Gym Classic control VizDoom Mujoco (select envs) OpenAI Universe Flash games World of bits Mujoco Labyrinth Malmo Gazebo Gym gazebo OpenSim Euro Truck Simulator Airsim More resources Berkeley Deep RL Robot Learning from Demonstration and Interaction More papers: https://github.com/andrewliao11/Deep-Reinforcement-Learning-Survey/blob/master/Reinforcement-Learning-Papers.md CILVR Reading Group David Silver @ UCL Nando de Freitas @ UBC John Schulman @ MLSS Marcello Restelli Lecture Pieter Abbeel @ NIPS 2016 Nuts and bolts of RL Sergey Levine RL Seminar