Grokking Deep Reinforcement Learning. You'll explore, discover, and learn as you lock in the ins and outs of reinforcement learning… You'll see how algorithms function and learn to develop your own DRL agents using evaluative feedback. Miguel Morales combines annotated Python code with intuitive explanations to explore Deep Reinforcement Learning … https://www.manning.com/books/grokking-deep-reinforcement-learning
Grokking Deep Learning. In his engaging style, seasoned deep learning expert Andrew Trask shows you the science under the hood, … This repository accompanies the book "Grokking Deep Learning", available here.
Introduction to deep reinforcement learning
Mathematical foundations of reinforcement learning
Balancing the gathering and utilization of information
Achieving goals more effectively and efficiently
Introduction to value-based deep reinforcement learning
Introduction to policy-based deep reinforcement learning
This book is widely considered to be the "Bible" of Deep Learning.
Grokking Deep Learning is just over 300 pages long. To get to those 300 pages, though, I wrote at least twice that number.
Implementation of more effective and efficient reinforcement learning algorithms
Implementation of a value-based deep reinforcement learning baseline
Implementation of "classic" value-based deep reinforcement learning methods
Implementation of main improvements for value-based deep reinforcement learning methods
Implementation of classic policy-based and actor-critic deep reinforcement learning methods:
  Policy Gradients without value function and Monte-Carlo returns (REINFORCE)
  Policy Gradients with value function baseline trained with Monte-Carlo returns (VPG)
  Asynchronous Advantage Actor-Critic (A3C)
  [Synchronous] Advantage Actor-Critic (A2C)
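The policy-gradient entries above differ mainly in how each action is weighted: REINFORCE uses the Monte-Carlo return of the episode, while VPG subtracts a value-function baseline from that return. The following is a minimal sketch of that arithmetic only, not the book's or the repository's code. The function names `discounted_returns` and `advantages`, the example rewards, and the baseline values are all assumptions made for illustration.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one finished episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed for the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def advantages(returns: List[float], baselines: List[float]) -> List[float]:
    """Subtract a per-step baseline (for example a value estimate) from the returns."""
    return [g - b for g, b in zip(returns, baselines)]


# Hypothetical three-step episode with a reward of 1.0 at every step.
episode_rewards = [1.0, 1.0, 1.0]
G = discounted_returns(episode_rewards, gamma=0.5)

# Hypothetical value estimates standing in for a learned value function.
A = advantages(G, baselines=[1.0, 1.0, 1.0])
```

In a REINFORCE-style update the log-probability of each action would be scaled by the matching entry of `G`; in a VPG-style update it would be scaled by the matching entry of `A`, which typically lowers the variance of the gradient estimate.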
This common pattern is the foundation of deep reinforcement learning: building machine learning systems that explore and learn based on the responses of the environment.
Note: At the moment, only running the code from the docker container (below) is supported.
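The explore-and-learn pattern described above can be sketched with a toy example. This is an illustrative assumption, not code from the book: the `TwoArmedBandit` environment, the `run_agent` function, and all parameter values are hypothetical. The agent mostly exploits its current estimates, occasionally explores at random, and updates its estimates from the rewards the environment returns.

```python
import random
from typing import List, Tuple


class TwoArmedBandit:
    """Toy environment (hypothetical): action 1 always pays 1.0, action 0 pays 0.0."""

    def step(self, action: int) -> float:
        return 1.0 if action == 1 else 0.0


def run_agent(env: TwoArmedBandit, steps: int = 500, epsilon: float = 0.1,
              seed: int = 0) -> Tuple[List[float], List[int]]:
    """Epsilon-greedy agent that learns action values from evaluative feedback."""
    rng = random.Random(seed)
    q = [0.0, 0.0]     # estimated value of each action
    counts = [0, 0]    # how many times each action was tried
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(2)      # explore: gather information
        else:
            action = q.index(max(q))       # exploit: use current estimates
        reward = env.step(action)          # the environment's response
        counts[action] += 1
        # Incremental average of the rewards observed for this action.
        q[action] += (reward - q[action]) / counts[action]
    return q, counts


q_values, action_counts = run_agent(TwoArmedBandit())
best_action = q_values.index(max(q_values))
```

With purely greedy behavior (`epsilon=0`) the agent in this sketch would never try action 1, since both estimates start at zero and ties go to action 0. The small amount of random exploration is what lets it discover the better action.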
Docker allows for creating a single environment that is more likely to work on all systems.
To install docker, I recommend a web search for "installing docker on <your os here>".
To run the code on a GPU, you also need to install nvidia-docker.
NVIDIA Docker allows for using a host's GPUs inside docker containers.
After you have docker (and nvidia-docker if using a GPU) installed, follow the three steps below.
Reinforcement Learning in AirSim
Below we describe how to implement DQN in AirSim using an OpenAI gym wrapper around the AirSim API and the stable baselines implementations of standard RL algorithms.