Table of algorithms in the Stable Baselines repo
read moreUnicorn: Continual learning with a universal, off-policy agent
Continual learning with a universal, off-policy agent.
read moreSample-Efficient Deep RL with Generative Adversarial Tree Search
Learned dynamics model with a GAN for image generation and MCTS for planning.
read moreWorld Models
Unsupervised learning of image encoding and dynamics model.
read morePretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations.
read moreMastering the game of Go without human knowledge
AlphaGo Zero, all RL self-play.
read morePre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
Pre-train using supervised learning on human provided demonstations.
read moreEligibility Traces
Notes on using Eligibility Traces with neural networks
read moreReinforcement Learning with Unsupervised Auxiliary Tasks
Increase speed of a Reinforcement Learning system with auxiliary task.
read moreReinforcement Learning: An Introduction (2nd Edition)
In-progress second edition of an RL textbook.
read moreDeep Reinforcement Learning with Double Q-learning
Improved Q-value estimation by reducing overestimates of Deep Q-networks.
read moreHuman-level control through deep reinforcement learning
One of the first deep reinforcement learning papers.
read moreOff-Policy Actor-Critic
Off-Policy AC with linear state features. Includes elegibility traces.
read more
