LBRY Block Explorer

LBRY Claims • reinforcement-learning-with-augmented

2d2ac211f3bb432d0de33a5ca0497ed777df3daa

Published By
Created On
1 Mar 2021 12:11:33 UTC
Transaction ID
Cost
Safe for Work
Free
Yes
Reinforcement Learning with Augmented Data (Paper Explained)
This ONE SIMPLE TRICK can take a vanilla RL algorithm to achieve state-of-the-art. What is it? Simply augment your training data before feeding it to the learner! This can be dropped into any RL pipeline and promises big improvements across the board.

Paper: https://arxiv.org/abs/2004.14990
Code: https://www.github.com/MishaLaskin/rad

Abstract:
Learning from visual observations is a fundamental yet challenging problem in reinforcement learning (RL). Although algorithmic advancements combined with convolutional neural networks have proved to be a recipe for success, current methods are still lacking on two fronts: (a) sample efficiency of learning and (b) generalization to new environments. To this end, we present RAD: Reinforcement Learning with Augmented Data, a simple plug-and-play module that can enhance any RL algorithm. We show that data augmentations such as random crop, color jitter, patch cutout, and random convolutions can enable simple RL algorithms to match and even outperform complex state-of-the-art methods across common benchmarks in terms of data-efficiency, generalization, and wall-clock speed. We find that data diversity alone can make agents focus on meaningful information from high-dimensional observations without any changes to the reinforcement learning method. On the DeepMind Control Suite, we show that RAD is state-of-the-art in terms of data-efficiency and performance across 15 environments. We further demonstrate that RAD can significantly improve the test-time generalization on several OpenAI ProcGen benchmarks. Finally, our customized data augmentation modules enable faster wall-clock speed compared to competing RL techniques. Our RAD module and training code are available at this https URL.

Authors: Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher
...
https://www.youtube.com/watch?v=to7vCdkLi4s
Author
Content Type
Unspecified
video/mp4
Language
English
Open in LBRY

More from the publisher

Controlling
VIDEO
CAN W
Controlling
VIDEO
DYNAM
Controlling
VIDEO
AUTHO
Controlling
VIDEO
POPUL
Controlling
VIDEO
IMPLI
Controlling
VIDEO
[NEWS
Controlling
VIDEO
WORLD
Controlling
VIDEO
THE W
Controlling
VIDEO
REINF