Stanford AA289/ENGR319 - Robotics and Autonomous Systems Seminar - Stanford Seminar - Toward Scalable Autonomy - Aleksandra Faust

5.0 (1)

12 learners

What you'll learn

This course includes

100.5 hours of video
Certificate of completion
Access on mobile and TV

Summary

Keywords

AA289 Robotics Autonomous Systems StanfordOnline Aleksandra Faust Scalable Autonomy Autonomy Reinforcement learning RL Google Brain Google Brain Research Stanford

Full Transcript

Aleksandra Faust is a Senior Staff Research Scientist and Reinforcement Learning research team co-founder at Google Brain Research. January 28, 2022 Reinforcement learning is a promising technique for training autonomous systems that perform complex tasks in the real world. However, training reinforcement learning agents is a tedious, human-in-the-loop process, requiring heavy engineering and often resulting in suboptimal results. In this talk we explore two main directions toward scalable reinforcement learning. First, we discuss several methods for zero-shot sim2real transfer for mobile and aerial navigation, including visual navigation and fully autonomous navigation on a severely resource constrained nano UAV. Second, we observe that the interaction between the human engineer and the agent under training as a decision-making process that the human agent performs, and consequently automate the training by learning a decision making policy. With that insight, we focus on zero-shot generalization and discuss learning RL loss functions and a compositional task curriculum that generalize to unseen tasks of evolving complexity. We show that across different applications, learning-to-learn methods improve reinforcement learning agents generalization and performance, and raise questions about nurture vs nature in training autonomous systems. View the full playlist: https://www.youtube.com/playlist?list=PLoROMvodv4rMeercb-kvGLUrOq4HR6BZD 0:00 Introduction 3:54 How to train goal reaching policies? 17:46 What about resource-constrained robots? 28:13 Training RL Agents is a (Sequential) Decision Making Problem 29:45 Learning Loss Functions 29:51 Evolving RL Algorithms 36:06 Generative Curriculum for Compositional Tasks 36:36 Autonomous web navigation 39:37 Learning to Navigate Web 43:19 Compositional Design of Environments (CODE) 46:58 Does it work in reality? RealED -- CODE w/ real primitives. 48:54 Better Generalization 51:01 Autonomous Scalable Systems Future #reinforcementlearning #autonomousrobot

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Welcome