Course Hive
Search

Welcome

Sign in or create your account

Continue with Google
or
RobotLearning: Scaling Offline Reinforcement Learning
Play lesson

Robot Learning 2025: Foundational Models for Robotics and Scaling DeepRL - RobotLearning: Scaling Offline Reinforcement Learning

4.0 (3)
32 learners

What you'll learn

This course includes

  • 34.5 hours of video
  • Certificate of completion
  • Access on mobile and TV

Summary

Keywords

Full Transcript

I started discussing offline reinforcement learning, highlighting its potential to learn from pre-existing datasets, a departure from online RL's data inefficiency and divergence issues. I emphasized the goal of training a policy from offline data without divergence, similar to supervised learning. We explored the concept of "stitching" trajectories, a unique advantage of RL, where optimal paths can be constructed from disparate data segments, leveraging the Markov property. However, I also pointed out that this is difficult to achieve in practice, especially with partial observations. We discussed model-based RL as a potential solution but acknowledged the challenges of error accumulation in long-horizon planning. I then introduced the Decision Transformer, a supervised learning approach using returns as input to generate trajectories, aiming to minimize error across the entire sequence. However, I noted its limitations in stitching and handling stochasticity. Then I discuss recent papers on adapting offlineRL methods to large transformers, how to include offline data to help improve early training performance, and how to perform offline to line RL without needing to keep around the old offline RL dataset, which is typically required.

Course Hive

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Related Courses

FAQs

Course Hive
Download CourseHive
Keep learning anywhere