Robot Learning 2025: Foundational Models for Robotics and Scaling DeepRL - RobotLearning: Scaling Deep Q-Learning Part2

4.0 (3)

32 learners

What you'll learn

This course includes

34.5 hours of video
Certificate of completion
Access on mobile and TV

Summary

Keywords

robotics foundational models deep learning q-learning

Full Transcript

I discussed the challenges of training a Q-function when using deep learning to maintain contractive learning, highlighting the instability caused by updates that affect both the predicted and target Q-values, leading to potential divergence. To address this, I explained the concept of a target network, which is a delayed copy of the Q-network used to stabilize the learning process by keeping the target values fixed for a period. I also covered the issue of overestimation in Q-learning due to the maximization operation and introduced double Q-learning as a solution, where the online Q-function selects the best action, and the target network evaluates it, reducing overestimation. I then delved into the "deadly triad" of off-policy learning, bootstrapping, and function approximation, emphasizing the difficulties in combining these three elements. Finally, I briefly discussed the use of n-step returns to reduce bias and improve training. I then transitioned into discussing more modern applications of Q-learning, specifically highlighting the QT-Opt algorithm for robotic grasping, which uses multiple robot arms and a cross-entropy method for continuous action spaces, and the PQ-N algorithm which aims to reduce the need for target networks and replay buffers.

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Related Courses

Welcome

Robot Learning 2025: Foundational Models for Robotics and Scaling DeepRL - RobotLearning: Scaling Deep Q-Learning Part2

What you'll learn

This course includes

Summary

Keywords

Full Transcript

Continue this lesson in the app

Related Courses

Lecture Collection | Introduction to Robotics

Introduction to Robotics | IIT Madras

Robotics

Robotics by Prof. D K Pratihar

FAQs