NYU Deep Learning SP21 - 10 – Self / cross, hard / soft attention and the Transformer

5.0 (3)

21 learners

What you'll learn

This course includes

47.3 hours of video
Certificate of completion
Access on mobile and TV

Summary

Keywords

PyTorch NYU Yann LeCun Deep Learning neural networks

Full Transcript

Course website: http://bit.ly/DLSP21-web Playlist: http://bit.ly/DLSP21-YouTube Speaker: Alfredo Canziani Chapters 00:00 – Welcome to class 00:15 – Listening to YouTube from the terminal 00:36 – Summarising papers with @Notion 01:45 – Reading papers collaboratively 03:15 – Attention! Self / cross, hard / soft 06:44 – Use cases: set encoding! 12:10 – Self-attention 28:45 – Key-value store 29:32 – Queries, keys, and values → self-attention 39:49 – Queries, keys, and values → cross-attention 45:27 – Implementation details 48:11 – The Transformer: an encoder-predictor-decoder architecture 54:59 – The Transformer encoder 56:47 – The Transformer “decoder” (which is an encoder-predictor-decoder module) 1:01:49 – Jupyter Notebook and PyTorch implementation of a Transformer encoder 1:10:51 – Goodbye :)

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Related Courses

Welcome

NYU Deep Learning SP21 - 10 – Self / cross, hard / soft attention and the Transformer

What you'll learn

This course includes

Summary

Keywords

Full Transcript

Continue this lesson in the app

Related Courses

Machine Learning with Scikit-learn, PyTorch & Hugging Face | Free Preview

Deep Learning with PyTorch Course - December 2020

Deep Learning with PyTorch Live Course

Curso de Deep learning | Aprenda a construir redes neurais com PyTorch

FAQs