MIT 6.S191 (2023): Recurrent Neural Networks, Transformers, and Attention

День тому

MIT Introduction to Deep Learning 6.S191: Lecture 2
Recurrent Neural Networks
Lecturer: Ava Amini
2023 Edition
For all lectures, slides, and lab materials: introtodeeplearning.com
Lecture Outline
0:00 - Introduction
3:07 - Sequence modeling
5:09 - Neurons with recurrence
12:05 - Recurrent neural networks
13:47 - RNN intuition
15:03 - Unfolding RNNs
18:57 - RNNs from scratch
21:50 - Design criteria for sequential modeling
23:45 - Word prediction example
29:57 - Backpropagation through time
32:25 - Gradient issues
37:03 - Long short term memory (LSTM)
39:50 - RNN applications
44:50 - Attention fundamentals
48:10 - Intuition of attention
50:30 - Attention and search relationship
52:40 - Learning attention with neural networks
58:16 - Scaling attention and applications
1:02:02 - Summary
Subscribe to stay up to date with new deep learning lectures at MIT, or follow us @MITDeepLearning on Twitter and Instagram to stay fully-connected!!

КОМЕНТАРІ: 283

@lonewolf-_-8634 11 місяців тому

I just can't believe how amazing the educators are and damn !! they're providing it out here for free... Hats off to the team !!

@js913 10 місяців тому

researchers are providing the content for free too

@jurycould4275 Місяць тому

Would love it, if they found mature experts on these topics instead of children.

@deepakspace Рік тому

I am a Professor and this is the best course I have found to learn about Machine learning and Deep learning....

@Rhapsody83 Рік тому

I just took a paid course in this subject matter, and this free explanation is so much more intelligible.

@sijiaxiao1557 Рік тому

agreed

@avinashdwivedi2015 9 місяців тому

Coursera machine learning specialization

@olutoki 3 місяці тому

Why do I think you are an undergraduate student 😂

@PriyanshuAman-dn5jx Місяць тому

@@olutokigenes

@tgyawali 11 місяців тому

Thank you so much MIT and instructors for making these very high quality lectures available to everyone. Students from developing countries who have aspirations to achieve something big is now possible with this type of content and information!

@geosaiofficial1070 11 місяців тому

couldn't agree more. thanks once again MIT for providing world class education.

@joxa6119 6 місяців тому

Over all videos on UKposts that explained about Transformer architecture (including the visual explanation) , this is the BEST EXPLANATION ever done. Simple, contextual, high level, step by step complexity progression. Thank you the educators and MIT!

@xvaruunx Рік тому

Best end to the lecture: “Thank you for your attention.” ❤😂

@kiarashgeraili8595 5 місяців тому

As a CS student from University of Tehran, you guys don't have any idea how much such content could be helpful and the idea that all of this is free make it really amazing. Really appreciate it Alexander and Ava. Best hops.

@pankajsinha385 11 місяців тому

One of the best lectures I have seen on Sequence Models, with crystal clear explanations! :)

@anshikajain3298 Рік тому

This is what we need in this day and age, the teaching is amazing and can be understood by people of variable intelligence. Nice work and thanks for this course.

@MrPejotah Рік тому

These are some spectacular lessons. Thank you very much for making this available.

@lazydart4117 Рік тому

Watching those MIT courses alongside course at my Uni in Poland, so grateful to be able to experience such a high quality education

@GuinessOriginal Рік тому

This girl looks so young

@ukaszkasprzak5921 Рік тому

Mogę spytać gdzie i co studiujesz ? ( jestem maturzystą i chciałbym wiedzieć gdzie w Polsce są kierunki podobnego typu )

@lazydart4117 Рік тому

@@ukaszkasprzak5921 Kognitywistyka UW Zagadnienia z AI, machine learningu i matematyki są tu omawiane obok zagadnień humanistycznych: Lingwistyka, Filozofia Umysłu, Psychologia Poznawcza etc. Radzę przejrzeć Program studiów, proste googlowanie wystarczy

@vsevolodnedora7779 Рік тому

Extremely informative, well structured and paced. A pleasure to watch and follow. Thank you.

@roy11883 11 місяців тому

Indeed commendable the way this lecture has been ordered and difficult topic like self-attention has been lucidly explained. Thanks to the instructors, really appreciated.

@hamza-325 10 місяців тому

I watched and read a lot of content about Transformers and never understood what are those three Q, K, and V vectors doing so I coulnd't understand how attention works, until today when I watched this lecture doing the analogy of UKposts search and the Iron Man picture. Now it became much much clearer! Thanks for the brilliant analogies that you are making!

@nataliameira2283 Рік тому

Thank you for this amazing content! There are many concepts discussed intuitively!

@sorover111 Рік тому

ty to MIT for giving back a little in an impactful way

@nagashayanreddy7237 9 місяців тому

Wow, Transformers, and Attention was an absolute lifesaver! 🚀🙌 The explanations were crystal clear, and I finally have a solid grasp on these concepts. This video saved me so much time and confusion. Huge thanks to the Ava for making such an informative and engaging tutorial! Can't wait to delve deeper into the world of AI and machine learning. 🤖💡

@excitingtomorrow 11 місяців тому

Your explanation of attention took me 2 revisits to this video to truly truly understand! But now when I did, my love for deep learning got stronger :)

@manojbp07 26 днів тому

oh epochs=3 rofl

@hullabulla Рік тому

These lectures are simply amazing. Thank you so much!

@TimelyTimeSeries 4 місяці тому

Came here to refresh my memory of deep learning for sequential data. I really like how Ava brings us from one algorithm to another. It makes perfect sense to me.

@gidi1899 Рік тому

This is my favorite subject :) (following is self clarification of said words that feel exaggerated) 4:08 - binary classification or filtering is a sequence of steps: - new recording - retrieval of a constant record - compare new and constant record - express a property of the compare process So, sequencing really is a property of maybe all systems. While "wave sequencing" is built on top of a Sequencer System, that repeatedly uses the "same actions" per sequence element.

@AIlysAI Рік тому

The most intutive explanation of Self Attention I have seen!

@alhassanchoubassi2441 Рік тому

Just watched lecture 1, looking forward to this and the lab coming after. Thanks for this great open resource!

@subcorney Рік тому

Are there the labs available as well?

@nitul_singha 3 місяці тому

I am trying to step into deep learning for last couple of month. This is the best thing I have found so far. Thank you sir!.

@aravindsd6839 10 місяців тому

50:30 - Attention mechnaism beautifully explained. Thank you #AvaAmini

@ViniciusVA1 Рік тому

This is incredible! Thanks a lot for this video, it’s going to help me a lot in my undergrad reasearch :)

@RNDbyvaibhav 3 місяці тому

Till Now best Course, I am doing great when I found these MIT's Lecture

@monome3038 5 місяців тому

Grateful for the efforts of MIT and its incredible professors delivering high quality free lectures. Filling every gap I have in my current classes ❤

@gemini_537 3 місяці тому

Summary by Gemini: The lecture is about recurrent neural networks, transformers, and attention. The speaker, Ava, starts the lecture by introducing the concept of sequential data and how it is different from the data that we typically work with in neural networks. She then goes on to discuss the different types of sequential modeling problems, such as text generation, machine translation, and image captioning. Next, Ava introduces the concept of recurrent neural networks (RNNs) and how they can be used to process sequential data. She explains that RNNs are able to learn from the past and use that information to make predictions about the future. However, she also points out that RNNs can suffer from vanishing and exploding gradients, which can make them difficult to train. To address these limitations, Ava introduces the concept of transformers. Transformers are a type of neural network that does not rely on recurrence. Instead, they use attention to focus on the most important parts of the input data. Ava explains that transformers have been shown to be very effective for a variety of sequential modeling tasks, including machine translation and text generation. In the last part of the lecture, Ava discusses the applications of transformers in various fields, such as biology, medicine, and computer vision. She concludes the lecture by summarizing the key points and encouraging the audience to ask questions.

@Shadowfaex 2 місяці тому

👍🌚

@user-mc5ox7cv8k 16 днів тому

You should comment on every video. Liked it.

@ngrunmann 11 місяців тому

Amazing course! Thank you so much!

@jackq2331 Рік тому

I have used LSTM and Transformer a lot, but I can still get more insights from this lecture.

@mostinho7 5 місяців тому

15:05 we have different weights matrix for generating h_t and generating y_t h_t generated using two different weights matrix, to take contribution from previous state and current input 51:20 start of attention explanation 59:30 each attention head focus on some part similar to how each filter in cnn can learn to extract specific features like horizontal lines etc

@jamesandino8346 3 місяці тому

Great Presentation @8:00 minutes it really explained a circuitry I was looking forward to exploring

@Djellowman Рік тому

She absolutely killed it. Amazing lecture(r)!

@cienciadedados 10 місяців тому

I have many years of lecturing experience and just wish I was as competent she is. Great job.

@tcoc15yuktamore4 11 місяців тому

How beautifully explained. Loved it 🥰

@umarfarooq-gc7vz 10 місяців тому

I was searching about RNN for my Thesis work.She solved it...Nice Miss:)

@ellenxiao223 Рік тому

Great lecture, learnt a lot. Thank you for sharing!

@chineduezeofor2481 11 днів тому

Thank you for this beautiful lecture.

@eee8 9 місяців тому

Great Teamwork of Alex Amini and Ava Amini.

@BruWozniak Рік тому

Simply brilliant!

@digitalnomad2196 Рік тому

amazing lecture series, thanks for sharing this knowledge with the world. I am curious if theres a lecture on LSTM'S

@tapanmahata8330 7 місяців тому

Amazing . thank you MIT.

@nikteshy9131 Рік тому

Thank you Ava Soleimany and MIT ☺😊🤗💜

@nazrinnagori 5 місяців тому

query key value pairs always put me off whener I start to learn about transformers, this time I actually finished the video. Thanks MIT

@neuralnexus340 5 місяців тому

Thanks for this amazing course

@michaelngecha9227 11 місяців тому

I always meant to watch these lectures since 2020, but something always comes up. Now, nothing is going to stop me. Not even nothing. Great lectures, best way to learn.

@josephlee392 11 місяців тому

Same man. The academic stress as an undergraduate was my "something always comes up," but since I just graduated a few days ago, I now have no excuse to not indulge myself in these videos lol.

@varunahlawat9013 Рік тому

Lovely presentation! It couldn't get more interesting!

@Itangalo 10 місяців тому

This was the third video I watched in search of understanding what transformers are, and by far the best one. Thanks.

@jingji6665 10 місяців тому

Thank you so much for the free course. Benifit and appreciate

@Reaperaxe9 Рік тому

Fully understand transformers. One of the clearest and succinct explanations out there, so intuitive. Thank you!!

@dotmalec 3 місяці тому

What an amazing content! Thank you! ❤️

@TJ-hs1qm Рік тому

best Friday after-work fun thanks!

@luizmeier Рік тому

I already have some knowledge on the subject, however, I like to keep myself updated and there is always something new to learn. She clearly explains how what she is teaching really works. The whole video is worth watching.

@AnonymousIguana Рік тому

Wonderful, easy to focus and understand :). Great quality! Grateful that this is open source!

@estherni9412 Рік тому

Thank you for this amazing and easy to understand course! I'm a beginner of the RNN, but I can almost know all the concepts from this lecture!

@riyajunjannat7294 10 місяців тому

I worked in spatial statistics during my graduation. And now, I think your classes will push me more and more towards the machine learning. Looking forward to apply my learning in my upcoming level of study. Thanks for your efforts 💝

@user-xq3sw9fj3d 7 місяців тому

Штоэто.запрасмоттр.непанядно

@vin-deep 10 місяців тому

Best explanation ever!!!! thank you

@twiddlebit Рік тому

I come back every year to check these lectures and to see what innovations made it into the lectures. Pleasantly surprised to see the name change, congrats!

@agamersdiary1622 Рік тому

What do you mean by name change?

@diamondshock4405 11 місяців тому

@@agamersdiary1622 This woman got married to one of the other lecturers (the channel owner Alexander).

@glowish1993 6 місяців тому

legendary lecture, thank you for sharing

@vohra82 4 місяці тому

I am an auditor and have very little to do with this subject, except for my curiosity. I feel lucky that these kind of videos are available for free

@elu1 Рік тому

Finally I understand the transformer concept now. Great lecture series👍!

@jennifergo2024 5 місяців тому

Thanks for sharing!

@ziku8910 Рік тому

Very intuitive explanation, thanks!

@mohadreza9419 5 місяців тому

Mr Amini thanks for your channel

@gksr 5 місяців тому

Thank you@MIT

@goswamimohit 9 місяців тому

Wow just amazing, no words left. Really Thanks 🙏

@maduresenerd5716 6 місяців тому

I just started learning about RNN and LSTM especially for NLP and found this video very helpful to me. It would be really exciting if you provided a video about transformers in more depth :)

@NoppadatchSukchote 11 місяців тому

Awesome Course, Very easy to understand+++, Thx all MIT instructors 😊😊😊

@holderstown643 Місяць тому

Thank you for the awesome lecture

@MuhammadIbrahim-ut3rq 4 місяці тому

Thank you very much for this great oppurtunity to watch MIT lectures. always dreamt of a world class education and finally im doing a degree in AI and such videos are supporting my learning process very much

@andyandurkar7814 5 місяців тому

Great material and the best educator!. Thank you for the fantastic video! The material was not only informative but also engaging, and the quality of the presentation was top-notch. Your depth of knowledge truly shines through, making the learning experience both enriching and enjoyable. Presented such complex material with such ease. You've done an exceptional job in communicating the concepts clearly. Great work!" and everything is free! Great job MIT team!!

@derrickxu908 3 місяці тому

She is so good!!!!🎉🎉❤❤

@lf655 9 місяців тому

Very good !

@pw7225 8 місяців тому

She is fantastic at teaching. I love how easily understandable she makes it. Thank you, Prof Amini.

@meghan______669 2 місяці тому

Really helpful! ⭐️

@alexchow9629 2 місяці тому

This is shockingly good. Thank you.

@megalomaniacal Рік тому

I am 6 years old, and I have been able to follow everything said, after watching 3 times.

@johnpaily Місяць тому

Life works on what she is speaking . We need to look deep into life to evolve and make a shift in thinking

@chukwunta Рік тому

This is some really deep learning. MIT is the height of institutional education. 👏👏. Thanks for sharing.

@akj3344 11 місяців тому

Code showed at RNN Intuition chapter at 14:00 makes thing clear af. I literally said "Wow"

@joshismyhandle 11 місяців тому

Thanks for sharing

@peetprogressngoune3806 Рік тому

I can't wait to watch

@krishnakumark.p8184 8 місяців тому

Great 👍 presentation 👏

@jerahmeelsangil247 4 місяці тому

The fact that these videos now have millions of views.... the world is evolving so fast scientifically or at least scientific culture.

@theneumann7 Рік тому

Thanks for sharing such high quality content! 👌

@johnpaily Місяць тому

Salutes hopr to come back MIT Deep learning. I feel you peple need to look deep inro life

@omerfarukcelebi6813 17 днів тому

This is the best lecture on UKposts! Thank you for the clear explanation. I wish you could delve deeper into the transformer architecture, though, as it was only covered in the last 15 minutes. Nevertheless, this is the most understandable video on the topic. I've watched nearly all of them, but this one stands out as the best! It would be great if you provided a more detailed explanation of transformers.

@imransaleem9125 Рік тому

Pretty straight forward lecture.

@ayo4757 10 місяців тому

que increible! esto es genial!

@FREAK-st6kk 22 дні тому

Whoever is listening to this awesome lecture I just want to say, Attention is all you need!!

@johanliebert6206 Місяць тому

Thank you so much

@Roy-hk8yh Рік тому

This is amazing. Studying from Kenya, and this absolutely is quality lectures.

@user-ov7ze8yc9l 8 місяців тому

perfect

@NoppadatchSukchote 11 місяців тому

Awesome Course, Very easy to understand+++

@user-gv4uk2ez6g 10 місяців тому

It's very helpful for me ❤

@avideshmukh6308 5 місяців тому

Great job simplifying very complex understanding the functions of neural networks! Avi MD MBA, MS, MHA

@prishamaiti Рік тому

I've always wanted to study deep learning, but I never really knew where to start. This MIT course was my answer

@uttamsinghchaudhary6334 8 місяців тому

Is there any videos regarding lab code implementation? I didn't found them on website. please do respond.. and of-course salute to those immensily knowledable tutors for explaining such deep topics with proper examples.

@Friemelkubus 3 місяці тому

Thanks a lot!

@Jupiter-Optimus-Maximus 8 місяців тому

Awsome! Video!! Very well thought out lecture. Keep rockin' !!! You just solved my problem in my NNW optimization project, in just two sentences.🤣 For 4 months, this has been driving me completely insane.💥🤣🔫 I think I'm in love.😀

@johnpaily Місяць тому

Great I don't know math , but you are feeding my conceptual thoughts about life and the universe from an informational point