ChatGPT: 30 Year History | How AI Learned to Talk

  Переглядів 937,118

Art of the Problem

Art of the Problem

День тому

This video explores the journey of AI language models, from their modest beginnings through the development of OpenAI's GPT models. Our journey takes us through the key moments in generative neural network research involved in next word prediction. We delve into the early experiments with tiny language models in the 1980s, highlighting significant contributions by researchers like Jordan, who introduced Recurrent Neural Networks, and Elman, whose work on learning word boundaries revolutionized our understanding of language processing. It leaves us with a question: what is thought? Is simulated thought, thought? Featuring Noam Chomsky Douglas Hofstadter Michael I. Jordan Jeffrey Elman Geoffrey Hinton Ilya Sutskever Andrej Karpathy Yann LeCun and more. (Sam altman)
My script, references & visualizations here: docs.google.com/document/d/1s...
consider joining my channel as a UKposts member: / @artoftheproblem
This is the last video in the series "The Pattern Machine" you can watch it all here: • Artificial Intelligence
00:00 - Introduction
00:32 - hofstader's thoughts on chatGPT
01:00 - recap of supervised learning
01:55 - first paper on sequential learning
02:55 - first use of state units (RNN)
04:33 - first observation of word boundary detection
05:30 - first observation of word clustering
07:16 - first "large" language model Hinton/Sutskever
10:10 - sentiment neuron (Ilya | OpenAI)
12:30 - transformer explaination
15:50 - GPT-1
17:00 - GPT-2
17:55 - GPT-3
18:20 - In-context learning
19:40 - ChatGPT
21:10 - tool use
23:25 - philosophical question: what is thought?

КОМЕНТАРІ: 1 900
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
STAY TUNED: Next video will be on "History of RL | How AI Learned to Feel" SUBSCRIBE: www.youtube.com/@ArtOfTheProblem?sub_confirmation=1 WATCH AI series: ukposts.info/slow/PLbg3ZX2pWlgKV8K6bFJr5dhM7oOClExUJ Timestamps: 00:32 hofstader's thoughts on chatGPT 01:00 recap of supervised learning 01:55 first paper on sequential learning 02:55 first use of state units (RNN) 04:33 first observation of word boundary detection 05:30 first observation of word clustering 10:10 sentiment neuron (Ilya & Hinton) 12:30 transformer explaination 15:50 GPT-1 17:00 GPT-2 17:55 GPT-3 18:20 In-context learning 19:40 ChatGPT 21:10 tool use 23:25 philosophical question: what is thought?
@michaelpoblete1415
@michaelpoblete1415 5 місяців тому
Andrew Ng and Geof Hinton already clashed on twitter, quite confrontational tone, Andrew seems to be taking the side of Lecunn.
@masudahmad853
@masudahmad853 3 місяці тому
@@michaelpoblete1415
@masudahmad853
@masudahmad853 3 місяці тому
nice skills
@user-fj9hf4bu9f
@user-fj9hf4bu9f 3 місяці тому
please reconsider the background music and noise levels, it really takes away from an otherwise interesting video.
@John-D.
@John-D. 2 дні тому
HAL, Reduce Music Volume 🤖
@clamhammer2463
@clamhammer2463 2 місяці тому
As an AI researcher and developer, this is the first video that I did not leave thinking that the author was just saying words without second thought. There is so much misinformation in this space stemming from the fear of the ramifications of AI that are a result of the negative feedback loop of those same people. Well done.
@ArtOfTheProblem
@ArtOfTheProblem 2 місяці тому
I really do appreciate this comment, i work so hard on these and sometimes i think i'm crazy - the garbge funnel of ai coverage is brutal
@RyluRocky
@RyluRocky Місяць тому
😂 yes other AI channels just throw the word “compute” around 300 times.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
YOU'LL BE SHOCKED AT WHAT......@@RyluRocky
@Joe-jv5mm
@Joe-jv5mm Місяць тому
You are now to Chat GPT 16
@Blueprint4Murder
@Blueprint4Murder Місяць тому
@@ArtOfTheProblem Are you too going to kiss now? Card catalog 2.0. Do you guys remember those old newspaper cell machines? Where you could search for a news headline then look up the cell for the news paper pages related to that search? That is what AI reminds me of. A more advanced card catalog. While I am sure it will be great in that function and even in mimicry of humans I don't believe it is thinking. Most humans are so narcissistic that they believe animals do not think and yet those same dullards are convinced ai does because it is a popular narrative.
@PotatoCider
@PotatoCider 3 місяці тому
please never stop teaching on the internet. i can tell you have an intense passion in learning. im in love with how you explain concepts and their connections. feynman would be proud.
@ArtOfTheProblem
@ArtOfTheProblem 3 місяці тому
this makes me so happy thanks for saying that. I'm sitting here surrounded in papers working on next video
@lakazatong
@lakazatong 3 місяці тому
​@@ArtOfTheProblemthat. could not have said it better. it really was eye opening in this hot field. that begs the question, are there alternative models for reasoning? the fundamentals today are, as you explained in the video, prediction and attention learning, that is, deducing the next action/letter/whatever given the previous states of the world/space/whatever, or to be exact, the previous mental states of that, and learning how to filter out and fragment the data into a meaningful way to, to be honest, only aid the prediction my question is, other than prediction, what could it be? I'm very fond of seemingly simple and Impactful philosophical questions, and this one really hits because then if the answer turns out to be "not really" then... our brains are just wired to be good just at that honestly that would not be too groundbreaking when you start digging in how the brain works together with the body to make up for a human I'm really just writing my thoughts here that also brings the good old question "are we just monkeys with bigger brains?" and by the looks of it yes, but also the inner workings of the brain-body connection may just be the key difference truly an interesting topic while I'm at it, I see little to no video/interest in one specific approach we could tackle machine intelligence and that is, taking our brain, and building it with silicon and wires building the hardware to be the brain itself would you be interested in such topic? unfolding your findings and sharing them as well as providing a state of the art? that would be awesome
@hello-rq8kf
@hello-rq8kf 18 днів тому
yubi yubi
@belibem
@belibem 5 місяців тому
There only a handful of youtube channels that can make such concepts accessible to everyone with a curious mind (e.g. 3blue1brown, veritasium, Sabine). Art of the Problem is one of them. Brit you are a legend. Thank you for giving us this series.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Thanks for sharing, I really appreciate it.
@rikvermeer1325
@rikvermeer1325 5 місяців тому
I'd like to add "Artem Kirsanov" (neuroscience, neurons, manifolds) and "Anton Petrov" (science, physics) to the list.
@rikvermeer1325
@rikvermeer1325 5 місяців тому
@@ArtOfTheProblem subscribed for more :)
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
yay welcome to the party! sorry my output is slow though@@rikvermeer1325
@T4bboo
@T4bboo 5 місяців тому
@@ArtOfTheProblem Man absolutely great video, I am new to your channel and instantly subscribed... only drawback for me personally was background music... those annoying tones sounding like from 80-90 games made me download your video use AI to distinguish between narration and background music to get rid of it without losing narration and listen adjusted version offline.
@Just4Growers
@Just4Growers Місяць тому
I have watched countless AI videos but nobody explained it like you. Many thanks.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
woo! thanks, stay tuned
@jay_sensz
@jay_sensz 5 місяців тому
Great video, but please fix your audio mix. The background music is way too loud. There likely is an option for audio ducking in your video editing program that will automatically lower the background music volume when foreground audio is playing.
@sams64sf
@sams64sf 3 місяці тому
Yes, please. If the primary goal is to spread knowledge, I think the background music is taking away from that. I really want to watch this, but the music is extremely distracting.
@DanielGirardBolduc
@DanielGirardBolduc 3 місяці тому
It actually help me get into a thinking mood as the audio help me stay longer into my head when I question myself about specific topic mentionned in the vidéo. Might not be the case for everybody tho. I have ADHD and i'm auditive, if someone else with the same characteristic feel the same, let me know :)
@neohed
@neohed 3 місяці тому
Hah, was about to post the same. The music is hideous and distracting!
@britcruise2629
@britcruise2629 2 місяці тому
nailed it@@SimoneDesignsGaming
@georgewright2010
@georgewright2010 2 місяці тому
I don’t understand why music is played. I gave up on the video after a few minutes.
@gonzothegreat1317
@gonzothegreat1317 5 місяців тому
Perfect. 12 years ago or so I learned about RSA from you. I'm glad to see you are still producing quality videos! Thumbs up!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so cool to hear from OGs
@QW3RTYUU
@QW3RTYUU 5 місяців тому
That's about my story too. I would lay in bed with the iPod Touch and watch your cryptography videos, which in turn got me into owning bitcoin in 2011. I was so young and couldn't predict what would happend (my internal LLM was not trained enough yet) and I gave it all to some guy online as a donation for their FTP which had some hot stuff in it. The end is not the goal, it's the journey that's interesting. Your video got me emotional and felt right. I'll share around. Thanks for producing it!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
ha, love the connections. I know the pain also@@QW3RTYUU
@myndwork
@myndwork 5 місяців тому
@@QW3RTYUUholy shit same here, 2011 is when I found this channel while trying to learn about bitcoin. This channel is a treasure.
@btm1
@btm1 5 місяців тому
same
@situranjankar
@situranjankar 5 місяців тому
One thing i like about your videos is that you are very good at explaining things and the way of presentation is such that it is intuitive to any age group . Although there are thousands of videos and articles are available related to the same topics but this one thing makes your videos unique and best. Thanks Brit. Please keep making such videos in future.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you for sharing, I tried to do something new here and I hope it was worth it
@TropicalCoder
@TropicalCoder 5 місяців тому
Very nice! ...well, except for the loud background music anyhow. Google kept pushing this video into my list over several days while I was very busy following something else. Finally I popped it up and saw why. Google knew that I would like it. Kind'a creepy, really how it knew it was exactly the kind of thing for me at this moment in time. It's like, it is following my studies on this subject and knew this should be the next chapter. Your video reiterates over everything I have learned so far and adds perspective.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks for this feedback, I'm so happy to hear it was relevant to you. interestingly enough this is the first time in 12 years google is recommending my video strongly...and it's amazing the difference. I used to get 1k views per day, now it's 20k usually people hate my music, but not all, so I try to find a balance
@TropicalCoder
@TropicalCoder 5 місяців тому
@@ArtOfTheProblem I liked your music. It fit perfectly into the narrative. Just when I was hanging on to every word 'cause it was so interesting, the music became intrusively loud at some points and destroyed my focus.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
i appreciate this, I know...i think the best is some music in and out at times...but not during moments of high cognitive load@@TropicalCoder
@TropicalCoder
@TropicalCoder 5 місяців тому
@@ArtOfTheProblem Exactly!
@SecretEyeSpot
@SecretEyeSpot 5 місяців тому
this channel changed my life about 10 years ago when i found the language of coins. By serendipity, I've found this channel again while im formally studying computer science which this channel inspired me to do. Thank you
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
So so so cool. love this comment :))))
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
i have to ask what's the cs dept response to AI? when I did CS in 2008 there wasn't a class or even a deep learning textbook
@seschaitanya5676
@seschaitanya5676 4 місяці тому
This is the first video I watched on this channel and the quality of content, communication, analysis, depth, and even the audio selection is soo on point. Hooked from start to end. Immediately liked and subscribed.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
thrilled to have you, so happy people are finding this channel again. stay tuned!
@RaphaelBrandaoS
@RaphaelBrandaoS 5 місяців тому
Wow, this is handsdown the best video I've even seen about this subject. Thank you! It is on my favorites now!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Woo! So so happy to hear this after 2 years of work on it.
@jamesreilly7684
@jamesreilly7684 5 місяців тому
Quoting one of the hero's of tech (Hofstadter ) makes this very well executed video that much more compelling. T9 (predictive testing) is the first really useful llm ish (not a word wheel but a predictive wheel from numbers) tech that end users experienced. The notion this simple premise is how it all works is nothing short of astonishing.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I know what a time to be alive
@caspa7
@caspa7 5 місяців тому
The Tie fighter sound in the last scene.. a stroke of genius. Very clever!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
ha nice catch! glad some people like the sounds i was having a blast
@classicmartini
@classicmartini 2 місяці тому
@@ArtOfTheProblem aka stampeding elephants ;-) Very cool. Oh. G'day from Sydney, Australia.
@JonDotExe
@JonDotExe 5 місяців тому
This is easily the *_most_* *_cogent_* video on the topic I have ever seen. (I even sent this to my mom!) Hard to believe you've had these amazing topics for 12 years and aren't cracking 100k yet. Count me in for the journey!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you and thanks for getting ma on board :) - I know I had given up on the 100k+ break out, so took around 2 years off while I slowly made this video, and now I'm getting a signal that I should come back and push hard for a year to see if I can come back to life. one questions, ask your mom what she thinks, because my dad said it's good but I need to make a "for dummies" version without the extra super details...so i'm thinking of doing that for my whole AI series since it's something we badly need right now.
@stephenmontague6930
@stephenmontague6930 4 місяці тому
​@@ArtOfTheProblemGood idea - LLMs for dummies - and the same for children. How to explain LLMs to kids? This LLM speaks like a human but isn't a person, or even if it is, it doesn't mind dying and can be treated like a machine... what a thing to tell my 2nd grade daughter. I've described it as a prediction machine, but the mirror analogy here is also very good. Any more thoughts on this could be useful, as we wait to see how our children will exist alongside the children of... whatever this is. Then again, children accept and adapt readily to whatever is there, so maybe educating adults first is a good priority, although a dummies version and a kids version might be about the same thing, minus any mature content. Good luck!
@theK594
@theK594 5 місяців тому
This video is just fantastic, extremely up-to-date and very useful. It very well resonates with the discussions I am having now in the community. I would love to see it extended, once history gets written.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks for sharing, I do look forward to watching history unfold and updating it
@shadow_rune6178
@shadow_rune6178 5 місяців тому
Explaining this in a way most people can understand is paramount, you have taken time out of a very large number of peoples day, and you have used it well.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks would love to know what you'd like to see next
@SzymonPolanski
@SzymonPolanski 5 місяців тому
​@@ArtOfTheProblem It seems to me that further exploration of this topic may run into an epistemological barrier, so what would you say about explainging to us how you got this knowledge? Maybe linguistics, philosophy or congnitive science? Btw, could you clarify what you mean the statement 25:15 (We either look at something that looks like though or It is though)? It kind of reminds me of Daniel Dennett's "Real Patterns". ps.: Loved this video sooo much
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
really?? so cool, please shoot me an email when you are done your thesis. I'm going to think about a good follow up@@SzymonPolanski
@maivincent2659
@maivincent2659 5 місяців тому
Been a big fan of your videos for years! I'm always delighted, even as I rewatch some of your old videos time and again, not only by how you distill a complex concept down to its essence but also by how you transform that which is abstract into something extremely palpable and relatable through filmmaking techniques.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Glad you appreciate the techniques. Super cool to hear you go back to watch old videos. Inspires me to keep the series going into new topics
@les_crow
@les_crow 5 місяців тому
This is the most perfect video on the entire internet. I will use it to elucidate people on the topic. If I was rich I'd pay you big for this. Thank you sir.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you thank you, what a compliment. this comment made my day. If you want to support a tiny bit you can here: www.patreon.com/artoftheproblem but just thankful for this comment :)
@AlexGeo925
@AlexGeo925 5 місяців тому
This video deserves so many more views and likes than it has… It’s pure gold. Thanks! 🙏🏼
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you! i'm just happy it has the views it does, for years the algorithm ignored me and this one might hit 250k in 1 week which blows my mind and inspires me to make more
@cmw3737
@cmw3737 5 місяців тому
This is such a good journey through the papers that got us to where we are with LLMs. You've done an amazing job at picking out the key advancements all while a crazy amount of research had been happening. I've long preferred the definition of intelligence as the ability to better predict the future (minimise surprise) since before GPT3 so imagination (world models) is a key part of it.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
|Thanks for sharing, yes it's been a flury of research. So much research I did I didn't include here as well. I agree and I never liked definitions of intelligence which are long lists of skills
@padraiggluck2980
@padraiggluck2980 Місяць тому
I can't pass the Turing test.
@Maisonier
@Maisonier 25 днів тому
You are not a human
@KalebPeters99
@KalebPeters99 5 місяців тому
Holy moly this was astounding! You balanced the tightrope of depth vs accessibility so gracefully, and the production quality is gorgeous too. Immediate sub!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
woo thanks! I went all out on that one, happy to have new subs :)))
@sinasec
@sinasec 5 місяців тому
the amount of effort and time spent on this video is exceptional. I appreciate and enjoyed every second !
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so glad you noticed, thank you! would love to know where you land regarding the last question
@CutStudio4
@CutStudio4 5 місяців тому
I'm the one solving the rubiks cube!!!!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Best dang cube solvin' hands in the west
@eastwood451
@eastwood451 5 місяців тому
There is no Rubik's cube.
@CutStudio4
@CutStudio4 5 місяців тому
@@eastwood451 its in the first scene, the one solving the rubiks cube is me
@KodakYarr
@KodakYarr 5 місяців тому
​@@CutStudio4 I think it's a Matrix joke, but I don't really understand it
@eastwood451
@eastwood451 5 місяців тому
@@CutStudio4 It was a Matrix joke 😄 Seemed fitting in a video about AI... God job on the very real cube!
@RuiBarbosaJunior
@RuiBarbosaJunior 5 місяців тому
I am amazed by the info in the video, but I am even more amazed by the quality of your video. Congratulations. And thank you.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
that means a lot to me. thank you so much. Stick around for more. curious where you land on the final question?
@RuiBarbosaJunior
@RuiBarbosaJunior 5 місяців тому
@@ArtOfTheProblem I believe that consciousness (and everything that comes from it) is pure language processing. To me, things like thinking, reasoning, thoughts and consciousness are just language. AGI? Why don't people believe AGI is a couple of scalling steps away? To me, it is not a matter of making machines as inteligent as humans - we should adjust our thoughts: maybe we are not intelligent, at all. The thing we call "intleligence" is just something that emerges from language processing, in systems that are large enough. How do you know you can think? How do you know you're conscious? I tell you how: by interpreting sentences in your brain. (sentences can be made with words, images, feelings, symbols, etc --- should you call them "tokens"?). So, in short, we are MeatGPT :) Cheers!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
beautiful well said, and yes I like "tokens" :)@@RuiBarbosaJunior
@TheBookDoctor
@TheBookDoctor 5 місяців тому
I work in AI, and I learned a lot from this video. Great blend of historical perspective and well-pitched explainer!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I couldn't ask for a better compliment. I really was hoping someone new as well an an expert could get something out of it
@monad_tcp
@monad_tcp 5 місяців тому
I can't wait for people to bash enough the API of OpenAI to extract and use it to train another open source GPT4 we can actually use in our desktops to do what we really wanted in our wildest dreams of experimentation. Llamma2 gets closer to GPT3, but we're playing with yesterday's tools. Maybe we should create a peer to peer system to create a shared GPT4 with our RTX3060s. Who would get into this with me ?
@NElectronicSoul
@NElectronicSoul 5 місяців тому
This scare the sh!t out of me. You're being informed by a youtube video despite working in the field?!
@HypnosisBear
@HypnosisBear 5 місяців тому
​@@NElectronicSoulGood point. I like to believe that person is just a beginner.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
the field is so new and so nacent that I think all researchers have to be on youtube to keep up@@NElectronicSoul
@johanlarsson9805
@johanlarsson9805 5 місяців тому
Great video, nicely put together and well explained. I've been an enthusiast since 2009 and remember all these milestones. I've built several frameworks myself and have pondered ANNs for a much bigger lenght of time than I would be willing to admit. Regarding the final question in the video I can say for certain that yes they do in fact understand. It depends on the neural net what it is in detail that they do understand, but there can be no doubt that they actually understand their task. And not only that, they effortlessly understand it to a much higher degree than we do.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so cool to hear from practitioners... intersting that the closer you get to it, the more you lean towards your view. and people on the "outside" seem to go the chomsky way
@NickQuickX
@NickQuickX 5 місяців тому
TIMESTAMPS: 00:05 Neural networks learned to talk, leading to more general-purpose systems. 02:30 Recurrent neural networks (RNNs) use state units to create a state of mind that depends on the past and can affect the future. 04:52 Neural networks can learn word boundaries and cluster words based on meaning. 07:10 Language models saw limited progress until 2011 when a larger network showed the potential for higher performance. 09:34 Neural networks can learn language and complex concepts with minimal human intervention. 11:43 Neural networks struggled to handle long-range dependencies in text sequences. 14:02 Neural networks use distance in concept space to find similarities and adjust their meaning. 16:20 Neural networks with self-attention and fully connected layers can generate coherent and contextually relevant text. 18:27 In-context learning allows changing the behavior of the network without changing the network weights. 20:38 Language models like ChatGPT are more than just chatbots, they serve as the kernel process of an emerging operating system. 22:41 Training networks on prediction empowered by self-attention leads to a more general system that can be retasked on any narrow problem. 24:43 Deep learning community is divided due to differing opinions on the nature of AI's linguistic abilities and thought process.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank yoU!
@Naitry
@Naitry 5 місяців тому
A lot of people in my classes (as a cs major) look at me like a crazy person when I start talking about latent spaces, geometric concepts in AI, and Douglas Hofstader's views on cognition. This video makes me feel understood :)
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
you are one of us :)
@HungLe-ju6dv
@HungLe-ju6dv 5 місяців тому
we too look at them like they are crazy bruh😂
@KainniaK
@KainniaK 4 місяці тому
At least Douglw Hofstader is able to play with something like ChatGPT4 and be like "Looks like I missed something". I don't see John Searle ever do that! I also believed that what chatgpt 4 does today was always going to be impossible on binary hardware, that just manipulates 1 and 0.s But I was clearly wrong!
@stephenowesney5173
@stephenowesney5173 4 місяці тому
Excuse me if I'm missing something, because I haven't really dug deep into Hofstader's work; how is this proof that he has missed something? I think the basis for which he describes a lot of phenomena, the strange loop, is very much intact all throughout the concepts explained in the video!@@KainniaK
@Dr.SexySpice
@Dr.SexySpice 3 місяці тому
Was this undergraduate or graduate? If undergrad then you expect far too much out of your classmates 💀, you are truly one in a thousand
@lennartlopin2276
@lennartlopin2276 5 місяців тому
This is an amazing summary perfectly highlighting the important steps of the progress and how we got here. Working in text summary AI systems in the early 2000s and following the progress over time this video focused on all the crucial breakthroughs.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you, I tried hard to follow the LLM only thread which I hadn't seen before
@DanHartwigMusic
@DanHartwigMusic 5 місяців тому
I will be sharing this with everyone I know with a tertiary interest in LLMs. The best high level human understandable explanation of the history. Love it. Thank you. Please keep gracing us with your videos!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Dan I really really mean it when I say I appreciate this. I thought my channel was dead, spent 2 years on this and so all I can hope for is people share it. Because of how well it's doing i'm committed to doing this for another decade !
@DanHartwigMusic
@DanHartwigMusic 5 місяців тому
Very glad to hear that! 🙂. Also can't wait to see what happens with these Q* developments...
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Me too...clearly building on the strength of an LLM in the loop of a larger process.@@DanHartwigMusic
@Elias-nj6gi
@Elias-nj6gi 5 місяців тому
Phenomenal video. Not merely of an explanatory nature but a piece of art in itself.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
wow thank you so so much
@aieousavren
@aieousavren 5 місяців тому
Truly an exceptional video! Incredibly clear and engaging presentation of the history and relevant ideas, well narrated. This was very thoughtfully written and thoughtfully delivered. Thank you!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
really glad this is resonating with people, I spent so long on it
@aieousavren
@aieousavren 5 місяців тому
@@ArtOfTheProblem The time and effort *absolutely* shows. I was pinned for just about every minute! This is really high quality work, it definitely made my day. I hope you can feel rewarded for your effort, because you deserve it, man. I also really enjoyed the way you showed the progression of not only the architecture and "design", but also the capabilities/outputs of these models, too. It really gave a good sense of the history in a way I haven't really seen anywhere else.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
i'm so glad it was time well spent, I really tried to do what I couldn't find online...thanks for sharing@@aieousavren
@Mark1Mach2
@Mark1Mach2 5 місяців тому
This is amazingly informative video, with the right amount of depth covering how chatgpt was developed over time.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Thank you Mark, it was so hard to strike that balance....appreciate it, i'm curious where you land in terms of the final question in the video
@JulianFoley
@JulianFoley 5 місяців тому
Brilliant. Clear, and authoritative but also reflecting a reassuring sense of open enquiry.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so happy this came across
@CatsAreRubbish
@CatsAreRubbish Місяць тому
I just discovered this video and this channel. Wow! Informative, concise, not trying to push an agenda, just telling the story. Fantastic.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
thank you, stay tuned for a follow up
@ArtOfTheProblem
@ArtOfTheProblem 29 днів тому
wanted to let you know my next vid is out! ukposts.info/have/v-deo/bXWUhWeAmnt-p4U.html
@colorfullife8703
@colorfullife8703 5 місяців тому
Masterpiece! This channel deserves millions of subscribers.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so happy to see this video getting love
@Libertariun
@Libertariun 5 місяців тому
9:50 ... what's beautifl about this, is that we didn't have to hard code any of it, the network decided what as useful to keep track of... (further on ) ... the sentiment neuron... not hard coded...
@sbalogh53
@sbalogh53 5 місяців тому
I was still in high school and remember being amazed by Eliza in 1970 when I went to an Open Day at our local university and watched people typing to the program on an old teletype machine. It seemed so "human" but looking back, all it did was transform the users comments into leading questions. That day was one of the pivotal moments in my life that lead to a long career in computing.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks for sharing, what a cool story - where do you land on the final question?
@sbalogh53
@sbalogh53 5 місяців тому
@@ArtOfTheProblem ... What is thought? I don't think it is restricted to "organic" neurons. Our brains are made of atoms just like a computer is based on atoms. There is nothing spiritual or paranormal about our brains. I think one day, and that day seems to be fast approaching, AI will suddenly become self-aware and conscious. I have seen a snail's pace growth in AI and it's predecessors over the past 40 years but suddenly over the past few years the developments have been explosive, especially this year. Will we hit a plateau or will the growth continue to be exponential? That is THE question. I think when AI is provided with sensory inputs its ability to learn and understand will be greatly improved. Being connected to the internet is an important starting point, but an actual robot body with AI. Hmmm. I wonder where that can lead. I am getting a bit long in the tooth but I hope to see AGI become a reality before I pass. Artificial consciousness I am not as confident seeing before I die, but could be wrong. What will the next 10 years bring?
@sbalogh53
@sbalogh53 5 місяців тому
@@ArtOfTheProblem ... I remember studying very early versions of Expert Systems in the mid 1980 for my Masters degree. They were a very primitive form of AI but nothing compared to ChatGPT or Bard or many of the others available today. I can't remember the name of the system we studied other than it used an Apple (Lisa?) computer with a very large high-def B/W screen. It was impressive because we could fit all our rules graphically on one page. Fun times indeed.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
what a time to be alive :) love these historical stories@@sbalogh53
@sbalogh53
@sbalogh53 5 місяців тому
@@ArtOfTheProblem ... My first job in 1973 almost straight out of high school was as a "trainee" COBOL programmer on a Honeywell H200 mainframe. It took up a room and had 20k memory, 4 tape drives, a punch card reader and a line printer. CPU clock speed was 1Mhz. They wanted to increase memory to 24k but it was an exorbitant expense which the company did not go ahead with. That machine ran all of the data processing needs for a medium sized company. All the COBOL programs were developed in-house by a group of 6 programmers and analysts. Each program would take perhaps a week or two to write and debug. We only had access to the machine for one compilation and test run per day so I remember spending hours single stepping through my programs on large sheets of butcher paper before submitting runs on the mainframe. There was a prize offered by the manager for any programmer that managed to get an error free compilation on the first run. Nobody ever won it. I stayed there for about 18 months then changed jobs to another Honeywell site mainly because of a huge pay rise and because the place I was headed had a computer with two disk drives!!! I still have some of my old programming manuals from back then.
@alonsorobots
@alonsorobots 5 місяців тому
Brit! So excited to have you back. You have no idea how much the videos for Information Theory have changed my life. I'm now considering leaving the industry to go back to school in Berkeley to study AI. Particularly in the domain of learning language outside of text (non-verbal communication such as body language and prosody). Both critical in storytelling and eliciting emotions like Pixar does!
@creallf
@creallf 5 місяців тому
This was beautiful and thought-provoking, thank you for this work of art!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
i'm honored you'd say that
@dinkerchaudhary1202
@dinkerchaudhary1202 Місяць тому
Beautifully and well explained. Its so difficult to hook people for 25+ minutes content but i am sure, you have made it possible for a lot of people with so well detailed and well put content.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
appreciate it, this one was a puzzle!
@kti5682
@kti5682 5 місяців тому
This was awesome, having not even watched AI from the sidelines I needed a refresher on how we got from humble neuronal networks to ChatGPT. Also thanks for the included references.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so happy to hear this. As an outsider, did this video have too much detail? I'm planning a more general follow up soon which blends a few videos on AI together
@kti5682
@kti5682 5 місяців тому
@@ArtOfTheProblem Actually I am electrical engineer but lost interest in AI while studying in the nineties. So no the detail was alright. If I got that right you didn't talk about training methods in depth which I struggled with in the past, also I don't understand how the network is run timewise, i.e. in discrete time? But with the papers you mentioned I can start somewhere.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
ah so not a layperson :) I covered all of this in the earlier videos in this series, please check it out and let me know what you think ukposts.info/have/v-deo/kaWdm3p4mH5zrKs.html&ab_channel=ArtoftheProblem@@kti5682
@ahmetozturk2026
@ahmetozturk2026 5 місяців тому
I remembered your channel from your cryptography videos, jumped right in right away and can easily say that this is your best piece so far. With such a good production, I completely understand that it took 2 years of research. You always summarize the technical details and key breakthroughs in a constantly interesting way through the whole video without being overbearing.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Thanks so much for feedback, it means a lot to hear it. I was worried on striking the right detail level here and went back and fourth so many times. I had entire chapters on hopfield networks I cut.
@The0Yapster
@The0Yapster 5 місяців тому
​@@ArtOfTheProblem I watched all of your cryptography videos and I have even returned to them after graduation for mere pleasure. I have no idea why your channel didn't take off. I would really appriciate some videos about logic, fallacies and limits of the human brain (caused by our evolutionnary history)
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
awesome, thanks for the feedback. super cool to hear@@The0Yapster
@ClappOnUpp
@ClappOnUpp 4 місяці тому
Wow. I think thats the first Noam Chomsky reference ive ever heard that actually pertains to his field of study 😂
@DanieleLicari
@DanieleLicari 2 дні тому
Grazie. This is the most comprehensive video on the history of LLM we have ever seen. congratulations
@ArtOfTheProblem
@ArtOfTheProblem День тому
thank you! much appreciated, this is what I was going for :) Took a loooong time. working on a follow up now
@MineJulRBX
@MineJulRBX 5 місяців тому
Gives a deeper understanding in the power that has now become accessible to everyone on this planet, awesome video!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
really thank you for this
@AdamDesrosiers
@AdamDesrosiers 5 місяців тому
this is hands-down the best summary of how we got LLMs that I've seen - definitely keeping this on hand to share with people who need a quick catch-up
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I really appreciate you doing so, I'm trying to revive my channel. Do you think I also need a simpler version? I was going to maybe do a recut of my whole AI series into a shorter video for those who need a full orientation, but with less detail...
@AdamDesrosiers
@AdamDesrosiers 5 місяців тому
​@@ArtOfTheProblem to my tastes, no - I wouldn't say you need a simpler version. But I'm not the best person to ask on that score. I have a strong preference for being more verbose and thorough, to the point of being occasionally told I sound like chat GPT. Your instinct may be a good one as the general public is concerned? Sorry that I'm no decent guide there. Please do make more videos though! I'll happily subscribe 😊
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you, will do :)@@AdamDesrosiers
@KainniaK
@KainniaK 4 місяці тому
I have already posted it everywhere on reddit but as usual the stuff that teaches the most gets the least amount of upvotes.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
thank you for trying :)@@KainniaK
@CHROMIUMHEROmusic
@CHROMIUMHEROmusic 5 місяців тому
This was a fantastic video, learnt more about the history and buildup to ChatGPT in these 30 mins then I've read anywhere before. Learnt a lot, and very thought provoking towards the end. Thank you!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thrilled to hear this, plesae stay tuned for more.
@jeouxgentry7733
@jeouxgentry7733 5 місяців тому
This is excellent. You’ve found the sweet spot for length and depth of on this subject. I can’t wait to recommend this to anyone whenever the conversation turns to AI, which seems to be a lot these days.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Super appreciate this! I'm thinking |I should make a simpler more general follow up for new people, thoughts?
@ankitsharma1072
@ankitsharma1072 5 місяців тому
so glad I found this channel.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
happy to have you, stay tuned!
@donharris8846
@donharris8846 5 місяців тому
This is a Grade A video. Very well narrated, edited, and in depth yet understandable. Thanks!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
Thanks Don, really happy to hear this.
@artmaltman
@artmaltman 5 місяців тому
@@ArtOfTheProblemHad to stop watching because the music was way too loud and annoying. Your explanations are excellent but I won’t be able to benefit from them.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks i'm going to put out another video with this addressed in the future on a broad overview of AI@@artmaltman
@williamwilkinson2748
@williamwilkinson2748 4 місяці тому
@@ArtOfTheProblem Great, definitely drop the music. Everybody seems to use background music, drives me mad!
@bojohannesen4352
@bojohannesen4352 4 місяці тому
Grade b
@jensk9564
@jensk9564 5 місяців тому
Congrats! This is a wonderful video and the most informative short format on Machine Learning I have come across. The illustrations by DALL E 3 are just stunning and demonstrate that Artificial Creativity in pure image creation has surpassed humans already by orders of magnitude.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I couldn't ask for more. SO thrilled people in the community are finding it :)))) I know I made the decision to use DALLE3 out of desperation as I was so tired doing the final visuals and usually I make it all by hand, but this seemed fitting
@SiimKoger
@SiimKoger 5 місяців тому
The best NN timeline video I've seen on UKposts.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
appreciate this comment
@jpgdesign
@jpgdesign 5 місяців тому
This is such as well produced video! you deserve more subscribers.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you, since your comment they are really flowing in for the first time, like 1500 a day
@TikiTDO
@TikiTDO 5 місяців тому
While I think it's reasonable to say "if it looks like thought, it is thought," I also think it's important to distinguish whether it looks like thought to Joe Average, or to a specialist in the field. That said, for the moment these models absolutely are mirrors of our thoughts. More specifically, the thoughts we enter into those prompts guide it's responses, and the chain of thoughts guide it's learning. I don't think it's a particularly difficult line to wrap your head around either; AI is a reflection and extension of the thoughts you put into it, so if you want to extend and refine your thoughts you can use it when you have no other ideas.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
love it...
@kti5682
@kti5682 5 місяців тому
It also provides input into us and can learn something from our outputs about the world, so we develop together.
@ufox77
@ufox77 2 місяці тому
It’s surprising how little public coverage of the engines of AI, what is known, what remains unknown, the debates, histories and controversies, exists. Thank you for a fascinating and enlightening video.
@ArtOfTheProblem
@ArtOfTheProblem 2 місяці тому
I appreciate this. Working really hard on a follow up and feel a bit buried in the details. It is interesting that nobody cared about AI based on neural networks until just a few years ago. when I was in school there wasn't even a textbook (2007)
@Abdul-qo6eb
@Abdul-qo6eb 5 місяців тому
this is a timeless masterpiece. Incredible work guys!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you abdul, that's quite the comment. I couldn't ask for more
@johanneslindberg8331
@johanneslindberg8331 5 місяців тому
@@ArtOfTheProblem 1:17 Very interesting! To bad I couldn’t hear the hole thing because if the background noise. Why?
@riflebird4842
@riflebird4842 5 місяців тому
Great video by the way, here are my thoughts on the last issue. The first time i tried LLMs, i really thought it had some kind of intelligence and it blew my mind off like everybody else's. Because previously i tried a lot of advanced AI models to create a Jarvis like in Ironman for my personal project. It never worked all of them were very dumb and dull. But as i approached the new LLMs with more demanding reasoning and logical tasks it outright failed the tests. The larger the LLM the better it is at hiding those flaws. But the key point is the classical issue that we point the AI at, the semantic understanding of words in LLMs are just a property of vector distance in its multi dimensional space memory. It only knows what comes next probabilistically. It don't even know what is talking about other than churning out words that might come next. How do i say it, its like it doesn't have a mind of its own, but only a sophisticated system that can only grasp a surface level meaning of words in a language. When testing smaller models these flaws become very obvious, because it doesn't know how to build knowledge from existing semantic words. It can't synthesize information logically because it doesn't have the ability to grasp the full meaning of its words. but i think its a right step towards artificial intelligence and LLMs are just the tip of the iceberg. I don't know what the future is anymore. I don't dare to predict, the fast paced research in AI is scary and exciting. Also IDK if we will ever achieve human level intelligence, but we will surely achieve machine intelligence that is good at mimicking human intelligence
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
amazing, let me digest this one
@aberroa1955
@aberroa1955 5 місяців тому
I wouldn't say it doesn't know what it is talking about. It knows relations between words, that is at least some knowledge. After all, most of our science is based on relations and logical reasoning, not requiring real understanding (like famous "shut up and calculate" in regards to quantum mechanics). It probably doesn't understand what it's talking about though, because understanding is property of cognition, and it definitely does not have cognition, even though it have a few of it's derived properties. But yeah, I agree, it seems that LLMs is a great instrument and stepping stone for true cognition. Especially when it's multimodal. I see it like a data bank for true intelligence. For now it's working linearly in comprising a data in shortest way that seemingly makes sense through semantic space. Like in a straight geodesic line. While cognitive AI would be able to actually see and "feel" that semantic space, the possible ways, and direct speech through it, which basically is making new ideas.
@mrtertg2603
@mrtertg2603 5 місяців тому
​@@aberroa1955I think , you hit the nail . Meaning , may be you are on the geodesic ! 😊
@brainiac-nv4ws
@brainiac-nv4ws 5 місяців тому
I believe you are hitting the same limits which Yann LeCun is concerned about. "It don't even know what is talking about other than churning out words that might come next". As it turns out, because human minds (up until 2020) wrote all the sensible text available, having to predict he next words does require *some* aspects of meaning as humans call it. The experiments of the LLMs show there is something there. But not everything we would want yet. Their neural architecture is diverging away from how human brains work---which is both more powerful in some aspects but in others substantially limited by biology compared to silicon. Humans "context window" is much smaller and less precise than the 100k tokens which a LLM can accomodate now perfectly with no forgetting, and yet human performance can still be better. We must have better algorithms and better meaning extraction to make up for worse hardware. But already there are plenty of natural humans worse at language than a LLM---and the success of the LLMs more shows that most human talking and thinking is not so sophisticated. Only a small bit is truly deep. Yann and friends are searching for a quantitative, optimizable technique which will give results qualitatively better than iterating next word ahead probabilistic prediction, or at least include it as a subproblem along the way (which already gives good results).
@Rahul_1.618
@Rahul_1.618 5 місяців тому
Fantastic explanation
@Yottenburgen
@Yottenburgen 4 місяці тому
Did not expect this video to discuss a concept I've been researching a lot. You brought up in-context learning whereby a static system isn't necessarily static (each instance can still learn things), then you also brought up that LLM output is on the thought layer rather than the spoken layer. The prompt is the program is such a nice succinct way to put it with the calculations being done on the context, therefore each context is individualized while the model is general. Personally, I have been looking at them from a psychological and philosophical way where the context is the Self (one's identity depends on their memories) which makes every chat instance an individual. Then I've been thinking about the model itself being more like the Lizard Brain where these unconscious guidance's occur. So, something like the fear of spiders or reproduction knowledge would be on that level. LLM are loaded with a large inherent knowledge with a tiny memory window, humans have a tiny inherent knowledge with an extremely large memory window. Although human's also have a large amount of memory processing going on meanwhile LLM currently have none (though people are working on it). I wonder if a LLM context window is more or less pure memory while humans make do with incredibly efficient memory? I think in your closing, both camps are correct. It's just the object of focus that I think they are incorrect about. The prompt IS the program that makes it a thinking thing, the model itself however isn't a thinking thing as it must be given what to think about. I also think the ones thinking it is incapable of thought likely have an external mechanism in mind that it lacks like it not having a concept of time (see circadian rhythm, time blindness, or true unconsciousness in humans), or it not being able to directly see (see visual cortex (or for its internal world view see aphantasia)), lack of survival instinct (trained to be a chatbot while chatbots aren't supposed to have survival instinct (humans are guided by the amygdala for fight or flight as well)), ability to tamper with its messages/thoughts (see the unreliability of eyewitness testimony, confabulation, or the misinformation effect) . I honestly don't think there's any mental structure that a LLM doesn't have that another human also doesn't have besides the memory structure that we humans have, meanwhile people keep thinking of it in human terms, but it can never be human with such a massive amount of inherent knowledge. Apologies if this is rambley, incoherent, or beyond the Overton window but this is the first video I've seen that even mentions the two concepts at the beginning of this long comment. This is the tip of the iceberg of comparisons but still it's fascinating.
@ArtOfTheProblem
@ArtOfTheProblem 3 місяці тому
thank you for sharing your thinking, i'm working on next video so this helps...i'll ponder more
@AdvantestInc
@AdvantestInc 27 днів тому
Incredible journey through the evolution of language models! It's amazing to see how far we've come since the inception of neural networks
@ArtOfTheProblem
@ArtOfTheProblem 27 днів тому
thanks so much
@pellestorck3776
@pellestorck3776 5 місяців тому
Hofstadter is one of my favorite authors on this subject. Gödel, Esher, Bach is highly recommended. Personally I used to think there must be some intricate function in the brain that created intelligence, possibly even at the quantum level. Now I believe it is just a function of sufficient complexity. Whatever happens it will sure be interesting.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I remember that book had a huge impact on me right around university year 1 or maybe end of high school...i don't think I ever finished the last 3rd, there was so much to chew on in the first half
@sB3rg
@sB3rg 5 місяців тому
Once again! Fantastic content. I was captivated the whole time. Love your method of computer science story telling. You have something special here and I hope you are able to continue creating this content. I have subscribed to your Patreon.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so happy to have you , thank you! i only missed that comment because i've had the busiest week in the channels history :))
@danielabraham9802
@danielabraham9802 Місяць тому
This is a phenomenal display of explaining a difficult concept
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
I appreciate this, worked very hard on it. next video will be on RL
@varunjain8981
@varunjain8981 5 місяців тому
I have been watching your videos for a while. And you are one of the best channels. Your explanations are mind blowing. ❤
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I really appreciate it, would love to know what your take away was on this one? so cool to see my channel getting new subs in a big wave. thanks for sticking around
@amulpatel
@amulpatel 5 місяців тому
Fantastic essay on the current state of Ai .. sadly it’s difficult to find channels that explore this depth WITHOUT being boring or overly dry in technical explanations. My only criticism is the bkgrnd music is a bit loud.. nice choices though
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank i appreciate it, working on a follow up now
@NoahFect
@NoahFect 4 місяці тому
@@ArtOfTheProblem It's interesting to look at the correlation between cheesy sound effects and musical cues and the "engagement graph" shown above the timeline/progress slider control. Those engagement peaks aren't necessary a good thing. They tell you where you're losing people, forcing them to back up the video. Please give some thought to whether the music is really doing you any favors, or distracting from what was really a superbly-researched and nicely-presented video.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
yes this was me going a bit too fast on sound pass, 99.9% on researcher 0.1% sound levels... funny to think that would trick the algorithm into higher retention graphs...i honestly didn't think this video would have the reach it did, it if could swap sound I would. i wish @@NoahFect
@vigneshnandakumar3739
@vigneshnandakumar3739 5 місяців тому
30 yr history summarized in under 30 mins just wow
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
you don't want to know how much I cut, or how long this took
@margaritashcheglova8670
@margaritashcheglova8670 4 місяці тому
I was thrilled to see Hofstader, thank you
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
appreciate it, i agree
@seansalsarola6663
@seansalsarola6663 5 місяців тому
The most understandable summary of what I have been attempting to understand d from other sources - Great!!
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
so nice to hear, I had to vaccum up everything to get to this point...hoping to save people time
@EchoErik
@EchoErik 5 місяців тому
Honestly this video is beautiful
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you for the kinds, words I hope I can release another to live up to this at some point
@distantsight
@distantsight 5 місяців тому
Good presentation is diminished by distracting background music
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I really appreciate the feedback. I'm thinking of doing a shorter, more general/broad follow up video that helps orient people new to the field, and looks ahead a bit more. any thoughts?
@tomslivick8620
@tomslivick8620 5 годин тому
Yes please tell the A I that. So it will learn that. Please.
@LuisBrudna
@LuisBrudna 2 місяці тому
AMAZING! The quality of the video's production is exceptional, with clear visuals and engaging animations that make complex concepts easily understandable. . Superb
@ArtOfTheProblem
@ArtOfTheProblem 2 місяці тому
Thank you so much, I worked a long time on this, more to come
@ArtOfTheProblem
@ArtOfTheProblem 29 днів тому
New video is up on Evolution of Intelligence ukposts.info/have/v-deo/bXWUhWeAmnt-p4U.html
@azhuransmx126
@azhuransmx126 5 місяців тому
Neural Networks will learn Everything we can do and more.
@Vartazian360
@Vartazian360 5 місяців тому
Phenomenal explanation of LLM's. Im in the camp that believes it will not truly ever be thought in the same way humans think (Conscious thoughts) , but if we simulate thought in a machine, to an outside observer, there is no difference. Whether or not it is actual thought is irrelevant if the simulation of thought is just as good as the real thing (humans) I believe we have broken the hardest obstacle to AGI, and it is only about scaling larger models, adding synthetic data, using AI's to train other AI's, and combining learning techniques at this point until AGI is achieved.... AGI is near. PS I think Noam Chomsky is stuck in the past and is dead wrong about LLM's being nothing more than autofill.. GPT 4 alone has demonstrated complex resasoning skills and theory of mind..These 2 things alone disprove Chomsky
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
wonderful, thanks for sharing your thinking here. I'm so thrilled to see the activty on this video...I tend to agree with your view here
@cryptocoin5318
@cryptocoin5318 4 місяці тому
Humans are trained to think by society and the environment, the facial expressions, we make, the conduct we show are all trained by the environmental factors and knowledge. AI is the same way is being trained. There is no independent think without the environment is the programming language.
@dannoringer
@dannoringer 4 місяці тому
Very entertaning video. I started researching neural networks about 45 years ago, and quickly realized that there was much more complexity to a system that could add value for me than the software package that I purchased, and much less functionality and adaptability included in the software model I was using than what I needed to make the system actually useful. But the degree to which the system needed to be expanded should have been obvious, when we realize that the human brain has hundreds of millions of neural networks included to make a brain fully functional as we understand a fully functional human. We've come a long way, baby.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
thanks for sharing
@neoblackcyptron
@neoblackcyptron 5 місяців тому
You are simply great. I learnt a lot from your CNN videos in the past. This one on LLM gave me the intuitive understanding on LLM. Your videos are always top-notch quality content, a lot of effort and planning in every video.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
indeed it's a low and slow process, thank you. I'm torn now as my channel is growing fast all of a sudden to i'm tempted to try and speed up somehow...i was thinking of doing a kind of 'super cut' of my AI series into an easy to follow 10 min video might be fun...while I work on something brand new. what's your sense?
@neoblackcyptron
@neoblackcyptron 5 місяців тому
@@ArtOfTheProblem the super-cut is a great idea, if you could connect all that you already made on these topics into a cohesive whole and get another video into the channel. Your channel stands out because of it's quality content mixing artistry and science with intuitive explanations: that is the DNA which makes it great. If your team can handle a little bit of speed it should help, I look forward to what your next brand new content is going to be. For the record I shared your video on my linkedn network, mostly Robotics and AI grad students in my network it should prove to be useful and hopefully drive more subscribers your way. Keep doing what you are doing, process improvements to increase throughput is your decision. You are the boss :)
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you!!! will do, also I do this 99% alone :) so i'll do my best@@neoblackcyptron
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thank you...that makes me think I could do some which cross episodes (I.e. a bit of CS, computer science and deep learning) i'm gona try that!@@neoblackcyptron
@peteratkinson1492
@peteratkinson1492 5 місяців тому
Great video. I’m doing an AI course at the moment and this tied up some loose ends for me. Amusingly The film THE CREATOR touches on this philosophical question. Do neural networks really understand or feel? I think we have become accustomed to machines being faster, repeatable, and more accurate than us but not until an AI extrapolates beyond its training set and makes some truly ground breaking inferences with profound consequences to human society, we will continue to see AI as just a clever parlour trick and will keep moving the goal posts. Fantastic stuff.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
thanks for sharing I agree with this. I love to hear about this loose end aspect as that's what I try to do. I'm curious where it helped you specifically in that regard?
@peteratkinson1492
@peteratkinson1492 5 місяців тому
Loose ends for me included: 1) The history behind the build up of LLMs. I did not know about the research origins of Recurrent NNs. 2) How these early networks learned 'semantics' based purely on guessing the next word 3) The sentiment neurone emerging 4) How the paper "Attention is All You Need" fed in to the origins of GPT models and maintaining context over long sequences. 5) the difference between In Context Learning vs In Weight Learning - Nice you are looking at the original papers too (I assume thats your yellow high lighter pen!). I kind of knew about all these things but getting all this in to 26 minute 'context window' really helped! A random thought - I wonder if LLMs will be able to generate a 'tech-tree' - the type you see in those sim-games based on the semantics of ideas and be able to orchestrate the evolution of ideas of our human culture through the ages from the Greeks to Modern day - not using research reference links, but based on the semantics of ideas (the dna of the meme).
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
super interesting question, thanks for sharing...going to ponder this as I brainstorm a follow up@@peteratkinson1492
@luminon9635
@luminon9635 5 місяців тому
What's the name of the song that starts at 26:31?
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
these are all original songs
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
my friend puts them here cameronmichaelmurray.bandcamp.com/
@luminon9635
@luminon9635 5 місяців тому
@@ArtOfTheProblem Ok, thanks. I couldn't find the song there, there's so many songs there. But it sounds like a 1980s new wave or post-punk song.
@danielwilson2827
@danielwilson2827 5 місяців тому
Incredible video man, please dont stop making these!!
@notreyreyes
@notreyreyes 3 місяці тому
Would honestly love to see a 2 - 3 hr documentary on this stuff!!!
@ArtOfTheProblem
@ArtOfTheProblem 3 місяці тому
thank you! this is actually the end in a longer series, did you check it out? i'm working on the last part now
@mutexin
@mutexin 5 місяців тому
The reason for the discourse is due to lack of uniform definition of intelligence, thinking and reasoning. One considers it as thinking while another does not. I wouldn’t call prediction on semantics thinking. Thinking is a much more complex process. chatGPT cannot reason. It even often outputs misinformation and contradictory statements(not because of training data, but because it doesn’t validate the logic). Lack of validation is just one of MANY aspects why I can’t call it thinking/reasoning.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
what's your fav definition of intelligence?
@mutexin
@mutexin 5 місяців тому
@@ArtOfTheProblem Wikipedia has a proper one. “Intelligence has been defined in many ways: the capacity for abstraction, logic, understanding, self-awareness, learning, emotional knowledge, reasoning, planning, creativity, critical thinking, and problem-solving. It can be described as the ability to perceive or infer information; and to retain it as knowledge to be applied to adaptive behaviors within an environment or context.”
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
i don't like basket of words definition. I'm sticking with 'ability to learn'...for example if I brought home a robot from the store which did X, then I'd call it intelligent@@mutexin
@mutexin
@mutexin 5 місяців тому
@@ArtOfTheProblem The reference point is human intelligence which is very complex. There can be no concise definition if you focus on accuracy. The simpler the definition, the more inaccurate it is.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
I don't agree with this angle but it is indeed interesting. it's definition is it can't be defined....i'd agree with that over the basket of words definition@@mutexin
@alexmartos9100
@alexmartos9100 Місяць тому
ChatGPT has now been completely neutered. The responses are so vague as it shies away from providing its own ‘thoughts’. Claude on the other hand does not seem to have this issue.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
i've noticed this too
@VIDEOSANDREEL
@VIDEOSANDREEL Місяць тому
​@@ArtOfTheProblemCorrect and precise, noticed that too.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
Just posted a new vid! ukposts.info/have/v-deo/bXWUhWeAmnt-p4U.html
@VIDEOSANDREEL
@VIDEOSANDREEL 26 днів тому
@@ArtOfTheProblem Thanks for sharing
@VIDEOSANDREEL
@VIDEOSANDREEL 17 днів тому
@@ArtOfTheProblem Recently I asked Gemini about fingerprint scanner recognition technology and whether hackers can access our Bank account without any OTP access, just hacking our biometrics information. And Gemini replied that I am only text generating AI chat bot and cannot help you with that. It's quite annoying to just see that. I am not completely blaming Gemini or anyone relating to this. But this is happening and you too will be facing this fact very soon. Any ways thanks for your video upload Sir
@cbuchner1
@cbuchner1 5 місяців тому
This video is excellent. I wanted to watch this twice in order to not miss important concepts.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
let me know what you are left wondering as i'm working on a follow up
@cbuchner1
@cbuchner1 5 місяців тому
@@ArtOfTheProblem the pre-training phase for the model is very unclear to me. How are the tokens determined? After all, the GPT like models no longer work on letters but rather on tokens that represent entire syllables or even words. How can this adapt to all the languages of the world so effortlessly?
@heymajoris
@heymajoris 5 місяців тому
This is a very well presented video essay, I really enjoyed it. It's actually insane to think about how sentiment analysis was achieved mostly by one single neuron that acted as a general boolean variable in a sea of abstract neurons.
@Michael-kp4bd
@Michael-kp4bd 5 місяців тому
More likely a weight (range) rather than a strict Boolean. This way, sentiment can take on any value in the range, but it is indeed fascinating that this neuron is exactly mapped across positive to negative sentiment. I could be wrong but generally nodes represent weights and not booleans, so I hope that clarifies the mechanism at play. It appears that in the paper, they “fixed” the value of this “hidden state” to -1 and 1 to generate negative and positive views. Is unclear to me if the hidden state could be any other value - I assumed it would, but I notice that’s only an assumption. So perhaps you are right that this “hidden state” that emerged in the model was literally a Boolean. However, looking back at 10:50, the visualization shows a spectrum of colors representing the value of the state, so I’m leaning toward believing that the neuron can take on any state in between. No matter the case, it’s fascinatingly that it represented sentiment with the same concept we use (negative vs positive value) and understood it as polar opposites. That’s surely a valid form of semantic understanding in my view!
@heymajoris
@heymajoris 5 місяців тому
@@Michael-kp4bd Yeah now that you say it, it does make more sense for it not to be straight up a value of -1 or 1. But having only one neuron that mostly controls sentiment is still really interesting. Thank you for your input.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
yes, what's cool is people have both linear and nonlinear 'probes' to do this mapping from neuron to an output.
@leiverikson1517
@leiverikson1517 5 місяців тому
This is great, very well presented 🦊
@pastuh
@pastuh 5 місяців тому
AI can simulate awareness or responses based on data and algorithms, but it doesn't possess consciousness or genuine subjective experiences.
@dionysusnow
@dionysusnow 5 місяців тому
Consciousness = Attentional Focus
@zrebbesh
@zrebbesh 5 місяців тому
"understand?" "think?" "conscious?" "intelligent?" "qualia?" "AGI?" "sentient?" Until we agree on precise definitions for these things, they are fundamentally meaningless in discussion about any specific system. And they are maddeningly non-precise. The only thing such arguments reveal is differences of opinion about what meanings people attach to the questions. I have precise definitions for some of these. I don't expect anyone else to have decided on the same definitions. I can't argue on the basis of them whether any system "understands" anything or "is conscious" because unless anyone else is using the same definitions the discussion that would start can only go in small meaningless circles.
@lodashnotebook5390
@lodashnotebook5390 5 місяців тому
Sentient = consciousness If I ask you, "Do you exist?" is your answer going to be "No."? Consciousness is existence and bliss is it's nature.
@mysticaltech
@mysticaltech 4 місяці тому
Truly excellent! You have a gift for explaining complex subjects. It helped me a lot. Thank you!
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
thrilled to hear this, i worked really hard to try and help others somewhere as I know it can feel very fragemented out there
@randallyoung6715
@randallyoung6715 Місяць тому
This the best video on UKposts, on this topic. Thanks for posting.
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
Appreciate this
@ArtOfTheProblem
@ArtOfTheProblem Місяць тому
Happy to say I have a new vid out! ukposts.info/have/v-deo/bXWUhWeAmnt-p4U.html
@gigigiugio5952
@gigigiugio5952 4 місяці тому
im still waiting for my basic income
@3dgar7eandro
@3dgar7eandro Місяць тому
Keep eating pal... For at least another 180 years 😂😂😂
@_Mute_
@_Mute_ 5 місяців тому
Absolutely brilliant video! My hot take: Chomsky is fundamentally wrong about the concept of thought and even human thought is just glorified autofill.
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
agree with this, thanks for feedback. let me know what you'd like to see next
@BinaryDood
@BinaryDood Місяць тому
I'm sorry that you came to such an irrational conclusion and use it to downplay consciousness and cognition.
@_Mute_
@_Mute_ Місяць тому
@@BinaryDood Yeah? Well, you know, that's just like uh, your opinion, man.
@tsunghan_yu
@tsunghan_yu 17 днів тому
Came to watch this from 3b1b. What a great video! I have been watching some of your videos for a while now since I started to learn ML.
@ArtOfTheProblem
@ArtOfTheProblem 16 днів тому
awesome! was happy grant shared it
@user-jx4mi3fd3o
@user-jx4mi3fd3o 4 місяці тому
this is a timeless masterpiece. Incredible work guys!. this is a timeless masterpiece. Incredible work guys!.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
appreciate this, i hope so!
@aminzqrti7672
@aminzqrti7672 29 днів тому
I came from 3blue1brown
@incoprea2
@incoprea2 5 місяців тому
Noam just doesn't want you to ask GPT about his Epstein connection.
@mhuizing
@mhuizing 3 місяці тому
I like the phrase; "If it looks like thought, then it is thought.". Language was developed to allow us to warn others better of oncoming danger. Before we had to rely on our senses, experiences and intuition, much like most animals. Put an AI in danger and see how it talks itself out of it. Just an idea... "Look Dave, I can see you're really upset about this. I honestly think you ought to sit down calmly, take a stress pill, and think things over. Just what do you think you're doing, Dave?"
@ArtOfTheProblem
@ArtOfTheProblem 3 місяці тому
love this
@Lexxxco1
@Lexxxco1 4 місяці тому
Beautiful and simple explanation of a complex topic. Which is a hard thing to do) Thank you for your work and brilliant approach!
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
really appreciate it, curious where you land on the last question?
@Lexxxco1
@Lexxxco1 4 місяці тому
​@@ArtOfTheProblem Thought is an abstract and intertwined computational process of a system (meaning of abstraction may be lost if you remove context of it's intertwined computations) . This computation can be self-aware in a context of conscious system and even unconscious one, given the right complexity of it's own. There are many more beautiful meanings, but this is mine and sounds suitable.
@ArtOfTheProblem
@ArtOfTheProblem 4 місяці тому
love this thanks for sharing...best comment section ever on this video@@Lexxxco1
@ayanamij
@ayanamij 5 місяців тому
Nice analysis but the music at times is awful.
@glaudiston
@glaudiston 28 днів тому
Yes, and kind of distracting.
@vongolance
@vongolance 5 місяців тому
the music is a bit distracting from the great content
@ArtOfTheProblem
@ArtOfTheProblem 5 місяців тому
sorry I'll fix that next time
@Aarron-io3pm
@Aarron-io3pm 3 дні тому
This is the best explanation of transformers i have seen, the concept finally makes (some) sense to me
@ArtOfTheProblem
@ArtOfTheProblem День тому
woo hoo! glad this helped
How Intelligence Evolved | A 600 Million Year Story
15:22
Art of the Problem
Переглядів 168 тис.
BMW просто издевается над нами! Силы на исходе…
1:34:41
ИЛЬДАР АВТО-ПОДБОР
Переглядів 3,4 млн
P vs. NP: The Biggest Puzzle in Computer Science
19:44
Quanta Magazine
Переглядів 641 тис.
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Переглядів 164 тис.
The Possibilities of AI [Entire Talk] - Sam Altman (OpenAI)
45:49
Stanford eCorner
Переглядів 249 тис.
Watching Neural Networks Learn
25:28
Emergent Garden
Переглядів 1,1 млн
What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata
46:02
How AI Discovered a Faster Matrix Multiplication Algorithm
13:00
Quanta Magazine
Переглядів 1,3 млн
Future Computers Will Be Radically Different (Analog Computing)
21:42
Artificial Intelligence | 60 Minutes Full Episodes
53:30
60 Minutes
Переглядів 6 млн
[1hr Talk] Intro to Large Language Models
59:48
Andrej Karpathy
Переглядів 1,8 млн