A.I. teaches itself to drive in Trackmania

  Переглядів 4,890,878

Yosh

Yosh

3 роки тому

A.I. teaches itself to drive in Trackmania, using NEAT algorithm, which is a particular type of Genetic Algorithm. This algorithm is used to select a neural network with optimal weights, and also an optimal structure.
Thanks Trabadia ! His UKposts channel : / trabadia1
More information about NEAT algorithm :
neat-python.readthedocs.io/en...
Contact :
Discord - yosh_tm
Twitter - / yoshtm1

КОМЕНТАРІ: 4 300
@yoshtm
@yoshtm 3 роки тому
Thanks for watching this video ! This is the first time i'm using NEAT algorithm, so there is obviously still room for improvement. The main problem is that my AI doesn't have a map memory, and can't anticipate "what comes next" with its current inputs. I have some ideas to improve my AI, so don't forget to subscribe if you want to see the next steps of this project ;)
@theracerdude
@theracerdude 3 роки тому
Could you in the later generations select the ones that traveled the shortest distance at the last checkpoint? This would make them use the racing line more closely wouldn’t it? And then you could pick out the ones that don’t hit the walls. Not sure if that’s at all possible but it would make sense to me
@Fantaman900
@Fantaman900 3 роки тому
in this vid i saw fast ai that started bouncing of the walls at a later point getting over taken by a slower ai. to create fast ai i needs map knowledge and learn that racing that keeps as much speed as possible and a bump on the wall slows you down. So basically all ai need to know what is faster in much smaller sections than a few checkpoints and then combine them to fastest to checkpoint. it's not going to be easy to create fast driving self learn ai.
@cymbalmonkey8360
@cymbalmonkey8360 3 роки тому
@@theracerdude also if the cars are punished for hitting a wall they will make corners without hitting the walls while also taking the shortest route
@froozynoobfan
@froozynoobfan 3 роки тому
neat optimizes the topology of the model, if i remember it correctly you don't train the weights of the model. ai.googleblog.com/2019/08/exploring-weight-agnostic-neural.html github.com/google/brain-tokyo-workshop/tree/master/WANNRelease/prettyNEAT
@miggy3828
@miggy3828 3 роки тому
neat
@21thCenturyFrog
@21thCenturyFrog 3 роки тому
Trackmaina in this form is a quite sophisticated liquid simulation.
@verrtex7837
@verrtex7837 3 роки тому
13:32 best part
@AkariInsko
@AkariInsko 3 роки тому
@@verrtex7837 thats trippy
@randomneko9
@randomneko9 3 роки тому
Totally not kidding. I bet if you layered more hough level runs at increasingly delayed start times it would appear even more accurate
@vakqdev
@vakqdev 2 роки тому
Fluid simulation*
@Arcsinx
@Arcsinx Рік тому
You should watch the trackmania 20k project from l4bomba
@babiilabilux
@babiilabilux 3 роки тому
Can't wait for the implementation of the brakes in order to see the AI drift !
@Bobba8590
@Bobba8590 3 роки тому
AI
@evgeny8578
@evgeny8578 3 роки тому
Well drifting will be cool but in these types of races like Formula, drifting will be totally useless. But yea will look cool
@masnysunshine
@masnysunshine 3 роки тому
@@evgeny8578 In sharp some sharp corners drifting is faster than releasing acceleration.
@masnysunshine
@masnysunshine 3 роки тому
ukposts.info/have/v-deo/pH6Bf4GIsK2rpoU.html
@floding22
@floding22 3 роки тому
@@evgeny8578 this is trackmania not a simulator so basically there’s a technique called speed drift by holding brake from about half a second while turning the car enters a drift where it gains speed, I was weird game physics but it’s a part the games identity now
@khalidal-mohammed5861
@khalidal-mohammed5861 2 роки тому
Try this: if the car hits the wall, then remove a point. This should make the AI learn faster because hitting the walls will let them learn to not hit walls.
@2DReanimation
@2DReanimation 2 роки тому
Or just kill them off if stopped. A little nudge can even be better than taking a turn slower to not hit the wall.
@khantatat
@khantatat 2 роки тому
Wrong. The GA will converge to *NOT ACCELERATE AT ALL* because you would maximise your points.
@tedocor9395
@tedocor9395 2 роки тому
@@khantatat If it doesn't accelerate, remove a point, problem solved. Acceleration - +1 increment Hits a wall - -1 Doesn't accelerate - -1
@khantatat
@khantatat 2 роки тому
@@tedocor9395 this would be better, it has an incentive to accelerate. Please note that, if feasible, it might converge to a circular path, which would represent a very appealing local maximum
@tedocor9395
@tedocor9395 2 роки тому
@@khantatat Then in this scenario, there should be a limited time for the AI to reach the finish line, this would then force the AI to find the fastest way, without hitting a wall to get to the finish line. Going in a circular pattern, ziz zag motion or going as slow as possible would mean failure for the AI.
@mrshadow4007
@mrshadow4007 2 роки тому
Of course Trabadia can help with developing a TrackMania AI. He's got lots of experience using tools in runs.
@xalat6277
@xalat6277 2 роки тому
Ah a man of culture I see
@sephikong8323
@sephikong8323 2 роки тому
I am mad that because he did such a reckless thing now his name will forever be tainted, he's no longer "one of the best players of the game", he's now stuck as "the guy who cheated". You ruined yourself and for what ?
@Seraphim262
@Seraphim262 2 роки тому
@@sephikong8323 For the worldrecord.
@sephikong8323
@sephikong8323 2 роки тому
@@Seraphim262 A record that he should have known he would have gotten caught for at some point, it's not like he played a rng heavy game and he just cheated by creating the exact odds he wanted, Trackmania is like Doom, it's very easy to access your run and replay it for scrutiny and any discrepancies can be found very easily. The only reason it took so long was because people didn't think it was necessary since he was excellent and had a good reputation and grinded so they trusted him. He himself destroyed that trust, it was certain that it would get found out someday somehow, he could have obtained the record the legit way if he kept grinding instead of taking the easy way out, now he pays the price
@dab_boy1024
@dab_boy1024 2 роки тому
That’s a oof moment
@scottcarr5014
@scottcarr5014 2 роки тому
One of the big differences between the human driving and the AI is information available. The human learns the layout of the track and optimizes each turn for the next. The AI is only given information about what it can see at any given moment. In other words, the AI is effectively driving the track for the first time every time.
@hanbanaroda
@hanbanaroda 2 роки тому
Also, human player sees much more further down the track (curves on the horizont), this AI is limited in its visual field.
@dr_j0nes
@dr_j0nes 2 роки тому
Yeah thats exactly the point. His AI stops to progressing because it lacks of Input. He need to let the AI see further to let it predict curves and such better. Also the current speed, acceleration and other factors need to be inputs to the learning algorithm.
@SirusStarTV
@SirusStarTV 2 роки тому
Each car needs its own camera for vision to see far
@sethmath2778
@sethmath2778 2 роки тому
And the ai should take the longest visible straight-shot instead of just zig-zagging through the entire track.
@HesderOleh
@HesderOleh 2 роки тому
@@sethmath2778I think that is happening because it isn't anticipate what comes next, so even if it zigs when it is supposed to zag half the time the selection function will just take the ones that got lucky that time and put them into the next generation.
@antonkremov1827
@antonkremov1827 3 роки тому
Alternate title: Weird water learns to flow efficiently on racing track.
@NightmareCourtPictures
@NightmareCourtPictures 2 роки тому
There is science that does imply, that quantum mechanics follows evolution in the form of something you've all heard before : Path of Least resistance...that this path is actually an evolution process formed by the arrangement of molecules shifting around to settle into the most efficient configuration. There is a famous experiment with the optimal configuration of bubbles...how "most" of the time they settle into the most optimal configuration...but sometimes they don't...and so it's been the study of how evolutionary processes is how nature optimizes all problems not just biology :)
@nathanmennel
@nathanmennel 2 роки тому
The way that first car morphs at 13:33 is amazing. This was extremely visually interesting
@didjoww
@didjoww 11 місяців тому
looks like some breakcore typa visualizer
@Enkrod
@Enkrod 2 роки тому
Trabadia is very well suited to assist a tool, as he has received so much help from a tool assistant himself. :p
@dildrawz5338
@dildrawz5338 2 роки тому
Was looking for a comment on Trabadia lol
@tery3836
@tery3836 Рік тому
Yeah trabadia is really good for real trabadia is amazing
@Tollerah93
@Tollerah93 3 роки тому
20 generations in and I'm still the car hitting the wall at the start line.
@dfdempire8912
@dfdempire8912 3 роки тому
Genetic mutations. As we evolve ressesive traits can sprout causing the mutation. It's based of survival of the fittest but has bad traits. Over time there will be less and less until it's no more. It's literally evolving and changing it's (DNA)
@Tollerah93
@Tollerah93 3 роки тому
@@dfdempire8912 what? I'm aware of how this is structured. I was making a joke.
@707beats6
@707beats6 3 роки тому
@@Tollerah93 lol
@Akimo250
@Akimo250 3 роки тому
@@dfdempire8912 Jokes. You know what jokes are, do you?
@everythingoutdoors3699
@everythingoutdoors3699 3 роки тому
Isn't it strange how this looks just like flowing water.
@JavierSegura
@JavierSegura 3 роки тому
Nah, we live in a world of mathematical patterns so reality is basically just a very advanced AI.
@ElPoyoDiabolo1
@ElPoyoDiabolo1 3 роки тому
Agreed ... it's a little bit hypnotizing as well :)
@alexalekos
@alexalekos 3 роки тому
it's strange how despite his awefull pronounciation you can still make out what he says
@marinlenfant6976
@marinlenfant6976 3 роки тому
Or like insect
@Soundlmpact
@Soundlmpact 3 роки тому
which means water is AI, therefore universe is AI
@pyramidbuilder
@pyramidbuilder 2 роки тому
Hey Josh! Your AI driving algorithm reminds me of water flowing down a tube... which is almost the opposite of how F1 drivers drive; as they hug the corners rather that rebounding off the opposite walls. Really interesting experiment! Thanks for sharing.
@doobzb5482
@doobzb5482 2 роки тому
If a high pressure water source was blasted into a tube that was well constructed for minimum resistance, it would follow an f1 drivers route wouldn't it?
@stig3914
@stig3914 2 роки тому
My only issue with generations (with know experience and just watching youtube) is you see a good contender that isnt the fastest (e.g. doesn't hit a wall but comes second) and it gets scrubbed - in a generation or two might acctually take over the current wall smashing leader.
@TheSmiddy
@TheSmiddy 2 роки тому
That's why you usually only cull the worst 50% instead of the 99% that didn't win.
@gosupersheep2006
@gosupersheep2006 2 роки тому
The term for what youre describing is a local minima/maxima - the algorithm effectively gets 'stuck' within the search space. Its why, as the other commenter says, you dont kill all but the winner, but its also the reason for random mutations being introduced!
@diabl2master
@diabl2master 2 роки тому
@@TheSmiddy Oh, is he culling the worst 99%?
@klobiforpresident2254
@klobiforpresident2254 2 роки тому
@@diabl2master One assumes not as that's basically never done.
@adithyaarun9981
@adithyaarun9981 3 роки тому
"The circle strategy" made me giggle
@warpmonkey
@warpmonkey 3 роки тому
If you were to introduce a stronger penalty for hitting the wall, such as ending the run right there and not letting it progress, would a stronger rule like that ensure the 'gene' for clipping the walls was removed?
@rookbrood
@rookbrood 3 роки тому
I was thinking the same thing, if you use a combination of factors for the fitness function it can learn better behavior.
@jackw1252
@jackw1252 3 роки тому
Exactly what I was thinking. A sort of ‘punishment’ for the AI for either hitting the wall or not reaching a specific checkpoint by a certain time
@YouHaventSeenMeRight
@YouHaventSeenMeRight 3 роки тому
I was about to suggest this myself. It seems to me that the AI is held back by the fact that it thinks that hitting the wall is a valid strategy to make a turn, whereas we humans know that this is not the case.
@Cuahchic
@Cuahchic 3 роки тому
A really straightforward solution is just to add a time penalty if it hits the wall, so total time is duration + penalties and optimise total time.
@martinum4
@martinum4 3 роки тому
I would rather add a penalty after 1 second of wall contact, depending on the vehicle it might be easier to implement a course correction via controlled crash
@brandonfortner3646
@brandonfortner3646 2 роки тому
I love how you explained all the science behind it, it allows me to understand how these AI work on a conceptual level
@antonnym214
@antonnym214 2 роки тому
Super-interesting! I found you by accident and subscribed immediately. Thank you! Very smart stuff. I love it.
@MisterFilOfficial
@MisterFilOfficial 3 роки тому
This is exactly how water flows trough pipes. Should we try to put a genetic algorithm on water drops to tech'em flow better? 🤔
@nif4345
@nif4345 3 роки тому
Yes
@noahm44
@noahm44 3 роки тому
Popular boy.
@hugoantunesartwithblender
@hugoantunesartwithblender 3 роки тому
This is not like water flow at all. Doing a water simulation is completely different of this ai
@theroninrenofficial9058
@theroninrenofficial9058 3 роки тому
So. We actually already do. Electrical and audible waves are used in many purification/production processes.
@theroninrenofficial9058
@theroninrenofficial9058 3 роки тому
@@hugoantunesartwithblender I mean. It kind of is though
@nak_attak
@nak_attak 3 роки тому
This just goes to show that even in a world where geniuses are all around you, some idiots decide to bash their heads on the wall instead
@sulosky
@sulosky 3 роки тому
Enough idiots banging their heads will finally get through.
@Redbikemaster
@Redbikemaster 3 роки тому
@@sulosky brute force method
@Mansory811
@Mansory811 3 роки тому
@@sulosky thats deep
@quantumsoul3495
@quantumsoul3495 3 роки тому
It's more people born without legs
@diogohernandez66
@diogohernandez66 2 роки тому
Interesting video and amazing effort! Many hours spent, nice work man!
@alhdlakhfdqw
@alhdlakhfdqw 2 роки тому
great video Yosh thank you very much for your great effort! subbed :)
@wouter8596
@wouter8596 3 роки тому
The last clips are basically a fluids simulation 😂
@CallumAtwal
@CallumAtwal 3 роки тому
The first bit of the track looks like a sink trap/u-bend 😂
@blue.dt2000
@blue.dt2000 3 роки тому
I thought the same
@enzozeb
@enzozeb 2 роки тому
Left, Right and Floor it. The only three inputs a true racing car ever needs.
@xcruell
@xcruell 2 роки тому
Oh and the handbrake of course, for drifting really sharp turns with style and speed.
@Raterex
@Raterex 2 роки тому
I prefer the bang bang bang => wasted strategy =D
@manz7860
@manz7860 2 роки тому
@@xcruell it wouldn't be as efficient so it'd probably cut all the nonsense out
@reubenbetts7537
@reubenbetts7537 2 роки тому
@@manz7860 have you played trackmania before?
@ke6gwf
@ke6gwf 2 роки тому
You can get rid of one of those inputs in Nascar. Makes it simpler... Lol
@overratedprogrammer
@overratedprogrammer 11 місяців тому
There needs to be more content like this. I would literally watch videos of every single popular game even though it's the same algorithm
@sycips
@sycips 2 роки тому
Impressive that you created this for Trackmania! Two major things you could have added are penalizing bumps, which slow down the car, and make the AI able the break, which is necessary in some tracks. Even though this isn't added, it was fun to see what your results were. Great job!
@Jagd204
@Jagd204 3 роки тому
"-Hey Terminator, how did Skynet ever become so powerful?" "-Gaming."
@siriusczech
@siriusczech 3 роки тому
This reminds me of how they let supercomputer play a Civilisation game against normal AI. You know, the game that you can win by many possibilites - Space race, Culture race, Political race, Technology race, use diplomacy etc.... or go tryhard and conquer most of the world. The supercomputer was learned those rules, possibilites of win, conditions of technologies ... and after some calculations he entered an absolute warmonger scenario, flooded the map with his units and crushed every opposition he ever faced. It was ... disturbing to say at least and worrying to see a "normal AI" thinking of such a result and then to execute it brilliantly. Don´t know the algoritm used or any details, it just stays in my mind as a memory of an article i once read
@icollectstories5702
@icollectstories5702 3 роки тому
_WarGames_ the 1983 movie. To a computer, reality is a simulation.
@alexejfrohlich5869
@alexejfrohlich5869 3 роки тому
​@@siriusczech you may want to account for the artificially created environment and the limits it provides the AI to deal with, introducing a general bias towards warmongerism as the best strategy in this context. This might be a general bias in the game system (war is always stronger than any other type of progress towards victory) OR the lack of collaboration benefits. Which are always there as long as the ultimate goal is a "winner takes it all" mentality. The winning conditions of Civ are that you need to be the guy on top. The first one. So collaborating is only worth if you are still the one getting out of it on top which deminishes the whole purpose of collaboration in the long run. The game's system and winning conditions would need to be adjusted if you want the "true" AI (not the algorithms just called "AI") to come out on top without killing everyone :D
@siriusczech
@siriusczech 3 роки тому
@@alexejfrohlich5869 it strongly depends on the nationality of AI - some civs have significant bonuses there to be able fo fulfill other types of victories (tech race or cultural race is one of them) and this thing doesn´t require too much giving up of any other strategy gameplay - like that you couldn´t build a strong or high tech army in the first place. The issue there was that no matter the nation, no matter those conditions, it ALWAYS flooded the map with tons of cheap units, somehow achieving the victory despite it pissed of every nation one after another and it didn´t care for diplomacy or other things neither as much as you would think it will be - just a brutal 1000 turns raid on barbarians, trading only something and with strentgh in counts it defeated even those that it (based on numbers) shouldn´t defeat. And that was interesting on it whole - that perfectly "thought-through" assault is probably the easiest and most viable strategies even in this world, no matter other facts. That the problem with humans is that even dictators cannot wage war against such computing force.
@alexejfrohlich5869
@alexejfrohlich5869 3 роки тому
@@siriusczech might be that there are bonuses, but it still looks like waging war is just the "best" strategy in this game system. also it is most likely the easiest accessable. if the AI is beating everything by spamming units also exposes a general flaw in the game system that was there the whole time. the AI is just taking advantage of it. so it looks like "killing" is the best strategy for the AI but actually, it is the best strategy within this artificial system. the AI just makes it clearly visible.
@isaie7144
@isaie7144 3 роки тому
En 0.0005 secondes, j'ai compris que j'avais affaire à un français haha
@camaradeKC
@camaradeKC 3 роки тому
La mm mdr 😂
@rbpt__4136
@rbpt__4136 3 роки тому
Pareil
@auxencefromont1989
@auxencefromont1989 3 роки тому
+1
@zedestroyer44
@zedestroyer44 3 роки тому
Mais tellement 😂
@Elon_Muscle_
@Elon_Muscle_ 3 роки тому
C'est marrant parce qu'aucun coms anglais mentionne son accent, les seuls qui en parlent c'est nous même x)
@Dragonmastur24
@Dragonmastur24 2 роки тому
your combined run clips remind me of how a light pulse spreads out in a fiber optic line! a pretty good representation if ya ask me :D
@MrPooPooJohn
@MrPooPooJohn 2 роки тому
This is mind-blowing btw. Great work.
@TheCodingTrain
@TheCodingTrain 2 роки тому
Watching this video and zoinks there I am! Amazing work!!
@yoshtm
@yoshtm 2 роки тому
Thank you so much !! I loved your video series on genetic algorithms, it helped me a lot in the beginning ! Very happy you came across this video :D
@brad3262
@brad3262 2 роки тому
@@yoshtm the
@gamefun2525
@gamefun2525 2 роки тому
@@brad3262 th
@ventrys
@ventrys 2 роки тому
@@gamefun2525 ok
@DarthMaulOfficial
@DarthMaulOfficial 2 роки тому
@@yoshtm This is a lame video ai has been able to do this since the 1970s
@vinod8941
@vinod8941 Рік тому
Along with the wall distances and car speed you should also add the car direction as an input. This should help in turns significantly. Also if you take vectors across 180 degrees for measuring wall distances, it can increase the accuracy of the algorithm.
@jamescarroll489
@jamescarroll489 2 роки тому
Great video, structure and explanation!
@Barde_Jaune
@Barde_Jaune 2 роки тому
The French is strong with this one.
@akmedia8206
@akmedia8206 2 роки тому
Algorizzum
@M-Cube_
@M-Cube_ 2 роки тому
It's funny, coming from a language where the 'h' is silent, he puts in so mich effort to pronounce words like "how" right that he even does it with words like "hour" - where it's actually silent in english as well! :-D (no offense, just something I noticed)
@Brabldibrablmann
@Brabldibrablmann 2 роки тому
He sounds like he had 1 hour to learn english pronounciation and then had to read the script
@onclebob
@onclebob 2 роки тому
Oui oui
@valovanonym
@valovanonym 2 роки тому
@@Brabldibrablmann trust me, even after years of speaking english, my prononciation isn't much better
@IAMDIMITRI
@IAMDIMITRI 3 роки тому
So the cars don't seem to be able to predict a turn. If you want them to be able to predict a turn you need to eather increase the resolution, so that AI would be able to see an oncoming turn. Or increase the neural resolution both to allow cars to process different turn radius and temporal resolution so that cars can hold and remember certain turns. Small or wide turn do look similar to the AI and AI needs a way of distinguishing between them. That's the difference between simple stimuli respondent AI and another one that can better generalize the problem.
@Obi-WanKannabis
@Obi-WanKannabis 3 роки тому
plus in a track you may have to tackle a corner differently based on what corner immediately follows it, not sure how to do that here though
@IAMDIMITRI
@IAMDIMITRI 3 роки тому
@@Obi-WanKannabis I'm sure that with bigger network AI could simply learn the map. Or learn how to learn the map.
@MikkoRantalainen
@MikkoRantalainen 3 роки тому
I think an easy way to provide "memory" of track layout would be to give it list of vectors from the center vector towards the way track is going. For some tracks, the best curve through a corner depends on two following corners. If you don't provide even a rough info about those corners, the AI can never excel. With the current input, the best that AI can ever do in theory is to play at roughly level of human driving any map for the first time.
@harmenoosterhof9604
@harmenoosterhof9604 3 роки тому
Wouldn't it be easier if you could somehow say "The AI driver hit the wall, eliminate this AI driver." And then just measure time at the checkpoints? Base your fitness level on the checkpoint times. That way the AI would have to find the fastest way to a checkpoint without hitting a wall.
@IAMDIMITRI
@IAMDIMITRI 3 роки тому
​@@harmenoosterhof9604 I think there might be evolutionary peaks and valleys. As in: AI have to do something unproductive for couple of generation before it can improve. As in AI has to drive close to the wall without hitting the wall, I bet that most would hit the wall and get lower score thus those genes not surviving at all. And even if you manage to train AI to hug the wall. It still needs to know what wall to hug. So you need deeper network to learn and predict corners and hug walls at the optimal time.
@TechWithKishan269
@TechWithKishan269 2 роки тому
Bro, this guy. I work in IT and still don't understand how you mad mad mad genius code this stuff! Great video!
@MalakaiPrograms
@MalakaiPrograms 2 роки тому
This is a great video. Very informative, and delivered amazingly. FUCK YEAH!
@EvilTaco
@EvilTaco 3 роки тому
I feel like the reason they're not performing as well is that they're very limited in what they can see. They can only see the walls right in front of them, so they can't think ahead for the next curve and account for it, which is why they always run into that one wall in the curve
@Cuahchic
@Cuahchic 3 роки тому
This is where better features would help. You can see in the long straight sections the car seems to travel at a weird angle, this is because the distances to the wall aren't relevant for the straight, the next corner information is. Is there a GitHub repository for this code?
@dubtor
@dubtor 3 роки тому
seems like the the measured input parameters (wall dists and speed) are reaching its limit regardless of the number of future iterations. Certain curves or curve combinations look "same" to the AI whilst in fact the AI should understand that they are not the same ahead of time (by measuring other/additional parameters). Because there is a limited number of curve types and thus combinations of them in TM one could try to make the AI "see" which ones they have at hand and learn accordingly. Imo this way the record of the reference driver may be broken. Also does the AI steer inputs between 0-100% or always 100%? Maybe this adds extra friction and therefore slowdown?
@amicaze9570
@amicaze9570 3 роки тому
@AE Templates Rather than making it map specific, a better solution would be to give a line of sight that correspond to different track pieces. For instance if you use circles and curves as LoS, the AI will be able to see past the bends into the turns. The good thing is that with this kind of learning algortihm, you shouldn't need to do complicated stuff, the AI should figure it out on its own.
@Buglin_Burger7878
@Buglin_Burger7878 3 роки тому
@AE Templates Well actually it is we can see over the walls, even without know the turns ahead of time you can see the track ahead of you. If you want the test to be fair... you'd need to recreate the track with walls a human couldn't see over... in which case you'd see humans advance similar to the AI. Slamming into the walls or going too slow... while we'd learn "faster" we'd learn basically the gap between generations. In other words, comparing AI to humans here is completely unfair as we have two different tracks.
@Trygeve
@Trygeve 3 роки тому
hi taco.
@benjikrafter
@benjikrafter 3 роки тому
There’s a bunch of ways to “outsmart” genetic algorithms by teaching certain skills you previously knew were important before teaching the primary goal. For example, teaching not to hit walls or turning as little as possible. Using this you’ll have a more refined base AI to learn the track. I’d love to see you try something like this again using this method!
@TheRealMeatwad
@TheRealMeatwad 3 роки тому
I wonder if a directive to simply have the maximum amount of space between the front of the car and the wall directly in front would give the best result.
@sakethram6496
@sakethram6496 3 роки тому
@@TheRealMeatwad That would be very inefficient....it isn't how apex of a corner works
@TableEnMousse
@TableEnMousse 3 роки тому
Yeah, part of thinks it would have been better if the simulation was stopped as soon as the car hit the wall; although I know nothing about A.I learning. That's just a hunch.
@sciencemanguy
@sciencemanguy 3 роки тому
@@TheRealMeatwad Congratulations, the car will now stay still and not move!
@TheRealMeatwad
@TheRealMeatwad 3 роки тому
@@sciencemanguy, obviously you also use other parameters for accelerating while the distance part is used to to determine facing, not speed. But I didn't clarify that in the original post so I guess I deserve that.
@kamikeserpentail3778
@kamikeserpentail3778 2 роки тому
It's always neat to see these. And wow, I recognize some of this music.
@jakobleck8600
@jakobleck8600 2 роки тому
Great video! Suggestion: The decision making input that is optimized is trained against a global constraint (distance travelled or time to checkpoint), whereas the effect of each steering input is only over the next few seconds. Maybe evolution might run faster or yield better results if you train the decision process with a fitness function that depends on those short-time effects, e.g. the average velocity over a certain short time interval?
@Avetho
@Avetho 3 роки тому
I love how at the end following Trabadia's car its like he's escaping the tidal wave of AI like its an action movie or something XD
@nomadMik
@nomadMik 3 роки тому
I thought the same thing. And the ones that turn around always make me laugh. 🙂
@Kharnellius
@Kharnellius 3 роки тому
Would make for an intense game, especially if you had a mini turret on the back.
@btf_flotsam478
@btf_flotsam478 2 роки тому
It almost looks like he's playing it in slow motion.
@HIkaruSashimi
@HIkaruSashimi 2 роки тому
It's like his car is outrunning a zombie horde.
@McNab1986
@McNab1986 3 роки тому
I feel like not using wall hits as an elimination category was an oops moment
@whatelsula
@whatelsula 3 роки тому
I don't know. AI who learn to follow the wall will progress, a contrario of AI doing circles.
@ziero1986
@ziero1986 3 роки тому
@@whatelsula Could have also made 'backwards progress' on the track an elimination category too. This way they AI would have learned to not hit walls and/or go the wrong way on the track.
@whatelsula
@whatelsula 3 роки тому
@@ziero1986 Yes, but maybe not for the first generations, I think.
@anthonycarless1748
@anthonycarless1748 3 роки тому
Yep, even the fastest in generation 100 still slammed into that wall before the chicane
@Solaris428
@Solaris428 3 роки тому
Didn't you wonder wether or not it was a good strategy? Maybe it makes you faster overall even if you lose some speed the moment you hit the wall.
@milehighgambler
@milehighgambler 2 роки тому
Keep at it man!!! You can do some impressive stuff with AI if you get it down. You got yourself a new subscriber
@megamax1932
@megamax1932 2 роки тому
This was the video that got me into track mania
@WwZa7
@WwZa7 3 роки тому
"Turn left, turn right, and accelerate" Robots don't need brakes nor breaks.
@abrightguy508
@abrightguy508 2 роки тому
Acceleration means the rate of change of speed which also includes slowing down
@ImyoCello
@ImyoCello 2 роки тому
@@abrightguy508 there is much more than that into obtaining the best track time, you need to be able to slow at a faster rate than the one given to you by the game wich is basically lack of acceleration and how fast the game brakes on its own, because the doesnt use physics, it uses numbers, so its a given break strenght number, you are able to increase that number, thats an action, so yeah, if AI cant even do that its missing quite a lot already. This assuming this is all real and not just show, but hey, its fun to watch regardless.
@lzfix8188
@lzfix8188 3 роки тому
Damn every episode is better than the other, this project is just too cool man, keep it up! :D
@henryshadow7217
@henryshadow7217 6 місяців тому
I love this series
@justarelaxednerd3520
@justarelaxednerd3520 2 роки тому
En tant que français, j'ai tout de suite reconnu d'où tu venais ptdrr Superbe vidéo sinon, et très beau travail !
@jakeb5121
@jakeb5121 2 роки тому
13:33 this just makes me think about how long this must have taken to render
@sethmath2778
@sethmath2778 2 роки тому
Long enough 😂
@ShiroGojo
@ShiroGojo 3 роки тому
Le bon accent de Vendée on le sent on l'entend il fait chanter nos tympans :x
@Daneri42
@Daneri42 3 роки тому
je me disais aussi que yavais un ptit goût de brioche 😅
@ShiroGojo
@ShiroGojo 3 роки тому
@@Daneri42 Ah la c'est la gâche carrément :v
@labonisk6504
@labonisk6504 3 роки тому
Mdr x)
@Daneri42
@Daneri42 3 роки тому
@@ShiroGojo miam
@clementgodard
@clementgodard 3 роки тому
Plutôt bocage où littoral ?
@Ben.N
@Ben.N 2 роки тому
Amazing video. Inspirational 😎
@mmdhd9667
@mmdhd9667 2 роки тому
Map knowledge is also really important becasue it determains how to approach a curve . Its good to generelize the curves and the relationship between them
@diariaking2476
@diariaking2476 3 роки тому
The AI could probably do better if there are more “sight” lines clustered toward the front allowing it to make more precise movements.
@jouniranta-puska4699
@jouniranta-puska4699 3 роки тому
I was about to say this as well. As it is now, I feel like it's handicapped compared to a human.
@camilohurtadoacero7233
@camilohurtadoacero7233 3 роки тому
yes, AI resolution of data is to weak. But at 11:45 of the video, the AI makes a corrective movement to the left, because he wants to be in the center of the track, not on the fast lane. You could prioritize the ahead distance so AI dont make to much of an anticipate move.
@hotel_arcadia
@hotel_arcadia 3 роки тому
@@camilohurtadoacero7233 That's because the AI was trained to just finish the track, not get the fastest time.
@teslamodel314
@teslamodel314 3 роки тому
I wonder if it remembers the next turns. If not, it will never improve as much. It just learn to drive in a track as if it is the first time it drives it. I think Trabadia drove the map some times before realizing the best time. Is it true? How many times did he try?
@frenchgeek1306
@frenchgeek1306 3 роки тому
ukposts.info/have/v-deo/sYp3jWuRgaqY2oE.html
@remink3092
@remink3092 3 роки тому
I love this serie of videos ! If you continue to do this type of videos, could you one day make a sort of "making off" video where you show us more about the code, the process to create these scenes with 100 cars driving on the same course or the way you create new AI to a next generation? I would love to learn more about this subject ! Ps: french team, lets go !
@nezziaktm9922
@nezziaktm9922 3 роки тому
Watch the videos of code Bullet
@eMgotcha77
@eMgotcha77 3 роки тому
Is it a mod for TrackMania ? How can he control so many cars in TrackMania?
@PapaBass
@PapaBass 2 роки тому
The clips at the end look like some liquid going down a tube. Thats super satisfying!
@smallejector5266
@smallejector5266 2 роки тому
The best AI explanation I have seen, makes it very understandable, thank you.
@M8cool
@M8cool 3 роки тому
13:32 was so trippy!
@bucketlynx
@bucketlynx 3 роки тому
Bro I can just sense the French in his English
@metalvideos1961
@metalvideos1961 3 роки тому
you have to be deaf not to hear that
@JakSpate
@JakSpate 3 роки тому
@@metalvideos1961 Im a Brit Naniiii I hate france
@Aquanixity
@Aquanixity 3 роки тому
@@JakSpate bonjour
@Antiork
@Antiork 3 роки тому
this guy is not an indian?
@oscarl2370
@oscarl2370 3 роки тому
@@Antiork lol no, listen at his pronunciation of r and you'll see
@TheWizardsOfOz
@TheWizardsOfOz Рік тому
This is amazing and fascinating.
@ifvictorthen10
@ifvictorthen10 2 роки тому
Great video! Currently your AI have a reactive approach to each frame of information it receives. Maybe you can give to the AI the current frame along 3 or 4 past frames so it can generilize the concept of speed and direction.
@DamienKoda
@DamienKoda 2 роки тому
cause it use a NN instead of a LSTM
@tmaster2151
@tmaster2151 3 роки тому
13:00 the girls in your dms after you get rid of that yee yee haircut
@Lilmanminion
@Lilmanminion 3 роки тому
Lol
@koo9ol
@koo9ol 3 роки тому
229 likes but 1 reply ! How
@tvojejidlo8143
@tvojejidlo8143 3 роки тому
@@koo9ol Actually three
@knoert7977
@knoert7977 3 роки тому
@@tvojejidlo8143 you're wrong it's four
@tvojejidlo8143
@tvojejidlo8143 3 роки тому
@@knoert7977 no its actually five
@EddieOtool
@EddieOtool 3 роки тому
13:32 That's a great shot!
@rubikvoncube3583
@rubikvoncube3583 3 роки тому
corona be like...
@BalthazarMaignan
@BalthazarMaignan 2 роки тому
Super intéressant ! Continue gars !
@ghosttwo2
@ghosttwo2 2 роки тому
Try incorporating race time into your fitness function. Something like f = Distance * (1 + 100/time) where time is set to some absurd constant if they don't finish; this creates two tiers, with the first one working their way to track completion, and the second tier improving their times. Might also be worth adding an extra layer to the network and see if it improves anything, along with a couple extra inputs like 'am I drifting'.
@TA_Plus_Hemi
@TA_Plus_Hemi 3 роки тому
Bro I love the fact you pronounce the H in hours. My wife cringes every time I do it.
@quantumsoul3495
@quantumsoul3495 3 роки тому
He speaks french and we don't have H in our pronunciation. So in english class we learn to pronounce the H. We do it every time we see a H in english now
@Aelfraed26
@Aelfraed26 2 роки тому
I stand with your wife.
@jnmaietta
@jnmaietta 3 роки тому
One thought, could the AI be slamming into walls because they don't have brakes built in as a response? From what you mentioned, they can only turn and accelerate. Would explain the lack of learning and inability to approach human times past that certain point. Anyways, the whole video is awesome and I wanted to give you specific props on the editing and overlays on this. Really help visualize concepts. Idk how you select which runs to run together in clips, or how you got that shot sitting in the middle of the track with cars going by, but they were great visualizers.
@yoshtm
@yoshtm 3 роки тому
Thanks !! You don't need brake on this specific map, Trabadia didn't use brake in his run for example. And the AI is still able to stop accelerating. But brake would be useful in more complex maps. It's easy to sort replays in folders, and to select and edit specific replays ingame. And there are tools to edit camera shots ingame.
@mastermind1677
@mastermind1677 Рік тому
I wish I could also do these things, I am learning, I want to make a neural network in python, I saw some videos and didnt understand them fully, I do have some prior knowledge and experience on python, but now im trying to learn more deeply, then I will return to neural networking.. Good job, I really enjoyed the vid, thnx
@eLasmc
@eLasmc 2 роки тому
You did more science for one Video than i did for my master thesis in cs
@JohnSmith-rf1tx
@JohnSmith-rf1tx 2 роки тому
I feel like using shortest split times from one checkpoint to the next might give better results. Maybe there's a car that doesn't do sections 1 and 2 very well, but kills it on section 3 where every other "well performing" car is having lots of trouble. If your fitness function is only time to end point, you're going to miss out on some more targeted improvements.
@neyth1942
@neyth1942 3 роки тому
Je regrette pas de m'être abonné avec la première vidéo, super hâte de voir la suite !
@bartvandelint
@bartvandelint 2 роки тому
you should add a fitness function for distance traveled between checkpoints as well, while this would make the cars take the inside of corners
@firethenoob1012
@firethenoob1012 2 роки тому
just got this recommended awesome!
@R4GEcraft
@R4GEcraft 3 роки тому
fun fact about this video : The narrator'a accent have been mentionned 26 times in the comments. 26 out of 26 are french guys making fun of it. 0 out of 26 are "international" english speaking people claiming it interfered with their viewing experience. Conclusion : french should shut the fuck up when one of them speaks english
@yoshtm
@yoshtm 3 роки тому
Interesting comment ! I know my accent isn't very good. If I make another video with voice-over, maybe I'll try to make it in French with English subtitles, I'll have to think about it. If any of you have a constructive opinion on this, please let me know :)
@verticalgaming9042
@verticalgaming9042 3 роки тому
@@yoshtm The whole point of the comment is to say that your accent is fine and there is nothing wrong with it. You're a perfectly understandable and should keep doing what you do.
@razaanbotutihe7420
@razaanbotutihe7420 3 роки тому
@@yoshtm no, keep voice ovet with english language, not only it will improve your english (im assuming english isnt your 1st language), and that way you can train your english speaking ability, and can attract more viewers.
@Advenyx
@Advenyx 3 роки тому
@....... Tu feras gaffe, ton anglais est plus propre que ton français...
@randowdude6847
@randowdude6847 3 роки тому
Conclusion: everybody hates french people
@TheExplosiveGuy
@TheExplosiveGuy 3 роки тому
That last compilation of all the runs together, looks like a marble run from overhead, the way they all bounce off corners in the same spots, and all the erratic moving back and forth, just like marbles.
@thetinyurlguy
@thetinyurlguy 3 роки тому
AI: Jelle's Simulated Marble Runs Martin: I'm still making the Wintergatan MMX physical
@luvinthejazz
@luvinthejazz 2 роки тому
The final runs remind me of water flowing down a stream bed. It shows why a river has curves. The particles collide with and erode the outside of the curves, and leave deposits on the inside of the curves.
@ruukaoz
@ruukaoz 2 роки тому
it would be interesting to so see how the A.I. would tackle the challenge if the map would mutate in slight ways, and how it would affect the learning.
@StyggeOrd08
@StyggeOrd08 3 роки тому
Nobody: A chocolate bar in my pocket: 13:32
@donadigotm
@donadigotm 3 роки тому
Always love seeing videos like this! Well explained concepts and presentation. I'm sure a lot of work has been put into this one.
@yoshtm
@yoshtm 3 роки тому
Thanks ! and thank you also for your help with Openplanet :)
@thomaskleine6121
@thomaskleine6121 2 роки тому
Thank you for that explaining
@hopeandpiece
@hopeandpiece 2 роки тому
You definitely had the mutation factor high enough there! So many generation and you still get a large share that gets the enlightened idea of "Just Turn Left" Ever thought of tuning down the randomness on successive generations slowly?
@harryaddlesee6529
@harryaddlesee6529 3 роки тому
I watch code bullet and riolu a lot and you’re like a mix of them both. Love these types of videos :)
@Sniperrkr
@Sniperrkr 3 роки тому
13:17 "Guys, go ahead, do not wait for me! I forgot the mask, need to get back home and get it."
@yoshtm
@yoshtm 3 роки тому
ahaha nice comment
@stefannesic2421
@stefannesic2421 9 місяців тому
Thank you for helping me visualize a different goal then this i am trying to reach with AI!
@MigunoOS
@MigunoOS 2 роки тому
When you ad extra time you see the brilliance of the ai, it applies what it had learned from the previous part
@randomshadow4620
@randomshadow4620 3 роки тому
Imagine an AI who knows how to find shortcuts.
@nuarius
@nuarius 2 роки тому
that is the inevitable outcome of a well developed self learning AI. Shortcuts and exploits....
@hencethebeetroot
@hencethebeetroot 3 роки тому
Running from zombies in real life: There's a lot of them, better walk fast Running from zombies in movies: 14:27
@lekonai
@lekonai 3 роки тому
running from zombies in real life? whaa
@lekonai
@lekonai 3 роки тому
@@mebe6474 you what
@cinegraphics
@cinegraphics 3 роки тому
The red one is Neo. The blue ones are agents Smith.
@arnabgon1157
@arnabgon1157 2 роки тому
13:32 THE routes taken by the cars look like a mycelium, beginning to explore a surface looking for food. Damn trippy.
@Raterex
@Raterex 2 роки тому
I am thinking that expanding on the 'memory' idea of the map for genetic algorithm would help improve its performance. Some ideas off the bat: 1. Simplify map input into a chain of data point set (e.g. position, angle of turn to the left/right etc), similar to how notes are recorded on a score sheet 2. Have real-time positional input data for the neural network to know its progress on the track With these information, the genetic algorithm might learn how to make predictions and optimize moves to handle upcoming portions of the map, based on its current and future data point sets.
@JstAnothrVirtuoso
@JstAnothrVirtuoso 3 роки тому
Brilliant stuff. Always get excited when I see your videos pop up. As you said, supervised learning is better for new tracks, but (idk about others) for me it's more interesting to maximize a particular map using GA. As someone who's worked with GAs before, I can relate to your problems of time consumption being the biggest issue. It'd take many more iterations with different parameters (and more generations) to start finding a global maxima. I know that in tmnf people have managed to speed up the game to help with stuff like this, but I'm not sure if you could merge it with your program.
@yoshtm
@yoshtm 3 роки тому
Thanks ! I'm glad you like this, I've been watching your videos for years :D Yeah time is by far the biggest problem, 100 generations already takes so much time.. I think Supervised learning is pretty cool too : it would be awesome to have an AI that can play any map, and let it discover the campaign and collect the medals for example !
@ericdanielski4802
@ericdanielski4802 3 роки тому
ukposts.info/have/v-deo/jWmZpWiinIeYumQ.html
@raphaelturrasprenger7394
@raphaelturrasprenger7394 3 роки тому
If reality is a big simulation, probably I'm one of those cars that did go backwards.
@Litorax
@Litorax 2 роки тому
Reality has not to be a simulation, this simulation is based on reality.
@billmcdonald4335
@billmcdonald4335 Рік тому
All tracks have an ideal line, easily determined. Putting a weight on that line so the AI try to occupy it [like humans do] would help, don't you think? Also the interaction through corners can be accounted for - when to allow room or close the door, for example. This brute-force approach is great for seeing what the trickier aspects of the problem are, and was well worth doing. Keep diggin' - can't wait to see where this goes.
@homer46303
@homer46303 2 роки тому
I still have a ways to go to learn how to do this but I would consider starting from the beginning and adding a time penalty for hitting the wall or getting within a certain distance of the wall. This could help the auto selection process to wittle out the AI that make time wasting mistakes. I was reminded of 3b1b's video on gradient descent about finding a false bottom which is a problem that's hard to solve with machine learning.
@anothervoyager9654
@anothervoyager9654 3 роки тому
It is amazing how close it looks to a fluid passing through a tube.
@Polygarden
@Polygarden 3 роки тому
This looks great! Do you also buffer one frame? I did some similar experiments and noticed it's beneficial to have 2 or 3 timesteps as inputs for the neural network. Then your AI will be able to predict trajectories based on the frames before. (it would have a perception of speed) This is not exactly a map memory, but at least a very short term knowledge. They will then brake and accelerate better on curve entries or exits.
@williamknowles-kellett
@williamknowles-kellett 2 роки тому
It's interesting to me that they take wide turns. Because their inputs come from raycasting, they can't see around the turn like Trabadia. By taking the turns wide, they can see better.
@michaelrawson9100
@michaelrawson9100 2 роки тому
Maybe it would help if you give the AI some more parameters to play with. Hitting walls seems to be a feature here so perhaps have 1 parameter that is distance to nearest wall. Perhaps also introduce a collision penalty in the fitness (either use explicit collision detection or detect sudden drops in speed not accompanied by a brake action). Anyway - loved it. Keen to see more!
@robertmangan804
@robertmangan804 3 роки тому
Code bullet watching this be like: *sweats*
@NaThingSerious
@NaThingSerious 3 роки тому
It is so satisfying watching the hundreds of cars drive all over track like a wave
@almendratlilkouatl
@almendratlilkouatl 3 роки тому
Yes, like Satan
@aindatenhoconta
@aindatenhoconta 2 роки тому
What Tabadia did better than the zig zag approach is constantly calculating the distance from the car to the walls and the likelyhood of which side is the next turn based on the shape of the wall (or the green lines which serve that purpose)
@boneshakerjake
@boneshakerjake 2 роки тому
its like watching an ant colony all working together making a giant flat space its... kinda mesmerizing
AI beats multiple World Records in Trackmania
37:18
Yosh
Переглядів 2,4 млн
Training an unbeatable AI in Trackmania
20:41
Yosh
Переглядів 12 млн
The Quest for the Biggest Trackmania Shortcut
14:41
Wirtual
Переглядів 7 млн
A.I. Learns to Drive From Scratch in Trackmania
16:51
Yosh
Переглядів 7 млн
A.I. learns to drive on a pipe
21:43
Yosh
Переглядів 229 тис.
Simulating the Evolution of Aggression
13:17
Primer
Переглядів 22 млн
AI Learns Insane Monopoly Strategies
11:30
b2studios
Переглядів 10 млн
Neural Network Learns to Play Snake
7:14
Greer Viau
Переглядів 4,5 млн
How Trackmania Players Conquered Bugs
27:50
Wirtual
Переглядів 984 тис.
OpenAI Plays Hide and Seek…and Breaks The Game! 🤖
6:02
Two Minute Papers
Переглядів 10 млн
AI vs. AI in 100m Dash (deep reinforcement learning)
11:13
AI Warehouse
Переглядів 2,1 млн
New bug with Lily Server error #brawlstars
0:23
FireCrow
Переглядів 3,8 млн
Glitch Larry & Lawrie is the new ranked skin ✅ #shorts #brawlstars
0:20