128-core MONSTER Arm PC: faster than a Mac Pro!

  Переглядів 388,749

Jeff Geerling

Jeff Geerling

День тому

New Customer Exclusive - Get the Creality Ender 3 V2 for $99: micro.center/vnwy
Check Out Micro Center's New Charlotte Store, Opening Early 2024: micro.center/pzhn
Shop Micro Center's Fall Savings: micro.center/mcri
Submit Your Build, Receive a $25 Off Coupon When Approved: micro.center/4y00
Special thanks to ADLINK and Ampere for sending the Dev Workstation and Dev Kit, and to Maxon, ptitSeb, and many others for help debugging issues in the course of making this video. It has been a LONG term effort, and I am still learning new things every day :)
Mentioned in this video:
- Ampere Altra Developer Platform: www.ipi.wiki/products/ampere-...
- Ampere Altra Dev Kit: www.ipi.wiki/products/com-hpc...
- GitHub discussion on memory bandwidth: github.com/AmpereComputing/HP...
- Actually Hardware Overclocking video on memory ranks: • The differences betwee...
- Ampere's guide for Nvidia GPU support: github.com/AmpereComputing/NV...
- Ampere's guide for Steam on Arm Linux: github.com/AmpereComputing/St...
- Ampere's guide for Windows for Arm: github.com/AmpereComputing/Wi...
Support me on Patreon: / geerlingguy
Sponsor me on GitHub: github.com/sponsors/geerlingguy
Merch: redshirtjeff.com
2nd Channel: / geerlingengineering
#Arm #Linux #Sponsored
Contents:
00:00 - Move over, Mac Pro
00:52 - The ultimate Arm workstation
01:20 - GPU get
02:09 - Which upgrades do you want? Yes.
02:42 - RAM is weird, and I put in 384 GB of it
04:46 - 96 to 128 core CPU
06:07 - Memory bandwidth holds me back
06:30 - Nvidia or AMD? (for arm64)
07:24 - GPU Drivers
07:57 - GPU acceleration is hit or miss
09:25 - Windows on Arm: drivers lacking
09:57 - Cinebench 2024 champ
11:17 - Minecraft on Windows - okay
11:38 - Minecraft on Linux - BUTTER
12:19 - Games, Games, Games
13:50 - Going for the efficiency crown
14:47 - Infinitely more upgradeable

КОМЕНТАРІ: 806
@BlackPanthaa
@BlackPanthaa 6 місяців тому
Threadripper 399x user here, they will never fix over 64 thread usage in windows. I've tried it all.
@WartimeFriction
@WartimeFriction 6 місяців тому
Sounds like it's time for you to do the free upgrade to a superior Linux based OS
@ciprianrobo
@ciprianrobo 6 місяців тому
@@WartimeFrictionyou forgot the "I use arch btw" as part of your comment
@mindrage92
@mindrage92 5 місяців тому
I think the culprit is that Win32 API function: GetLogicalProcessorInformation only supports up to 64 proccessing units, due to using only a 64bit flag value for each cpu. GetLogicalProcessorInformationEx is the more modern one.
@leonpano
@leonpano 5 місяців тому
Proton will have maximum 64 threads
@vgernyc
@vgernyc 5 місяців тому
Even Windows Pro for Workstations?
@fujinshu
@fujinshu 6 місяців тому
Also, about the Ti in the GPU, NVIDIA pronounces it both ways. Jensen, the CEO, pronounces it as T-I (Tee-eye), while Jeff Fisher pronounces it as Ti (Tie/Ty)
@JeffGeerling
@JeffGeerling 6 місяців тому
The plot thickens! Internally I say T-I, when I pronounce it out loud it comes out "Tie", so who knows lol
@zblurth855
@zblurth855 6 місяців тому
@@JeffGeerling I guess you need to send red shirt jeff to Nvidia HQ so we may know the answer, better not have another gif situation
@mbe102
@mbe102 6 місяців тому
@@JeffGeerling well its originally Ti-tanium, isn't it? So it makes sense. But I've only ever heard Tee Eye.
@JamesGillean
@JamesGillean 6 місяців тому
@@JeffGeerling I don't know if i can handle your Tie pronunciation Jeff. It's like a punch to the ol' squeedily spooch.
@nathanielhill8156
@nathanielhill8156 6 місяців тому
​@@JeffGeerlingit used to be T-I, but they retconed it into Tie. My personal belief is a Texas Instruments trademark got involved.
@QuentinStephens
@QuentinStephens 6 місяців тому
One thing on your RAM vs core discussion: L3 cache requirements scale non-linearly with core counts thanks to the increased incidence of L2 cache misses.
@JeffGeerling
@JeffGeerling 6 місяців тому
That's why the chip architecture is critical with more and more cores. AMD, Intel, and Ampere all seem to take slightly different approaches. I've enjoyed some of the ChipsandCheese articles on these new architectures!
@shanent5793
@shanent5793 6 місяців тому
Do increased L2 misses increase or decrease pressure on L3? If it's non-linear then is it log, exponential, or polynomial?
@QuentinStephens
@QuentinStephens 6 місяців тому
@@shanent5793 It's non-linear and there's an exact formula. Let's say you have a 5% chance of a cache miss per core, so a 95% chance of a cache hit. The percentage chance of a cache miss with N cores is (1 - (.95^N)) * 100. Obviously the chance of a miss - that 5% - is dependent upon the workload. The more misses you have, the greater the pressure. And the fewer the RAM channels you have the greater the effect of L3 cache misses.
@shanent5793
@shanent5793 6 місяців тому
@@QuentinStephens that's just the chance of at least one miss. Multiple misses will have binomial probabilities so their sum converges to linear. 128 cores are expected to have twice as many misses in total vs. 64 cores. Either way more cores causes more L3 pressure so why does the Ampère only have 16MB which is less than desktop CPUs with only 6 cores or 12 threads?
@QuentinStephens
@QuentinStephens 6 місяців тому
@@shanent5793 I'm not sure you're correct about the binomiality but yes, I do agree that the 16 MB cache does seem rather low, especially when we have Epyc CPUs with 1 GB cache for similar numbers of cores.
@IamTheHolypumpkin
@IamTheHolypumpkin 6 місяців тому
Honestly I wouldn't at all be surprised if valve would tell us tomorrow that they release a fork of Box-86 and Box-64 build right into steam so to support all steam games on ARM and RISC-V. Valve would be insane enough to do this and there's no number 3 so it allowed.
@circuit10
@circuit10 6 місяців тому
It would make sense if they’re considering using ARM for a Steam Deck successor, like maybe that new Qualcomm one that’s meant to be really good?
@AlwaysBolttheBird
@AlwaysBolttheBird 6 місяців тому
It’s one of the reasons I love my steam deck so much. Issue? Not in 2 hours haha
@KingVulpes
@KingVulpes 5 місяців тому
I don't know, code weavers contacted valve about built in support for crossover on macOS steam and they still haven't done anything about it (source: I contacted code weavers themselves about it and they said that they did pitch the idea and that it is up to Valve)
@mgord9518
@mgord9518 5 місяців тому
​@@KingVulpesBecause Valve's primary focus is on Linux, not macOS. Another thing is Crossover is a paid product, I find it highly unlikely that CodeWeavers was interested in just providing it to Valve for free without getting a cut, that's probably why Valve wasn't interested. Providing x86 emulation for ARM, however, could directly benefit Valve as it would allow for future low-draw devices, although I'm not holding my breath.
@nempk1817
@nempk1817 2 місяці тому
the problem is not steam, the problem is that you will use it to play the most simple games for the simple reason ARM.
@someguy9175
@someguy9175 6 місяців тому
I really want to see these in a consumer level platform while keeping itself upgradeable.
@iikatinggangsengii2471
@iikatinggangsengii2471 6 місяців тому
most people will be pleased even with half quality, they kind of work well together
@SaltCollecta
@SaltCollecta 6 місяців тому
3 grand for a 128 core CPU. I remember when Intel used to charge 5 grand for a quad core server. Lol, what an exciting time to be alive. I will buy one in a few years when it's stable and on the used market for a reasonable price.
@dzello
@dzello 6 місяців тому
The issue is the lack of support from software. Not enough stuff makes use of all the cores.
@SaltCollecta
@SaltCollecta 6 місяців тому
@@dzello I have a feeling that golang with a huge workload would do pretty well.
@DeltaSierra426
@DeltaSierra426 6 місяців тому
Yeah, lol. Can't get to 128 x86 cores at $3K even on ThreadRipper, either, unless it's used.
@dzello
@dzello 6 місяців тому
@@DeltaSierra426 Those limitations are definitely unfortunate. Making a powerful CPU by making it bigger with a bigger socket? Easy. Making a powerful GPU by making it bigger with a bigger socket? Easy. Even if we don't improve the technology, we can add more and make it bigger. But then... Games: I'll use 1/128 your CPU and 1/3 of your GPU.
@kepler_22b83
@kepler_22b83 3 місяці тому
@@dzello I think making a program able to use the potential of this hardware isn't that hard. It's just that people don't usually do it. With time, and more and more complex software this extra horse power might be needed... Though, there's indeed a limit for consumer grade applications, and crossing that limit is just being inefficient or lazy with your code
@23lkjdfjsdlfj
@23lkjdfjsdlfj 6 місяців тому
I appreciate the effort you make to provide lots of details.
@Gaming_with_Martin
@Gaming_with_Martin 6 місяців тому
ARM is really making huge moves am convinced very soon they will have 6 cores 8 cores and 16 cores lineups for consumers
@adamschackart6859
@adamschackart6859 6 місяців тому
Odroid N2+ is 6-core, Orange Pi 5 is 8-core, both of which can be purchased today for relatively dirt cheap!
@GustinJohnson
@GustinJohnson 6 місяців тому
RISC-5 is jumping into the fray. I am looking forward to getting my 64 core dev board in December. I am so happy to have this level of competition in the market again.
@Pasi123
@Pasi123 6 місяців тому
@@adamschackart6859 But they aren't something you'd put in a tower case and don't have a socketed CPU and memory or PCIe slots
@DavidTMSN
@DavidTMSN 6 місяців тому
That's great for those using them for production but are they going to be able to be clocked at the kind of speeds we're seeing currently?
@ultimatedarkkiller7215
@ultimatedarkkiller7215 6 місяців тому
Actually, processors on smartphones are ARM, and they are usually 6-8 cores, so yes that already happened years ago lol
@Insightfill
@Insightfill 6 місяців тому
1:47 LOVE the "18 minute pickup" at Microcenter. I've built both of my kids' gaming towers by picking out the parts, hitting "buy" and driving right over. Even picked up Dell XPS 13s for each of them the same way.
@MattStevens9824
@MattStevens9824 5 місяців тому
This is soooo cool! I can definitely use this for my MS Excel worksheets!
@fxrisxmxli
@fxrisxmxli 5 місяців тому
I wish we had something like Micro Centre where I'm from. Tech heaven
@digitalsparky
@digitalsparky 5 місяців тому
Exciting to see ARM gaining! Fantastic for servers (specifically high thread/process count web servers), etc.
@alexanderulyev4651
@alexanderulyev4651 5 місяців тому
Great stuff, thank you, Jeff!
@garciajero
@garciajero 6 місяців тому
HELL of a video Jeff !
@KG4JYS
@KG4JYS 6 місяців тому
We're finally returning to the RAM situation we had a decade ago, where workstation motherboards had lots of RAM slots. My (now very old) super micro x8dah+-f board has 18 (9 per CPU). IMO, the biggest problem with modern processors is the extremely limited PCI-e lanes available. Look at chip specs over the years, and it's something that has steadily decreased. With Thunderbolt and NVME, PCIe lanes are the most limiting feature on all my computers - even laptops.
@JeffGeerling
@JeffGeerling 6 місяців тому
Yeah; I have run into that on my Ryzen 7000 series desktop, there are few motherboards that even expose the lanes in a way I can fully utilize them :( The nice thing with this Ampere chip is it has 128 lanes, and almost all are usable on this motherboard! Still always want more, for more IO :)
@arof7605
@arof7605 6 місяців тому
Still run a 4790k on my seedbox due to this. Haven't found a non-server mobo with 10 on-board sata slots for spinning drives since that generation for any other CPU I bought.
@shanent5793
@shanent5793 6 місяців тому
128 PCIe 4.0 lanes is plenty; that's 512 GB/s, more than enough to saturate 6 channels of DDR-3200 with only 154 GB/s half-duplex bandwidth. It's up to the motherboard or backplane designer to allocate them
@NicolaiSyvertsen
@NicolaiSyvertsen 6 місяців тому
The issue is that the PCIe lanes are used for M.2 slots and other onboard functions that didn't exist on boards 12 years ago. Back then those PCIe lanes mostly went to actual PCI slots.
@GSBarlev
@GSBarlev 6 місяців тому
​​​@@arof7605My 4790k was a beast-even though I was never able to overclock it, it ran my main computer for over half a decade, and its core performance was *never* the bottleneck. But I'm surprised you're still using it-how do you live with a mere 32GB of RAM? (asked half-jokingly)
@Daggenthal
@Daggenthal 6 місяців тому
This is so fucking sick man, I love the development that ARM desktop / server cores have been making! I know we have other Architectures as well (RISC-V) and it's awesome that they're all making strides, but to see this amount of progress now? Fuck yeah! I remember watching your older videos where you literally couldn't detect the GPU or even push anything out to the frame buffer, but now look at it :D
@AwareOCE
@AwareOCE 6 місяців тому
Awesome video! ARM is a fascinating architecture, I cant wait to see where it goes in the near future!
@BaiFangLu
@BaiFangLu 6 місяців тому
Great video an components and benchmark. Looks like you also have a lots data on DIMMs and waiting for a new video on them too.
@JeffGeerling
@JeffGeerling 6 місяців тому
We'll see; right now most of the data is spread across some GitHub issues. I may do at least a blog post on it at some point.
@danagoyette7932
@danagoyette7932 6 місяців тому
Something to note about NVIDIA's ARM binary drivers: they have driver library files for x86-64 and aarch64, but they don't have armhf driver libs for software running under box86. That is, box86 converts 32-bit Intel into 32-bit ARM, not into 64-bit ARM. For i386 games, you'd likely need to use an AMD GPU -- Polaris (RX5xx) or older. One game I find very useful for checking the performance of GPUs on ARM is Veloren. It uses Metal on MacOS, Vulkan on Linux, and Vulkan or DX12 on Windows (though there's no ARM Windows build).
@leonpano
@leonpano 6 місяців тому
but why all source game crash on linux and i have rtx a5000 and platform is amd64 it crashed same way like in this video but on amd64 not arm64
@frankmoras63
@frankmoras63 5 місяців тому
Quite a few shooter games with anticheat that failed, might that be the common denominator ?
@lucasrem
@lucasrem 5 місяців тому
danagoyette7932 What titles you run good on ARM ? all old DOS titles ?
@Infinitetreee
@Infinitetreee 3 місяці тому
@@leonpano source games are the one games that run well on linux for me, i use a gt 710 (unsupported alot, still has drivers but will probably stop very soon )
@STEELFOX2000
@STEELFOX2000 6 місяців тому
What I learned here .... Its Amazing but still not work yet!!!! Great job BTW! I loved this video!!
@Flargenyargen
@Flargenyargen 6 місяців тому
I admire it so much that you are able to work around such unusual circumstances. I can't even get a Linux graphics driver fully working in an ideal setup.
@robkam643400
@robkam643400 5 місяців тому
Just buy hardware for linux, instead of the other way around. Buy all AMD. It'll all work out of the box if it's over a year or so old.
@leonbishop7404
@leonbishop7404 5 місяців тому
@@robkam643400 I understand why you would want to buy AMD gpu for linux, but what's the point of swapping Intel CPU for AMD one?(unless you mean Intel ME, but it's works the same with with Windows)
@kevinm3751
@kevinm3751 5 місяців тому
O yaaaa! Micro Center... Inconveniently located for 90% of ALL OF US!
@spinthma
@spinthma 6 місяців тому
Really amazing, did not know that there are already ampere cpu workstation in the field!
@Stealthmachines
@Stealthmachines 2 місяці тому
Very informative, thanks!
@JCtheMusicMan_
@JCtheMusicMan_ 6 місяців тому
The machine specs gives me the same feelings as when I saw and heard a monster truck performing in person for the first time! 🔥🤯
@pendragonscode
@pendragonscode 6 місяців тому
Awesome content as always!!!!
@ianperkins8812
@ianperkins8812 6 місяців тому
Dang. Getting a type 1 hypervisor on that thing would be SWEEEEET
@lavavex
@lavavex 6 місяців тому
Just went to micro center yesterday to pick up the EVA asus parts. Love seeing my hometown micro center here, STL rep!
@user-um9sl1kj6u
@user-um9sl1kj6u 6 місяців тому
I tried box 86 and 64 a Long time ago:-/ It’s nice to see someone else having better luck
@Karthig1987
@Karthig1987 6 місяців тому
Good video. Easy to understand information.
@LollosoSiTV
@LollosoSiTV 6 місяців тому
Hey Jeff, running the bedrock edition, especially the mobile version as you did, is a far too easy challenge for your rig. I suggest running and comparing the latest java version and a specific modded version: Faboulously Optimized. To get any architecture incompatibilities out of the way, consider using a launcher that comes as a JAR file, such as the Technic launcher Make sure to use the latest jre (20-21) and set the proper JVM flags Additional bonuses: shaders, resource pack with parallax mapping + physics mod pro (then grab a fire extinguisher) Looking forward to hearing from you!
@totem168
@totem168 6 місяців тому
damn one of my dream server thank you to review it Jeff
@user-pu7mv2tu3m
@user-pu7mv2tu3m 6 місяців тому
First time watcher, awesome video so far! Subd ofc. There was this guy that printed custom ducting for his fans directly over the cpu etc. Got good results. Oh and a guy that cooled with dry ice and got 0°c. For someone who doesn't know about computers, does that mean you can run it all infinitely fast lol?
@NicoDsSBCs
@NicoDsSBCs 6 місяців тому
Nice. They really make amazing stuff. Too bad I can't afford it. Would love an Ampere workstation so much. But I'm happy with RK3588 and my pc when I need it.
@DukeBoy82
@DukeBoy82 6 місяців тому
I would love to see a video/more information on the opensource LLM you used in this build. That looks super interesting.
@PrinceWesterburg
@PrinceWesterburg 6 місяців тому
Thermal Paste: Forget what LTT says, its a physical junction that transfers heat, the larger the contact the more heat can move across it. So you are completely right to spread the thermal paste out. Physics!
@idtyu
@idtyu 5 місяців тому
I would try it on Fedora, which has vanilla and almost edge Linux kernel, plus they have proper nvidia support with wayland now, and with its special ram config (which requires no swap now). Things might run better. And I always use the flatpak version of steam, runs quite good
@denvera1g1
@denvera1g1 6 місяців тому
11:00 I think this has been a problem in Cinebench since inception, originally it was only an issue for very niche 4 and 8 socket systems, but with EPYC, Threadripper and Xeon Platium (cascade lake) with up to 56-64 cores per socket and 2-4 sockets, many cores started going un-used in and after 2019
@lucasrem
@lucasrem 6 місяців тому
Den Verga intel is NOT ARM levels ! You need apple, UNIX !
@denvera1g1
@denvera1g1 6 місяців тому
@@lucasrem we're not talking about perfromance, only core count, IIRC when i built my 12 core dual processor Xeon X5690 desktop the current-at-the-time version of cinebench only supported 16 threads, not 24
@TechnoTim
@TechnoTim 6 місяців тому
I'm ready for (another) ARM desktop!
@olavaaf2218
@olavaaf2218 6 місяців тому
Nice video! Almost getting one myself! Is it the 2,8Ghz version of the CPU Ampere will ship? Regarding the Mac, let’s not forget M2 Max and M3 Max have tremendous memory bandwidth, 400GB/s.. quite much more so than a DDR4 system I believe. That makes them maybe faster in memory bandwidth limited problems, such as several types of simulations etc with low flops per byte ratio. AmpereOne has the DDR5 memory system support. However, I have not seen it easily available like this CPU is. With only 3 out of 4 memory channels being connected, maybe the 96 core version is a “better fit” as the amount of bandwidth per core will be quite better, for anything bandwith sensitive that is.
@boolightningstudios
@boolightningstudios 5 місяців тому
Wish we had a Micro Center
@dzltron
@dzltron 6 місяців тому
I really miss living near a Micro Center. They really is the best PC store I've ever been to. Please come to the PNW!
@robbin4022
@robbin4022 6 місяців тому
Upgraded my laptop's monitor to 4K and with 100% scaling I can read the text on your screen at 0:55 With any other scaling the text becomes more blurry and if I right click on the video and click stats for nerds, the resolution of the viewport changes with the scaling. Also, on my win 10 laptop I can't just hover over the speaker icon in the taskbar and scroll to change the volume, which win 11 does. Sorry for the unrelated comment but hey, good to see you are doing well and are in good health!
@jbucata
@jbucata 6 місяців тому
For at least one of those games, the text console had an error message about "out of thread IDs". Presumably it's trying to spin up one thread per core or per SMT. If you can artificially limit the number of cores that the OS sees, or that it shows to userspace programs, you might have a shot at getting these to work... Does ARM have SMT? Turning that off would be interesting too.
@bryanteger
@bryanteger 6 місяців тому
Really cool. Btw Jeff I found a way easier way to connect LTE modems via, Modem Manager and Network Manager. No need to install QMI libraries, they're already in Debian 12.
@PodcastUbuntuPortugal
@PodcastUbuntuPortugal 6 місяців тому
We approve your usage of SuperTuxKart!
@aliyuabba4575
@aliyuabba4575 6 місяців тому
Keep them vidoes coming please. This will greatly help Windows on ARM development going forward before the X Elite drops.
@JoeSpeed
@JoeSpeed 6 місяців тому
… and after, Ampere multi-core performance is in another league
@lucasrem
@lucasrem 6 місяців тому
aliyuabba4575 Xcode, apple. Do it better ????
@Teluric2
@Teluric2 Місяць тому
​@@JoeSpeedampere is crap for video and cfd.
@sethbessinger2025
@sethbessinger2025 6 місяців тому
That’s really cool! Imagine if we could a RISC-V CPU to game on Linux!
@jrshaul
@jrshaul 6 місяців тому
It sounds like the real bottleneck here is DDR5 support. Which is supported by the upcoming Ampere revision. Which is even faster. This is a surprisingly effective workstation for a development board, and further software support should improve it even further. I could see Blackmagic integrating one with a pile of their PCIe cards to build a behemoth video switching workstation capable of real-time effects - and driver support is a lot easier when you make the cards!
@lucasrem
@lucasrem 6 місяців тому
jrshaul What DDR u used, Ampere ? ARM is not needing more than 6000 DDR 5 !
@110gotrek
@110gotrek 5 місяців тому
Please more Ampere content
@Jeditilt
@Jeditilt 6 місяців тому
Awesome video. Thanks!
@geekaholic88
@geekaholic88 6 місяців тому
This video is so fscking awesome!
@utfigyii5987
@utfigyii5987 5 місяців тому
The time of the pc2 is coming!!
@kudu9
@kudu9 6 місяців тому
I thought this gonna cost a kidney and a heart but the price is actually really good
@SilentCtrl_
@SilentCtrl_ 6 місяців тому
I'm running a VM on Hetzner that uses Ampere and it performs great for the price.
@PurpleKnightmare
@PurpleKnightmare 3 місяці тому
Yeah, wish there was a Micro Center near Seattle.
@ProjectPhysX
@ProjectPhysX 6 місяців тому
2:49 That RAM cannot keep up is not only the case for server CPUs. Even for these super fast data-center GPUs, 2TB/s VRAM bandwidth cannot keep up, because compute Tflops is still so much larger. They could cut the GPU die size in half and the software would still perform thr same. Nearly all compute software is bandwidth-bound nowadays.
@JSON_bourne
@JSON_bourne 5 місяців тому
God I wish there was a microcenter near me
@tito_me_doe676
@tito_me_doe676 4 місяці тому
I just saw this, but Minecraft Java runs on ARM, with shader support-just use Prism Launcher, and you’ll need to install a specific ARM JDK version for the version of Minecraft you intend to run. The entire process is identical to using Prism Launcher in any OS on any CPU, and there are many guides on how to do it. I highly recommend when you create an instance, rather than choosing a vanilla game, go straight to the mods page, and search for and select “fabulously optimized” from the mod search menu. Then you’ll want to install Iris, and then go to the resource packs tab and search for shaders
@blisphul8084
@blisphul8084 6 місяців тому
One large advantage to the Apple RAM is that it's unified, meaning you essentially have 192GBs of VRAM, which is useful for machine learning tasks.
@darinrosse1621
@darinrosse1621 6 місяців тому
Exacto
@Teluric2
@Teluric2 Місяць тому
192 gb of ram for gpu and cpu will use air for working?
@theyoutubes4249
@theyoutubes4249 6 місяців тому
Would be great to know if you eventually manage to get Llama to use the GPU on the ARM system.
@zachzimmermann5209
@zachzimmermann5209 5 місяців тому
Thanks for sharing this with us, Jeff! I wasn't really aware just how compatible things were with ARM on Linux. I have to admit though, the LLM performance was actually rather poor. A used RTX 3090 (maybe $750?) could run that llama 2 13b model at 10x the inference speed. I'll be very interested once the GPU support with ARM is worked out as that seems like the main issue.
@JeffGeerling
@JeffGeerling 5 місяців тому
Yeah; honestly I think it just needs a little more twiddling and you could get GPUs to do the inference a bit faster.
@mori7423
@mori7423 2 місяці тому
I'm really wanting on arm to finally come to the desktop scene, especially for gaming and some lite office work or programming. I switched from x86 laptop to base model m1 mba and I'm so impressed with the power efficiency and battery life, got me through a huge machine learning project. Too bad apple hates their users and we'll have to wait for someone with more sense to come to desktop market with arm computers that can compete with Apple Silicon
@FranciscoMonteiro25
@FranciscoMonteiro25 5 місяців тому
sounds good, should be able to run opensource 7b&13B LLM locally, i need to check if available in Austria
@TT-it9gg
@TT-it9gg 6 місяців тому
Thanks for the video. One question, the GL mark is 10260. Is that by CPU or 4070Ti? The Jetson Nano can do 2000+
@user78405
@user78405 5 місяців тому
i pretty confident nvidia arm cpu gonna have built in x86 emulation hardware layer to run customize vkd3d-proton
@AlanTheBeast100
@AlanTheBeast100 6 місяців тому
You only get power from cores if you throw threads at them and if the memory can keep up.
@nayrpc
@nayrpc 5 місяців тому
yey u tested ksp! best game ever
@montecorbit8280
@montecorbit8280 6 місяців тому
At 2:43 Ubuntu and Windows for ARM.... Did you try any other Linux distro?? Just curious on that.... I have been coming down here to suggest ChimeraOS because it runs steam very well, but then I remembered it may not have an ARM flavor....if it does that might be a good way to go!! Manjaro apparently has the ability to act like SteamOS since both of them are based on ARC Linux.... Hope you have an excellent day!!
@spurdo6747
@spurdo6747 5 місяців тому
just curious did you try x265 cpu encoding? it gives nice quality for bitrate and you have the cores
@StarcoreLabs
@StarcoreLabs 6 місяців тому
Micro Center is the best! Great video.
@garyhuntress6871
@garyhuntress6871 6 місяців тому
I'm really interested in the LLM and machine learning aspect of this. I'm about to upgrade my old dual 24 core Xeon (w/ 512GB of ECC) to a modern high core count plus high end GPU. This is absolutely on my radar now. Do you have specific motherboard recommendations?
@JeffGeerling
@JeffGeerling 6 місяців тому
If you're serious about the LLM aspects, the best options would be some of the server builds, from Gigabyte, Supermicro, Asus, or one of those vendors. ServeTheHome has some interesting reviews of GPU-heavy Ampere machines used for the purpose.
@aloysiushettiarachchi4523
@aloysiushettiarachchi4523 6 місяців тому
Hello, how does it compare with a cisc machine in matrix handling?. This is most important in scientific work. M1. M2, etc are for simple arithmetic in raytracing, I belive.
@The_Slippery_Slope_NZ
@The_Slippery_Slope_NZ 6 місяців тому
I hope you're Crohns is treating you well my man, all the best to you fellow Crohnsman.
@JeffGeerling
@JeffGeerling 6 місяців тому
Crohnies unite!
@DarrylAdams
@DarrylAdams 6 місяців тому
Could you run virtual machines in this hardware? Could QEMU/KVM emulate raspberry pi, Mac OS or even X86 os? Imagine running a virtual cluster of pi? And while quickemu can run MacOS, running the latest apple sillicone version could be very useful.
@pikachuchujelly7628
@pikachuchujelly7628 5 місяців тому
Yes, Qemu supports emulating the Raspberry Pi.
@augustinolarian
@augustinolarian 5 місяців тому
Hello. How about using it as a web server and virtualization (ESXi and windows server with hyper-v)? Can you do some tests for these? and maybe compare with some xeon processors? how fast mysql is on those cpus? I really look at these ARM CPUs and i see they might change the servers world and i really think of getting an ARM server.
@orlie_dev
@orlie_dev 6 місяців тому
but can it run crysis
@Skiller71Studios
@Skiller71Studios 6 місяців тому
Jeff, glad you made it out of the Brentwood parking lot alive after that microcenter trip! 😂
@parthbhardwaj4279
@parthbhardwaj4279 6 місяців тому
Is this configution good for stable diffusion and deepfake training?
@TheSolidlad
@TheSolidlad 4 місяці тому
What's the case used for this build called? Looks pretty good.
@soontoberockstar8
@soontoberockstar8 6 місяців тому
I’m curious since the nvidia GPU wasn’t working with windows 11 but does work with Linux if using wsl ubuntu would help solve this?
@eDoc2020
@eDoc2020 6 місяців тому
Bandwidth not keeping up with compute power has long been an issue. One amusing statistic is that standard floppy disks are faster than a typical NVMe drive (compared to capacity). You can read a 1440k floppy in about 45 seconds but a Samsung 990 PRO 2TB will take over 4 and a half minutes. Even the IOPS per megabyte is a bit faster on the floppy. With a slow step rate of 8ms you'd have a worst case of 840ms access time or .82 IOPS/MB. The 980 is 1.4 million IOPS best case which comes out to 0.7 IOPS/MB.
@insanesicsix6
@insanesicsix6 4 місяці тому
WOW. Increible proyecto. Podrias probarlo con la mineria de tokens para esa computadora..
@kovacspis
@kovacspis 5 місяців тому
Did you try manually setting the number of threads inside Cinebench? File menu, settings, put the tick in, set desired number of threads.
@harryragland7840
@harryragland7840 6 місяців тому
When your Microcenter is also Jeff Geerling's Microcenter....hey, did you leave anything for me Jeff?
@JeffGeerling
@JeffGeerling 6 місяців тому
Not *this* time!
@blehbleh9283
@blehbleh9283 6 місяців тому
Hoping to get an internship at Ampere. Thanks for bringing this company to my attention over the years
@Girvo747
@Girvo747 6 місяців тому
What GCC toolchain does it use? I guess I'd still be using gcc-arm-none-eabi to cross-compile the embedded work that I have to do haha
@anonemoose102
@anonemoose102 6 місяців тому
What the website, package that you used to run a LLM model locally? I'm curious to see what my 3060ti can run. Thanks!
@RealEverythingComputers
@RealEverythingComputers 6 місяців тому
Can you please do a video where you compare the i9-13900KS with the Ampere Altra?
@ErikS-
@ErikS- 6 місяців тому
I reminded myself of the AMD Opteron CPU's of around 2005. Those CPU's also reached similar clock speeds. You start to wonder if we really can only move forward in other ways than increasing clock speeds...
@Looser_23
@Looser_23 5 місяців тому
well combustion engines have also gotten way more powerful in the last decades, but not by increasing rpm.
@mitcoes
@mitcoes 6 місяців тому
I have just read a 3 years old post in reddit writing that open source AMD GPU drivers can be used in ARM Linux (Oland GPU with blobs for initialization). It is a shame that open source drivers cannot be compiled if needed, and that games are not compiled for Linux (x86 and ARM) and Vulkan. And it was a pleasure watching that it is possible as with Super Tux Kart. It seems Nvidia and ARM are making SoC for laptops and handhelds, for MS WOS, Chrome OS and Linux perhaps future good drivers will come with them.
@PremierPrep
@PremierPrep 5 місяців тому
Linux is killing it on ARM! Great video!!
@pikachuchujelly7628
@pikachuchujelly7628 5 місяців тому
That's a lot of RAM! Holy cow!
@hannescampidell
@hannescampidell 6 місяців тому
Minecraft Java edition for linux (through an unofficial launcher) should run well on this beast (on the Nintendo Switch with Linux installed it is playable)
@michaelsopunov
@michaelsopunov 5 місяців тому
Hello Jeff, why not video production? Davinci Resolve not yet compatible with Ampere?
@Rostol
@Rostol 6 місяців тому
they are not out yet, but those large new threadripper pros at 5gz look SWEET too (for sure over 5k per cpu tho)
@IHateHandlesWayTooMuch
@IHateHandlesWayTooMuch 6 місяців тому
I have never expected an AHOC shoutout when watching a video on ARM CPUs, but here we are. Nice!
@xaytana
@xaytana 6 місяців тому
Hey Jeff, I've been curious about something for awhile but haven't really seen A-B testing of it anywhere, and I wasn't sure if you had any industry contacts that would know considering you're easily one of the biggest proponents of ARM desktops on the platform. Do ARM cores actually prefer the tight timings of DDR like x86 CPUs, or do ARM cores prefer raw speed and bandwidth like GPUs. I remember seeing some theoryposting forever ago stating that GDDR might be better for the ARM ecosystem, especially as core counts scale higher. Unless one of these companies have decent public research into it, I guess the best of A-B testing is whenever consoles adopt ARM CPUs and their PC board counterparts (with DDR slots) exist, as even the x86 consoles choose GDDR modules as the shared memory. If ARM cores don't care about timings and latency and prefer speeds and bandwidth, seeing GDDR tested on ARM cores could be extremely interesting, especially if it does push for a spec change that has modular GDDR.
@davidrobertsson7640
@davidrobertsson7640 5 місяців тому
Thank you for this video. Have been looking at that dev kit for a while now. But hesitated to buy - much to the lack of information and the "dead" forum threads. The ram sticks you went with - what specs did you go with? Do you have any recommendations or "beware of" when it comes RAM- modules? Hopefully this system will rock with FreeBSD! Placed my order today =D
@JeffGeerling
@JeffGeerling 5 місяців тому
Almost any ECC DDR 3200 SO-DIMMS will work okay, I chose Samsung as I tested them and Transcend and found the Samsungs to be consistently faster.
@davidrobertsson7640
@davidrobertsson7640 3 місяці тому
@@JeffGeerling can tell you that Samsung - DDR4 - module - 128 GB - LRDIMM 288-pin doesnt seem to work =D
@stumblinguponbliss
@stumblinguponbliss 6 місяців тому
Hi Jeff, awesome video but I wasn't able to understand if this build would be better then a Mac m2 for video editing. That was the main reason I was looking at the Mac versus the PC which is what I mostly use. I am also evaluating a build for Docker. I was thinking of a VMware solution
@JeffGeerling
@JeffGeerling 6 місяців тому
For video, I'd still stick with Mac. (Or Windows in some circumstances). There's no editors on Linux that can really match the workflow for serious editing. Though you can do a lot of the basics with Kdenlive and other OSS editors.
@secondskins-nl
@secondskins-nl 6 місяців тому
@@JeffGeerlingeach it's own but you mention how much is accelerated using GPU these days, like NVidia NVENC that's also true for a lot video editing, effects and such. Bit weird to stick with Mac for video other than being used to it's pace. If you get paid by the hour it's perfectly ok though.
@JeffGeerling
@JeffGeerling 6 місяців тому
@@secondskins-nl Apple still has top-class video workflow support, from Adobe, Blackmagic, and Apple (along with practically all the production/cinema vendors), and being able to edit with a mac running dozens of 4K streams with processing on top in full preview res with no fan noise is a blissful thing. I have a PC running Windows 11 with a 4090 and Ryzen 9 7900X. It does the same thing and can chew through tons of 4K or a few 8K with real-time processing at full preview res, it just burns 6x more watts and sounds like a hurricane doing it ;) To each their own!
AMD’s 128 Core MONSTER - Epyc Bergamo
22:57
Linus Tech Tips
Переглядів 1,9 млн
Raspberry Pi demolished by monster 128-core ARM CPU!
20:54
Jeff Geerling
Переглядів 1,3 млн
ВИРУСНЫЕ ВИДЕО / Мусорка 😂
00:34
Светлый Voiceover
Переглядів 8 млн
ФОКУС С ЧИПСАМИ (секрет)
00:44
Masomka
Переглядів 3,7 млн
Сын Расстроился Из-за Новой Стрижки Папы 😂
00:21
Глеб Рандалайнен
Переглядів 1,9 млн
The Slow Death of Windows
17:22
TechAltar
Переглядів 1 млн
Is this STILL the Fastest SFF GPU??? - RTX 4060 Low-Profile Review
20:04
Craft Computing
Переглядів 20 тис.
Rabbit R1: Barely Reviewable
19:53
Marques Brownlee
Переглядів 6 млн
Corporate Open Source is Dead
6:55
Jeff Geerling
Переглядів 285 тис.
Fast networking is cheaper than you think.
22:58
Hardware Haven
Переглядів 355 тис.
Framework Cyberdeck - DIY Portable PC
17:31
Ben Makes Everything
Переглядів 1 млн
This insane 80-Core ARM CPU easily beat a 64 Core Threadripper
16:02
der8auer EN
Переглядів 489 тис.
When Did Raspberry Pi become the villain?
21:54
Jeff Geerling
Переглядів 1,3 млн
Result of the portable iPhone electrical machine #hacks
1:01
KevKevKiwi
Переглядів 7 млн
I7 2600K тест в играх и сравнение с AMD Ryzen
17:53
Phone sees the future ! 📲🫣👽
0:38
BOGDANCHIKI
Переглядів 11 млн
How Neuralink Works 🧠
0:28
Zack D. Films
Переглядів 10 млн