The Petabyte Pi Project

  Переглядів 2,180,482

Jeff Geerling

Jeff Geerling

День тому

By far the most storage ever attached to a single Raspberry Pi. Massive thanks to 45Drives for making this possible!
Check out their storage solutions at www.45Drives.com/
Support me on Patreon: / geerlingguy
Sponsor me on GitHub: github.com/sponsors/geerlingguy
Merch: redshirtjeff.com
2nd Channel: / geerlingengineering
#PetabytePi #RaspberryPi #45Drives
Mentioned in this video:
- 45Drives (HUGE THANKS!): www.45drives.com/
- Storinator XL60: www.45drives.com/products/sto...
- My new rack build video on Geerling Engineering: • Massive Rack Upgrade f...
- 3-2-1 Backup Plan video: • Backups: You're doing ...
- GitHub issue with tons of detail: github.com/geerlingguy/raspbe...
- Raspberry Shake (seismograph): raspberryshake.org/
Contents:
00:00 - The Petabyte Pi Project
00:24 - Challenges
01:35 - What is a Petabyte?
02:00 - 45Drives sent a Storinator XL60!
03:12 - Ripping out the Xeon
04:57 - Replacing it with a Pi
07:26 - Backplanes and power
08:37 - Pre-build tests
09:06 - So. Many. Drives.
11:37 - Pi OS mods
12:17 - First boot!
13:12 - Did you try restarting?
14:42 - RAID 0 test
15:51 - Don't yell at your JBODs
16:39 - ZFS test
16:59 - Btrfs test
18:46 - Back to the basics
19:50 - 192 days
20:59 - So why, PCI?
21:57 - Don't skip this part
Music attributions:
The Curtain Rises by Kevin MacLeod
Link: incompetech.filmmusic.io/song...
License: filmmusic.io/standard-license

КОМЕНТАРІ: 2 300
@izzieb
@izzieb Рік тому
More bottlenecks than a Coca Cola factory.
@stoutscientist
@stoutscientist Рік тому
That was really good
@MindForgedManacle
@MindForgedManacle Рік тому
I already stole this 😂
@notvisibleconfusion
@notvisibleconfusion Рік тому
💀
@aaryanregmi4089
@aaryanregmi4089 Рік тому
💀💀
@nusermane1076
@nusermane1076 Рік тому
True, and also the caffeine consume is the same 😅
@bartz0rt928
@bartz0rt928 Рік тому
60 HDDs in RAID 0 is the definition of "all gas no brakes"
@staceixan
@staceixan Рік тому
Glorious, glorious RAID 0 🤩
@aim-at-me
@aim-at-me Рік тому
glass cannon lol
@monad_tcp
@monad_tcp Рік тому
Who needs raid, why aren't you using LVM and just mounting it as direct-IO.
@ampex189
@ampex189 Рік тому
And no steering
@himmelsrand7527
@himmelsrand7527 Рік тому
@@monad_tcp Who needs LVM. Just use ZFS.
@Axel_Andersen
@Axel_Andersen Рік тому
I had this colleague who told this story: I had boss who did not understand anything about computer or electronics. When ever I was trouble shooting something he would come and watch over my shoulder and comment "Have you checked the power?" This was very annoying as he did not understand anything. What made it doubly annoying that his advice was spot on so many times. Check the power!!
@RickSaffery
@RickSaffery Рік тому
This is a bonkers crazy setup. It's so mad I just had to watch, 100%. Kudos to you for getting it to work at all. Your persistence is inspiring.
@RaidOwl
@RaidOwl Рік тому
A petabyte and a raspberry pi, the crossover we didn’t know we needed. This is sick.
@plica06
@plica06 Рік тому
Except it's so slow it's like writing to one of those fake Chinese 2TB USB keys! Yes I fell for that scam on eBay.
@adrianteri
@adrianteri Рік тому
@@plica06 Why would you want a 2TB Thumb drive/Flash drive instead of an external HDD? They wear more quickly due to storage technology. Heck even utilities like #Ventoy now support HDDs which are more reliable and you can carry more OSes...
@AmruthReddi
@AmruthReddi Рік тому
literally Pi-tabyte lol
@dieSpinnt
@dieSpinnt Рік тому
@@adrianteri Man, it is a joke, a statement and also a confession by plica06. A joke like Jeff's project. Which doesn't matter, because it is not about if it is useful or practical, but: Showing that it CAN BE DONE! BTW Great job Jeff and respect to plica06!:) P.S.: I get it that you are joking, too: "They wear more quickly ..." NO! They don't wear, because they aren't real! Hehehehe
@dieSpinnt
@dieSpinnt Рік тому
@@adrianteri On the other side, when I take your comment serious ... I can't take you serious:) A portable media, 2022 in the 2TB range ..... and you say spinning media? That must be some back-to-the-future joke I do not understand. Anyway, misunderstanding plica06's grotesque comparison and taking it as a spin(a straw-man!) to compare USB-Sticks[2] against enterprise level hard disks (as in the video, or consumer ones doesn't matter) is like comparing apples to hard drives. Enterprise level HDDs have to be compared to enterprise level SSDs. Latter one beat the first in every point, including price[1], reliability, data safety and energy efficiency in that sector. [1] The purchase price is simply irrelevant compared to the maintenance costs, since the former is already planned in the product and can simply be written off after the service life of the medium. [2] actually 2TB(etc.) NVME "USB-Sticks"(miniature-Adapters) exist. Try to beat their MTBF (vs. spinning media that IS transported around) ... ;)
@SaxaphoneMan42
@SaxaphoneMan42 Рік тому
the PetaPi seems like a winner, I can only imagine the folks at 45 drives watching this with a mix of awe and horror.
@hannes-
@hannes- Рік тому
PetaPite
@glenncaughey5044
@glenncaughey5044 Рік тому
Just like watching two ships colliding. 😎🚢
@BradCozine
@BradCozine Рік тому
It IS a petabyte file server... but probably SHOULDN'T go with the name "PetaFile".
@michaelterrell
@michaelterrell Рік тому
@@BradCozine PETAFile, = a database of People Eating Tasty Animals!
@JasperJanssen
@JasperJanssen Рік тому
@@BradCozine petaPile?
@myrkat
@myrkat Рік тому
Pushing the limits of "hackey" tech is what most hardware and software engineers should be shooting for. Very well done. Kudos to 45 Drives!
@ChrisContin
@ChrisContin Рік тому
Fascinating! A lot of work! The issue it seems is having to tether to the hard-disk through a band shared with other drives, rather than universal. There is a technique where using data-points written to each hard-drive mathematical computations can rewrite and measure the data for inference machiningly- without even using the shared io-bus anymore! You’d see a slowdown in computation monologue but all speed ahead on as many drives as you want!
@45Drives
@45Drives Рік тому
Woah.
@EE-bk7qy
@EE-bk7qy Рік тому
Woah.
@iloveseals5208
@iloveseals5208 Рік тому
woah you get seal of approval
@venus007e6
@venus007e6 Рік тому
Woah.
@Todija
@Todija Рік тому
Woah.
@starling000
@starling000 Рік тому
Woah.
@SeanHodgins
@SeanHodgins Рік тому
Do I need a petabyte of storage? No. Would I mount one in my rack? Yes.
@kevinbissinger
@kevinbissinger Рік тому
30 years ago: Do I need a Gigabyte of storage? No.
@monad_tcp
@monad_tcp Рік тому
I absolutely need it to store my GPT-3 training data. Turns out having 16 GPUs wasn't enough, but I run out of space even before running out of compute time. Its probably more on the scale of 1PB + 8192 GPUs.
@monad_tcp
@monad_tcp Рік тому
10 years from now, Do I need a Petabyte, yes.
@sparkyispog
@sparkyispog Рік тому
do i really need 128 gb of storage on my mac? "your mac is almost out of storage" wait- whY IS MY SYSTEMS FOLDER 70GB whY does microsoft word take up 2gb?
@geraldcreager4432
@geraldcreager4432 Рік тому
@@kevinbissinger I still remember buying my first gigabyte hard drive. Kept a grad student in school.
@t7732155980
@t7732155980 Рік тому
Even though the idea is crazy, Jeff knows his craft. The tips at time 09:00 on how to select hard drives for the task are priceless!
@zb9458
@zb9458 Рік тому
Jeff you never cease to deliver, you're an absolute legend, great video!!
@happilicious
@happilicious Рік тому
How about one controller to one pi, then stripe 4 pis together, that should increase the throughput and scale down the issue to a more managable chunk, and still a pi project.
@JeffGeerling
@JeffGeerling Рік тому
This is probably the most reasonable way to do it, and use Ceph (or another network-based filesystem)... and indeed I will be testing that out soon. Still bottlenecked but probably more reliable and would not run into as many errors on the PCIe bus!
@qazwsx000xswzaq
@qazwsx000xswzaq Рік тому
I think you will need more than 4 Raspberry Pis unless the pool is for backup or archiving purposes only. Even a low-to-mid end Intel or AMD system would make more sense.. though then the project will lose its “Piliness”. 😂
@paulz1780
@paulz1780 Рік тому
@@qazwsx000xswzaq Wirh these Network Speeds you would need 1 Pi per Disk to max out the disks🤣
@qazwsx000xswzaq
@qazwsx000xswzaq Рік тому
​@@paulz1780 We can then expose each drive as an iSCSI target and aggregate them at a server over ethernet. And viola we have a not-so-poor man's SAN haha. It reminds me of those shiny new network addressable NVMoF SSDs btw.
@jamesfmilne
@jamesfmilne Рік тому
Gluster might be a good option too.
@twiceineverymoment
@twiceineverymoment Рік тому
"I was going to unbox this on camera but Fedex already did" As someone who's had multiple expensive items destroyed by Fedex and actively avoids doing business with companies that ship with them, I felt that...
@junkbob6832
@junkbob6832 Рік тому
The reason for FedEx packages being more damaged than USPS packages actually isn't FedEx's fault. They accept heavier packages than USPS, so that means if a shipper doesn't pack their package correctly, when that 150lb box of farm tools slams into your 6lb box of plastic in the sorting machine, your package gets crunched. Meanwhile USPS only goes up to 50(?) lbs, so your box just doesn't get slammed as hard. TL:DR your packages get damaged in FedEx because the shipper didn't pack them right. Source: I ship around 5 packages a day with all 3 major US carriers and rarely have an issue since I actually package my shit
@twiceineverymoment
@twiceineverymoment Рік тому
@@junkbob6832 That doesn't mean Fedex isn't partially at fault. Especially because on the rare occasion that I do get a damaged shipment from UPS, they leave a note with a number to call offering to pay for a replacement if the item is broken. Fedex just delivers my $300 PC case looking like a truck backed over it and acts like it's no problem. And that's without even mentioning the countless packages that never showed up at all, or that got delivered to the wrong address and sent me on a trip across town trying to find it.
@topquark22
@topquark22 Рік тому
Yep. I had a laptop battery crunched by FedEx. It's a wonder that they can stay in business this way.
@atlantic_love
@atlantic_love Рік тому
You just didn't package correctly, how is that FedEx's fault?
@adwilson0286
@adwilson0286 Рік тому
In 2021 I had 5 items fulfilled through FedEx. 1 never arrived, 2 arrived damaged, 1 arrived late, 1 was early and undamaged. "20% delivery reliability!" 😵‍💫
@BrenskiIP
@BrenskiIP Рік тому
As someone who is about to endeavor on a new NAS project, this was a fun watch! Thanks!
@JeffGeerling
@JeffGeerling Рік тому
just keep upgrading the drives, I'm sure there'll be 1 PB 3.5" drives in like 20 years 🤪
@EDATEC
@EDATEC 8 місяців тому
It's impressive that this works at all.
@MarcoGPUtuber
@MarcoGPUtuber Рік тому
I think i heard Seagate in the US having a heart attack watching drives being juggled. I live in Taiwan.
@CLU2O10
@CLU2O10 Рік тому
Linus would be proud
@hardwarefromthegarbage3446
@hardwarefromthegarbage3446 Рік тому
Torvalds too. Nice example of the Linux versatility
@shivsankermondal
@shivsankermondal Рік тому
electroboom too.
@TheBacktimer
@TheBacktimer Рік тому
I was waiting for the water bottle :D
@rvmiv_
@rvmiv_ Рік тому
It's a good example of why the ltt pedibyte project is so expensive
@subhimesto7123
@subhimesto7123 14 годин тому
​@@rvmiv_ First of all and most importantly, the speed, those were gen 4 nvme ssd drives with extreme speeds, second of all this is just a storage rig but what ltt built is a server, I mean have you seen how they run NASA simulation? And lastly there drives are extremely reliable
@bryanenglish7841
@bryanenglish7841 Рік тому
You are an absolute madman and I'm rooting for you the whole way
@Echobar
@Echobar Рік тому
Jeff another wonderful video. Keep up the great work.
@KlausWulfenbach
@KlausWulfenbach Рік тому
1942: "We need to figure out a solution to digitally store dozens of bytes at a time. Vaccuum tubes, maybe? This is going to cost us millions, but it will be worth it to finally have accurate artillery range tables!" 2022: "I'm going to hook up this petabyte of data storage to this cheap single board computer!"
@vaisakhkm783
@vaisakhkm783 Рік тому
1969: "we need to figure out how to use our cutting edge 1.5 million doller, 4kb ram 32kb harddrive to bring people moon" 2025: lauching a rocker with a single board computer...
@Nordlicht05
@Nordlicht05 Рік тому
Wait until the average person doesn't click on a video below 16k 120fps. There will be always a way to fill it 😅 but I remember it's gotten better over time.
@NovemberOrWhatever
@NovemberOrWhatever Рік тому
@@vaisakhkm783 Pi's and Arduino's are already used for avionics on model rockets, and cubesats can have total build costs of like $50,000. It's amazing how far the industry has come and is going
@AlbaxArcade
@AlbaxArcade Рік тому
I always like to think that every time Jeff publishes a new video, the raspberry pi design team feels a disturbance in the force.
@jstan5802
@jstan5802 Рік тому
As they should, it's been 3 years, where's the raspberry pi 5?
@UnderEu
@UnderEu Рік тому
I find your lack of faith disturbing 😅
@Thinktank-rn6dm
@Thinktank-rn6dm 6 місяців тому
@@jstan5802 you're gonna be happy
@boolightningstudios
@boolightningstudios 4 місяці тому
@@jstan5802 out now?
@stickmanland
@stickmanland 4 місяці тому
@@jstan5802 Here.
@markonfilms
@markonfilms Рік тому
This is an awesome project. Makes me wanna build a large storage server.
@rollerboogie
@rollerboogie 21 день тому
As an HDD engineer usually drives are built to compensate for vibration due to certain fan RPMs. Especially if we have a big customer we'll optimize things for the frequencies of vibration in their trays.
@tyrdchaos
@tyrdchaos Рік тому
I hope LTT sees this. Great stuff as always, Jeff!
@LyokoisGreat2
@LyokoisGreat2 Рік тому
I wondered how long it would take for LTT to be mentioned in the vid
@scbtripwire
@scbtripwire Рік тому
Oh my goodness, that juggling of those drives.😳
@wayland7150
@wayland7150 Рік тому
Every IT engineer has a stack of broken hard drives just for juggling with.
@JeffGeerling
@JeffGeerling Рік тому
Haha true.
@SyphistPrime
@SyphistPrime Рік тому
@@wayland7150 yep, pretty much. My old boss when I worked in computer repair used laptop hard drives to level out his microscope. Funny thing is the drives weren't even dead, they were just something like 160GB 5400RPM drives that were more useful for that task than storing data.
@communitycollegegenius9684
@communitycollegegenius9684 Рік тому
I just completed a 24 X 18T build that's almost 1/2 PB. I bought the drives a few at a time all Segate recertified it seems to be the sweet spot in price for me. I had to upgrade and rebuild things several times. Your video was just like my experience switching OS, FS, etc. I had endless drives/arrays just dropping out, mostly on startup. I tried Fedora, Centos, Open Suse, Suse JeOs, and Ubuntu Server all let me down for one reason or another. I tried them with various shares, raid arrays and file systems; plus not all would run my app. I ultimately got it working with Ubuntu workstation and the Ubuntu share - no samba. I'm not happy with ZFS. I had to kill the swap and add more RAM (which meant a new motherboard) to keep ZFS cache from crashing. I solved the dropouts problem by putting the drives really close to the host board and using short/expensive data cables that are all the same length. I'm using a very old Athlon FX-8300 8 core and 64 gig of Ram. I found a great last generation Adaptec 52445 raid card new old stock. I had to install 2 power supplies and rewire one to all molex to get enough amperage on the 5 volt rail. New 1200 watt supplies have plenty of 12v but almost no 5v power. I also upgraded to 2.5 gbps network card. The write speed is NOT stellar. With Raid0 it goes real fast at first 450mbps filling all that cache, but slows down to about 150mbps. With a single JBOD I only get 130mbps, two drives at the same time still go 130 each, and I can transfer to 4 drives at the same time before it bogs down the network and write speed drops to 70 each drive or 280 total. I already got 4 sas expanders and plan to continue adding drives (and power supplies) up to 2.5 PB. My box is an old IBM 2401 tape drive converted to rack space. I yell at the You Tube screen, not my computers. That's not true I also yell at my computer at work (it's windows).
@z98i87
@z98i87 Рік тому
This tutorial helped me so much!! Thanks!
@MichaelDude12345
@MichaelDude12345 Рік тому
I saw you posted some on the homelab subreddit last week, THIS is what you were hiding from us?? What a fun idea. Can't wait to see your next project!
@JeffGeerling
@JeffGeerling Рік тому
Homelabbers unite!
@jmr
@jmr Рік тому
You tried all the things I wanted to see. You know your audience!
@JeffGeerling
@JeffGeerling Рік тому
The last thing I wanted to try (even had it in the final edit but cut it for time) was 4x hardware RAID cards... but I only have one on hand. I was thinking of setting up 4 hardware RAID 6 arrays, then uniting them on the Pi as a RAID 0 array and seeing if that performed better since individual drives would all go through one HW raid card, and that would also give redundancy. (And who said hardware RAID is dead? You still need it if your computer performs like one from 2010!).
@Winnetou17
@Winnetou17 Рік тому
@@JeffGeerling Yeah, exactly, who said hardware RAID is dead ? Clearly unrelated, did you ever thought of doing a collaboration with Wendell ? Him and Red Shirt Jeff would surely push themselves to insanity :D Seriously speaking you two seem to do similar exploratory courses, though, of course, he's much less Pi-centric.
@adrianaa3059
@adrianaa3059 Рік тому
I think Arthur C Clarke hypothesized that this would be enough to store a few people's minds into it
@jerryw1608
@jerryw1608 Рік тому
Well, pairing €20.000 - €24.000 worth of drives with a 100 dollar raspberry seemed like a solid plan to start with. Having it run a 1pb raid0 config while hosting that storage seems like a task made for this little arm processor 😂 Nice to see you try out such extreme things with the raspberry😀
@TheOleHermit
@TheOleHermit Рік тому
Glad to see Red Shirt Jeff back. I was wondering what happened to him. Geez, your projects are soooo extreme and cutting edge. Your troubleshooting processes are very informative and helpful to your viewers, who otherwise wouldn't have a clue where to look. We never see that on "How to" setup videos, where everything just automagically works. BTW, I recently used Styrofoam to separate 2 prototype boards while testing them out. It got hotter than expected and the foam sagged, allowing the power rail of one board to touch the other board, which fried as soon as I powered up the following morning. Don't try this at home, folks. Cardboard insulation good. Styrofoam bad. Thanks for sharing Jeff. I'm fully confident that you'll get your bandwidth up on the 60 HDD RAID. Looks to me like you're already nearly there. Instead of installing it inside a server rack, perhaps a locked cage would be a better idea to protect it from Red Shirt Jeff. 😎
@JeffGeerling
@JeffGeerling Рік тому
Heh, cardboard insulation 'better', but I now have it on a 3D printed box that's a little more secure too.
@TheOleHermit
@TheOleHermit Рік тому
@@JeffGeerling Good point. 👍 Now, my 2nd attempt is safely mounted inside a custom acrylic case, too. One less thing to worry about, amongst the multitude of other potential mistakes. Live & learn, eh? Just gotta keep putting one foot in front of the other until the final goal is achieved, right? The hardware for the Teensy Laser Synth waveform module w/ILDA DAC is complete. Only need to flush out the code to full functionality, before moving on to the custom MIDI controller. BR 😎
@maxd7228
@maxd7228 Рік тому
Jeff, you have carved yourself a niche channel in an overcrowded tech community, taking the Pi to new heights in every video. Keep it up. Loving every video. I'm left astounded on the Pi's capabilities and untapped potential.
@axlslak
@axlslak Рік тому
I run something similar but without the pci-express stuff. Only usb2sata adapters. I am using a german one, that has a power supply that gives 2 amps for 12 volt circuit of the hdd, and 2 amps for the 5 volt circuit, therefor not using power from the PI usb at all. 2 harddrives per pi, running in simple raid1, xfs. I mostly use nfs instead of samba and for the most part it saturates 1GB connection (exception being many small files... but it's common). Less so with writing, where sometimes goes a bit under (as it has to write on 2 different drives and sometimes it falls to 700-800ish MB/s), but other than that, pretty practical. I've been using this setup since the pi4 came out and am very happy with it, especially when it comes to the electric bill. I tried connecting all of them to a single pi, but there were stability issues at times. Might have been the usb hub I was using, or some of the cables, so reverted back to 2 harddrives per pi, one in each usb3 port.
@whitey4986
@whitey4986 Рік тому
Hey Jeff, I’ve been using your ansible roles for almost 10 years. Love seeing you around HN and the other traps, great to see your UKposts channel is doing so well. Very cool projects!
@wildbikerbill6530
@wildbikerbill6530 Рік тому
When I first saw the title I figured this would be one of your massive multiprocessor/multi-Pi projects combined with a massive amount of storage. Something closer to 20 TB per Raspberry Pi. At 60 Drives and 60 Raspberry Pi's, that would still be way beyond any normal homebrew project.
@Bronathan
@Bronathan 6 місяців тому
This showed up in my recommendations. I have no clue what happened here and I know nothing about data managment or IT in general. But I watched it to the end not realizing that this vid is 22 minutes long 😅good content 9/10
@jeffreyumeh8580
@jeffreyumeh8580 Рік тому
Hu, I know that modern cases have that 1 stud so you can place your motherboard in on the stud which makes it easier to align the rest of the screws, but even then every motherboard I have installed has had 8 - 9 screw holes, sometimes there is a heatsink or something in the way of having all 9 screw holes, but 8 is the minium I have seen, this is only for full ATX though not ITX / mATX.
@haidenshober6732
@haidenshober6732 Рік тому
Many Pi’s running something like Minio might allow for some interesting single box hardware redundancy. Also might be able to get over the 1Gb limitation since you’d have many pi’s each with their own 1Gb connections.
@MarcoGPUtuber
@MarcoGPUtuber Рік тому
Jeff Geerling: My storage setup registers on the Richter Scale
@StrongbowTX
@StrongbowTX Рік тому
OK, this is the kind of cool content I like seeing appear on my birthday! (even if I didn't see it until 5 days later.) Just ONE of those drives is more storage than everything in my house, and I do Unixy networky stuff for a living. :)
@badwolf1984
@badwolf1984 Рік тому
Ripps out high end setup for a PI! You Monster! Though a Fun Project to play with a pi, maybe in the future with a PI 14 hope you enjoy your new Perabyte Server
@SpiritmanProductions
@SpiritmanProductions Рік тому
Maybe the CABD order is like the firing sequence of a 4-cylinder petrol engine lol 🤔
@falxonPSN
@falxonPSN Рік тому
The amount of power this thing draws could probably be measured more easily in horsepower, so you're not wrong there!
@JasperJanssen
@JasperJanssen Рік тому
@@falxonPSN the PSU isn’t that big - a horsepower is roughly 750W, so it’s going to be around the 1-2 mark.
@falxonPSN
@falxonPSN Рік тому
@@JasperJanssen fair enough. I can't argue with good pedantry! 🤪
@fohkukohgeki
@fohkukohgeki Рік тому
@@JasperJanssen Could run a petabyte server off a lawnmower engine...
@falxonPSN
@falxonPSN Рік тому
@@fohkukohgeki this is the project we need to see!
@devnol
@devnol Рік тому
One thing I saw you doing when wiring up the NAS was that you connected the molex adapters to two sata power connectors on the same line. If the psu has another line, try connecting to that, as each line can draw a limited amount of current and there might be an issue there. Then again, the issue might be anywhere else but that's a thing you can easily try
@InterprisesTV
@InterprisesTV Рік тому
Jeff, I'm catching up with thx, since I've admired your ability for a while. I am an old geek and you do magic. Reminds me of my S-100 days with CromixOS.
@spoils8179
@spoils8179 Рік тому
1 month ago and no recognition? Especially for a $50 USD donation? Sadge.
@InterprisesTV
@InterprisesTV Рік тому
@@spoils8179 Thanks for the sentinent, Aiden, but no problem. Hope he's doing well, and you as well for that matter. 👍
@abdusaidabduraufov5615
@abdusaidabduraufov5615 Рік тому
My God! 20TB 60HDD I never dreamed of such a volume
@b00573d
@b00573d Рік тому
Be careful with those sata to molex adapters...they are prone to fires!
@JeffGeerling
@JeffGeerling Рік тому
Red Shirt Jeff liked this comment.
@braydennturner
@braydennturner Рік тому
Amazing content as always, keep up the great work!
@stuffinfinland
@stuffinfinland Рік тому
Yet another absurd carage setup. Just love it!
@ryda2l
@ryda2l Рік тому
Awesome video & project!
@crashmatrix
@crashmatrix Рік тому
Well Jeff, one thing's for sure, if you and others aren't pushing the pi to bleed on the edge, no progress will ever be made in this direction. I'm not sure what kind of useful stuff this direction will yield, but it surely will yield something. Keep on pushing ya madlad!
@JeffGeerling
@JeffGeerling Рік тому
My hope is the next Pi at least has the PCIe bus bugs sorted so any card will 'just work'. After that, any more bandwidth they could squeeze out would be appreciated. The CM4 is actually great for many 1 Gbps network use cases-but with a little more bandwidth, it could be great for 2.5 Gbps (or heck, more than that if we're dreaming!).
@levygaming3133
@levygaming3133 Рік тому
@@JeffGeerling with a USB3.something port, it could get almost-5Gbe, so not that much of a stretch for faster than 2.5gbe
@MrSnuffyX
@MrSnuffyX Рік тому
It almost hurts to watch the board being removed. I've been wishing for a storeage like this for decades. Still a very interesting project.
@brandonjohnson3566
@brandonjohnson3566 7 місяців тому
Ive always wondered if there was a way to make a external ram board like hp has one that adds 4 ram slots on a pcie. Ive ran into issues of bottlenecked performance not because of cpu but because ram capped out. The pi has only max of 8gb so i would assume max speed on drives could only be met with maybe 6 to 10 drives at most. If you were to add an additional 128gb i wonder if that would help out. Plus the raid cards you had i couldnt tell if they had their own memory or passthrough.
@red03golf
@red03golf 10 місяців тому
Jeff - I mourn you missed seizing the opportunity to officially name this The Pi-tabyte Project - a portmanteau teed-up for a long drive, but you duffed it, lol. - great vid, keep 'em coming, I'm having a grand time doing some of your projects. Cheers.
@JPToto
@JPToto Рік тому
This is amazing. Good job charming 45drives into sending the case! They must be Red Shirt fans 🤣
@mahtin
@mahtin Рік тому
Even before doing a raid or zfs test; I’d have run a single drive test (either benchmark or simple linear read/write). Loop that 60 times and see success. Then repeat with two drives in parallel and loop 30 times. Redo with three in parallel 20 times, etc etc. When you start seeing failures you really will see the cause-effect point. There is technically no reason why this won’t work with 60 drives - if you ignore performance. Any bugs that are exposed that can be fixed will simply improve the base users world. This is an awesome test that pushes the RPi and kernel to the limit. Making this work at that limit helps all of us just running one drive. Plus, the errors you saw, as bad as they were, should somehow restart the drive (without a reboot).
@JeffGeerling
@JeffGeerling Рік тому
When those errors occurred, the HBA reset itself and the drives always came back-at least 15/16 of them! That was one concern from the Broadcom engineer I spoke with, and the reason he really wanted me to run the latest firmware. Unfortunately due to time constraints I couldn't flash all the cards in a separate PC then bring them back to the Pi and re-test. But I plan on trying that out.
@JeffGeerling
@JeffGeerling Рік тому
Extra testing has been done-tl;dr the breaking point is 3 cards (or more than 30 direct attached drives). But forcing PCIe Gen 1 speed also fixes the issue. More to come in my next video!
@magicmulder
@magicmulder 7 місяців тому
Next week: A 1024 CPU server running off a 1.44 MB floppy disk.
@97oweb
@97oweb Рік тому
The real impressive thing ist that he was able to get a pi compute module 4 I am trying to find one now for almost a year but the only place I even found one listed was on Amazon in the us, it is sold out since then, here in europe I did not even find a seller who listed it
@nezu_cc
@nezu_cc Рік тому
My NAS is a pi4 with a desoldered USB chip to expose the PCIe bus. It's connected to a PCIe switch(to improve transmission and prevent crashes) and then to a cheap ASMedia SATA controller. The kernel is also patched to force PCIe gen 1 to prevent crashes(this is probably the same thing that happened to you in the video btw). PCIe gen1 is still faster than gigabit so it doesn't matter. Over samba or NFS I can get the full gigabit speed even on large data transfers so no bottlenecks there. Would I recommend this setup? no. Does it work? hell yeah (longest uptime was like 90 days or something and then a power outage killed it, I need to get(or more likely make) a UPS, I know). If anyone wants photos lmk
@JeffGeerling
@JeffGeerling Рік тому
Nice. Coreforge also suggested forcing Gen 1 speeds elsewhere in the comments, so I may need to test that out.
@mthqwork123
@mthqwork123 Рік тому
@@JeffGeerling That was my first idea as well. Those switches are used to run gen1 or maybe max gen2, when used for GPU mining. Above that it will throw errors.
@caseyhefner1966
@caseyhefner1966 Рік тому
Peta-Pi go BRRRRR Im genuinely surprised this worked, very impressive. As for what to do with it, a video on downloading/hosting a local copy of Wikipedia would be pretty cool.
@aajpeter
@aajpeter Рік тому
The slow storage movement. Bytes aged to perfection, not for those in a hurry but for folks who appreciate a good long wait for quality unbalanced spectacle in design.
@volvaary2724
@volvaary2724 Рік тому
Thanks for this idea to store my stuff.
@Neilhuny
@Neilhuny Рік тому
Absolutely fascinating and completely, totally barmy! What an immense amount of work you obviously put in to this - research, sweet talking 45Drives, research, RAID solutions, research, talking to 45Drives techs, research, etc. I am *extremely* impressed, both by you AND by 45Drives for their courage! And my abiding thought? "Chassis" is pronounced "shassey", not "chassey"! Pfft! So I looked up chassis in Cambridge Dictionary to back my obviously accurate opinion and ... well, blow me, North Americans really do say "chassey"! Every day is a learning day! Pedantry isn't good
@JeffGeerling
@JeffGeerling Рік тому
lol we North Americans are weirdos. Or maybe you are... I guess it's a matter of perspective, neighbour!
@Neilhuny
@Neilhuny Рік тому
@@JeffGeerling I have to picture you in dungarees, a battered straw hat, and with a long stalk of grass hanging from you mouth when you call me neighbour! Hock-diggardy, or something like that
@jaxxarmstrong
@jaxxarmstrong Рік тому
60x USB-connected HDDs... I'm telling you, a missed opportunity :D
@marcogenovesi8570
@marcogenovesi8570 Рік тому
Yeah use the maximum possible number of USB hub daisy chaining
@JeffGeerling
@JeffGeerling Рік тому
Heh, the USB controller would probably just set itself on fire!
@pantegministries
@pantegministries Рік тому
I have a Pi running RAID1 with 2 External SSDs and on times I have to reboot as it seems to forget that the disk exists. That's Pi 4 and Samba and a Window share. So to get as far as you did its amazing.
@niklasxl
@niklasxl Рік тому
i was guessing and hoping this was the project in progress when i saw the rack update video :D
@zambonidriver42
@zambonidriver42 Рік тому
I have smaller versions of those EXOS drives. >200 drives, over the past 5 years, I’ve had 4 failures. 3 covered under RMA. 1 had just expired.
@JeffGeerling
@JeffGeerling Рік тому
For every anecdote (usually it's "all my Seagate drives exploded in giant fireballs!"), there's an opposite anecdote. In aggregate, if drives like these were truly failing at the rates some people think, Seagate would not be in business :)
@llortaton2834
@llortaton2834 Рік тому
@@JeffGeerling The reason all of this information was popularized in the first place is because of backblaze's reports (back in the day) but if we look at their stats now, year over year, seagate is constant.
@marcogenovesi8570
@marcogenovesi8570 Рік тому
EXOS and businness drives in general are fine, it's the consumer lines that are more "hit and miss", but even then it's easy to find a pattern unless you buy hundreds of them. I.e. a brand doesn't just "consistenly fail 4x more than another"
@goodlookinglady7536
@goodlookinglady7536 7 місяців тому
Could we please get an updated video with the pi 5 and see how the two setups compare
@JeffGeerling
@JeffGeerling 7 місяців тому
I do plan on this, though stepping up to it from a slightly different angle :)
@ronguin7062
@ronguin7062 Рік тому
This guy is the only human capable of winning in a debate with Data from star trek.
@gmaildinozz
@gmaildinozz Рік тому
"I was thinking about unboxing it on camera but fedex already did". Best punchline :'-)
@MegaManNeo
@MegaManNeo Рік тому
**CM4:** Hey Jeff! What are we going to do today? **Jeff:** You will handle 60 enterprise grade hard drives. **CM4:** Oof
@thomasgoirand488
@thomasgoirand488 Рік тому
Hi there! I work for Infomaniak, and I am managing storage networks. We offer backup services and connected drive (we call it kDrive, it's a bit like google drive, just we protect your data and they are hosted in Switzerland). Adding 96 Exos 20TB HDD in my swift storage cluster is what I do every day. I manage "moderately large" Swift clusters. On them, we add 6 2U servers at a time, each of them holding 16 HDD (so 6*16 = 96 HDDs). In total, I calculated that I am managing more than 5000 spinning drives in our OpenStack swift clusters, which amounts probably around 100PB (so not just 1 PB like you're doing...). Just this week, I added 12 HDD storage nodes, and 6 proxies (with 2x SSD each, plus the system drive), and 9x NVMe storage nodes (10 NVMe each, to be used in a Ceph cluster). Your idea of a Raspbery-Pi is fun, but instead of one RPi for all HDDs, I would setup one RPi *PER* HDD, and then it all makes sense, and you may have decent speed. BTW the casing you bought seems of very bad quality compared to what we get from HPe or Lenovo. I've been running this service for nearly 5 years now, and we never lost a single bit of data... :)
@marcodebortoli
@marcodebortoli Рік тому
It was only about time that the new storage measurement of reference was going to be the petapite... I'll show myself out :D
@hoagy_ytfc
@hoagy_ytfc Рік тому
Jeff doesn't have the word "why" in his vocabulary :) (Just kidding, things like this are fun, which is "why" enough for me).
@marcogenovesi8570
@marcogenovesi8570 Рік тому
45drives marketing team is on a roll
@45Drives
@45Drives Рік тому
hi
@wayland7150
@wayland7150 Рік тому
Were they taking ab big chance? What if Jeff had proved you could do it all better with a PI? Hahaha.
@YeOldeTraveller
@YeOldeTraveller Рік тому
@@wayland7150 Not much of a chance. They are well aware of the performance of their product, and the limitations of a Pi.
@miolini
@miolini 5 місяців тому
Integrating a Pi compute module with Ethernet into each hard drive can make scaling easier. It lets each drive connect to a network independently, simplifying data handling in big storage setups.
@cervyvin
@cervyvin Рік тому
Very interesting watching "The PP Project"! :D
@falcychead8198
@falcychead8198 Рік тому
As a St. Vincent fan, I'm calling it "Pietabyte" whether anybody else does or not.
@davidmcken
@davidmcken Рік тому
I'd almost be interested in seeing each raid card assigned to 1 PI and then them clustered together.
@nathan0401_
@nathan0401_ Рік тому
this is amazing
@hahnlothar
@hahnlothar Рік тому
Hi - im interrested in the "working mat" on your table - where i can get this? And thanks for the entertaining clips!
@dpz7184
@dpz7184 Рік тому
I am new to the channel and this was my first video of yours! On simple question...how does one...learn about everything you just did and said in this video. It’s all so interesting but so much of it is over my head...
@BioToxin
@BioToxin Рік тому
I would really like to see you build a cluster based file system like gluster or ceph, run that on a pi cluster with 2 to 4 drives per pi, then show performance
@Space_Reptile
@Space_Reptile Рік тому
Next step: 3.14PB on a PI
@namgibau
@namgibau Рік тому
OMG the original hardware is so beautiful it brought a tear to my eye to see it removed. Given the resources I would have 2 or 3, one for local redundancy and one for off site but given net speeds the off site would mostly be pointless. And no I don't need one, I just want one so bad it feels like I need it :)
@reinoud6377
@reinoud6377 Рік тому
Naive guess: regarding the harddrives going offline/faulting when using other schemes than sequential/linear, it might just be a power issue where addressing that many discs trough the PCIE cards might draw too much current and the cards get below a voltage threshold on the expansion board.
@zambonidriver42
@zambonidriver42 Рік тому
How many times did you recompile the kernel?
@Finkelfunk
@Finkelfunk Рік тому
If lsblk starts to run out of letters and shows drives as "sdaa, sdab, sdac" etc. you know you have a data hoarding problem.
@johnnytarponds9292
@johnnytarponds9292 Рік тому
Hey! 45 Drives is here in my home town!
@Marco-gx7ok
@Marco-gx7ok Рік тому
I am afraid someone addicted to this work is called a petaphile person! 😱 But great, that there are people doing these kind of projects!
@jbrown-acuity
@jbrown-acuity Рік тому
Would love to see this with a RISC V processor
@Jimmy_Jones
@Jimmy_Jones Рік тому
"I grabbed a small piece of cardboard to insulate the boards from each other. At least temporarily." Yeah. Sure. I think red shirt Jeff was trying the break back into the room.
@wayland7150
@wayland7150 Рік тому
You can see red shirt Jeff is actually in the room with Jeff at one point in the video.
@_caith
@_caith 3 місяці тому
I built my first datahoarder set up using nothing but 8tb SSDs XD a whole bunch of them. solves the issues of "spinning" in an easy way
@twinshobbytwinshobby3863
@twinshobbytwinshobby3863 Рік тому
Super Projekt !
@popcorny007
@popcorny007 Рік тому
They must really trust you to borrow $35k+ worth of disks
@JeffGeerling
@JeffGeerling Рік тому
s/'borrow'/'juggle' :D
@2nd_bloxx
@2nd_bloxx Рік тому
*happy linus noises*
@JeffGeerling
@JeffGeerling Рік тому
How many days until "I lost all my data" video? :) But hey, if it summons Wendell from L1Techs, it's kind of a win-win.
@km6mmo843
@km6mmo843 Рік тому
Yay! Tiny computers doing huge things!!! 🔥🔥🔥
@Unidentified7002
@Unidentified7002 Рік тому
I'M SO INTERESTED IN THIS PETABYTE THING
@GH05TYPLAYz
@GH05TYPLAYz Рік тому
dang, fedex unboxed it first huh. not surprised
The ULTIMATE Raspberry Pi 5 NAS
32:14
Jeff Geerling
Переглядів 1 млн
How I almost broke MrBeast's Ages 1-100 video
24:00
Jeff Geerling
Переглядів 1,5 млн
Помилка,  яку зробило військове керівництво 🙄
01:00
Радіо Байрактар
Переглядів 307 тис.
Meet the new SBC Linux Cluster King!
31:05
Jeff Geerling
Переглядів 205 тис.
Unboxing a Petabyte
9:50
The Slow Mo Guys 2
Переглядів 504 тис.
Raspberry Pi demolished by monster 128-core ARM CPU!
20:54
Jeff Geerling
Переглядів 1,3 млн
When Did Raspberry Pi become the villain?
21:54
Jeff Geerling
Переглядів 1,3 млн
AI Leader Reveals The Future of AI AGENTS (LangChain CEO)
16:22
Matthew Berman
Переглядів 32 тис.
I Spy, with my Little Pi...
23:46
Jeff Geerling
Переглядів 235 тис.
I made the Petabyte Raspberry Pi even faster!
9:32
Jeff Geerling
Переглядів 177 тис.
Control ANY COMPUTER with these Pi KVMs!
17:46
Jeff Geerling
Переглядів 788 тис.
ЭТО САМЫЙ МОЩНЫЙ ИГРОВОЙ СМАРТФОН ЗА 270$ 🔥
13:33
Thebox - о технике и гаджетах
Переглядів 49 тис.
Компьютер подписчику
0:40
Miracle
Переглядів 193 тис.
Infrared Soldering Iron from Cigarette Lighter
0:58
ALABAYCHIC
Переглядів 1,8 млн
I KNEW I’d Hate this (I Was Wrong)
12:19
Linus Tech Tips
Переглядів 617 тис.