
Backing Up the Petabyte Server

53 minutes ago, poochyena said:

Wouldn't this make it their 3rd storage location, since YouTube counts as one?

They back up the RAW uncompressed 8K video before it's edited in any way, so NO, YouTube and its shitty bitrates don't count.



image.png.03c38eac61e843c01359c0d48bcb8173.png

 Since we're already operating in Hyper-Pedantic Mode: 

  • "i.e." means id est, "that is."  
  • "e.g." means  exemplī grātiā, "for the sake of example."

In this context, you should've corrected it to "e.g."  

Hard to get away with something like that on a channel filled with nerds.


6 hours ago, nicklmg said:

 

I have no idea how big your room is, but what about fireproofing, concrete walls, etc.? Unless the fire starts in that room, that is. It is really rare nowadays for fire to cause tons of damage to a concrete structure (going by the pictures I have seen of your office), and in places like Burnaby the response is fast. I would build a concrete structure with plenty of sprinklers, etc. around it. It could come in close to your $20,000 or less. Obviously in Van the first contractor will want $100,000, but not if you keep it simple. Electricity and cooling during the fire may be the expensive part, unless you could put it to sleep remotely... just thinking out loud!

 


Umm... No one has mentioned the audio being out at 1:14? It actually looks like it started at 57 seconds in...


Thanks for killing this for the rest of us.

 

6 hours ago, raf42 said:

No idea if it can scale up, but I've found Wiztree (https://antibody-software.com/web/software/software/wiztree-finds-the-files-and-folders-using-the-most-disk-space-on-your-hard-drive/) works just like WinDirStat, but WAYY faster, at least at a small, few TB scale.

I tested most of the popular applications recently, and TreeSize Free was the fastest by a significant margin when working on 50TB of data (500K files).

The only one faster was TreeSize Professional, which is about 50% faster than the free version - and that's about 10x faster than WinDirStat.

WizTree was disqualified from the test by requiring Admin permissions to run. That's never going to happen.


8 hours ago, VegetableStu said:

my entire life I thought e.g. was just a shortening of example o_o

It makes sense that it's Latin - a lot of expressions in English come from Latin.

 

Plus, there's no 'g' in "example", so e.g. being a simple shortening of example doesn't realllly work.

For Sale: Meraki Bundle

 

iPhone Xr 128 GB Product Red - HP Spectre x360 13" (i5 - 8 GB RAM - 256 GB SSD) - HP ZBook 15v G5 15" (i7-8850H - 16 GB RAM - 512 GB SSD - NVIDIA Quadro P600)

 


19 hours ago, Verond said:

You should use Wasabi. They don't charge for pull and are like 80% cheaper than AWS and Google Cloud.

 

We use them for video storage and streaming. 

I was going to suggest that too! But if you do the math, at $5 per TB, that would end up costing $5,000 for a petabyte of storage. So the jank Google drive strategy starts sounding more worth it.
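
The arithmetic behind that figure can be sketched as below. This assumes the $5 per TB rate is billed per month (the post doesn't say) and that a petabyte is counted as 1,000 TB; the function name is made up for illustration, and egress or request fees are ignored.

```python
def monthly_storage_cost(capacity_tb: float, usd_per_tb_month: float) -> float:
    """Flat-rate storage bill for one month, ignoring egress and request fees."""
    return capacity_tb * usd_per_tb_month


TB_PER_PB = 1000  # decimal petabyte (assumption)

per_month = monthly_storage_cost(1 * TB_PER_PB, 5.0)  # $5/TB, assumed monthly
per_year = per_month * 12

print(f"${per_month:,.0f} per month, ${per_year:,.0f} per year")
```

Under those assumptions the $5,000 is a recurring monthly bill, not a one-off, which is what makes a flat-fee plan look attractive by comparison.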


I do the same, you should have asked me for advice.

 

 

drive.png


One issue with the Google Drive system is that Google has a history of removing accounts that it does not like, even if they do not violate any specifically stated rules, so there is a good chance that Google will notice the data use and ban the account.


I put 18TB on Google Drive Enterprise ($10/mo for unlimited... I was grandfathered in from when it allowed unlimited for 1 user) 2 years ago, as an offsite backup of my 26-drive RAID 60 array.

 

Proof:

 

image.png.57e265eb193ed4922779b651023942a4.png

 

It's not really a great solution. Their whole infrastructure is not meant to be a backup target for gigantic 300GB files. Both downloads and interrupted uploads are a problem, and their app also doesn't like working with 300GB files. Maybe it would be better if I had > 35Mbps internet upload speed, but it really felt janky to me and not reliable.
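
To put the upload speed in perspective, here is a rough sketch of the transfer times involved. It assumes the 35 Mbps link is fully saturated the whole time, decimal units (1 TB = 10^12 bytes), and no protocol overhead or retries, so real numbers would be worse. The function name is hypothetical.

```python
def transfer_days(size_bytes: float, link_mbps: float) -> float:
    """Days needed to move size_bytes over a link sustained at link_mbps."""
    seconds = size_bytes * 8 / (link_mbps * 1_000_000)
    return seconds / 86_400


TB = 10**12
GB = 10**9

full_backup_days = transfer_days(18 * TB, 35)
single_file_hours = transfer_days(300 * GB, 35) * 24

print(f"18 TB at 35 Mbps: about {full_backup_days:.0f} days")
print(f"One 300 GB file at 35 Mbps: about {single_file_hours:.0f} hours")
```

That works out to roughly 48 days for the whole 18TB and about 19 hours for a single 300GB file, so one interrupted upload that can't resume costs most of a day.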

 

TLDR: Make sure you can actually  DOWNLOAD the data after you upload it.

Workstation:  14700nonk || Asus Z790 ProArt Creator || MSI Gaming Trio 4090 Shunt || Crucial Pro Overclocking 32GB @ 5600 || Corsair AX1600i@240V || whole-house loop.

LANRig/GuestGamingBox: 9900nonK || Gigabyte Z390 Master || ASUS TUF 3090 650W shunt || Corsair SF600 || CPU+GPU watercooled 280 rad pull only || whole-house loop.

Server Router (Untangle): 13600k @ Stock || ASRock Z690 ITX || All 10Gbe || 2x8GB 3200 || PicoPSU 150W 24pin + AX1200i on CPU|| whole-house loop

Server Compute/Storage: 10850K @ 5.1Ghz || Gigabyte Z490 Ultra || EVGA FTW3 3090 1000W || LSI 9280i-24 port || 4TB Samsung 860 Evo, 5x10TB Seagate Enterprise Raid 6, 4x8TB Seagate Archive Backup ||  whole-house loop.

Laptop: HP Elitebook 840 G8 (Intel 1185G7) + 3080Ti Thunderbolt Dock, Razer Blade Stealth 13" 2017 (Intel 8550U)


On 23/08/2018 at 9:38 PM, poochyena said:

Why would they need all of the raw footage though? Tbh, I don't see why they need to backup reviews of 3+ year old hardware anyways.

Linus is a data hoarder. Sure, they don't need most of the raw footage from their old videos, but when you can make money doing videos about your backup plan, you might as well. We get to geek out over the petabyte project and Linus gets to geek out over it as well. Although at this point it's probably more sensible to back up only the finished videos in their highest quality to cloud storage (plus raw footage from more recent videos), while keeping the rest of the raw footage locally for as long as space is available.


So they shelled out however many hundreds of thousands of dollars on this project, but suddenly a proper off-site backup is just too expensive.


1 hour ago, BachChain said:

So they shelled out however many hundreds of thousands of dollars on this project, but suddenly a proper off-site backup is just too expensive.

I believe that 45 Drives and Seagate sponsored the original servers. 

 

Certainly they could afford $100K, but if they can find a cheaper way *and* get some video content out of it too? Why not. 

 

They might still end up with another set of servers co-located at their ISP datacentre. 


32 minutes ago, dalekphalm said:

I believe that 45 Drives and Seagate sponsored the original servers. 

 

Certainly they could afford $100K, but if they can find a cheaper way *and* get some video content out of it too? Why not. 

 

They might still end up with another set of servers co-located at their ISP datacentre. 

Yeah, and the funny thing is they have 2-3 empty S45s, so all they would need to do is buy HDDs.

I get that Linus probably didn't want to shell out the extra cost to go from 10TB to 12TB drives on the backup, in addition to making the RAID groups larger, so that they could fit it all in 2 of the 45-bay units.

Good luck, Have fun, Build PC, and have a last gen console for use once a year. I should answer most of the time between 9 to 3 PST

NightHawk 3.0: R7 5700x @, B550A vision D, H105, 2x32gb Oloy 3600, Sapphire RX 6700XT  Nitro+, Corsair RM750X, 500 gb 850 evo, 2tb rocket and 5tb Toshiba x300, 2x 6TB WD Black W10 all in a 750D airflow.
GF PC: (nighthawk 2.0): R7 2700x, B450m vision D, 4x8gb Geli 2933, Strix GTX970, CX650M RGB, Obsidian 350D

Skunkworks: R5 3500U, 16gb, 500gb Adata XPG 6000 lite, Vega 8. HP probook G455R G6 Ubuntu 20. LTS

Condor (MC server): 6600K, z170m plus, 16gb corsair vengeance LPX, samsung 750 evo, EVGA BR 450.

Spirt  (NAS) ASUS Z9PR-D12, 2x E5 2620V2, 8x4gb, 24 3tb HDD. F80 800gb cache, trueNAS, 2x12disk raid Z3 stripped

PSU Tier List      Motherboard Tier List     SSD Tier List     How to get PC parts cheap    HP probook 445R G6 review

 

"Stupidity is like trying to find a limit of a constant. You are never truly smart in something, just less stupid."

Camera Gear: X-S10, 16-80 F4, 60D, 24-105 F4, 50mm F1.4, Helios44-m, 2 Cos-11D lavs


1 minute ago, GDRRiley said:

Yeah, and the funny thing is they have 2-3 empty S45s, so all they would need to do is buy HDDs.

I get that Linus probably didn't want to shell out the extra cost to go from 10TB to 12TB drives on the backup, in addition to making the RAID groups larger, so that they could fit it all in 2 of the 45-bay units.

Personally I’d go tape drive. Get a tape library, pop it into a data centre, and replicate to tapes on a rotating basis. 

 

Maybe also have a 45 drive server as a temp dumping ground, or just as an additional level of redundancy. 


Anyone familiar with the website featured in the video to calculate storage costs across all the vendors?


14 minutes ago, AlexDeHacker said:

use https://www.backblaze.com/backup-pricing.html unlimited storage for 5$ a month

You've selected their Personal plan. While it is good value, they almost certainly have commercial uses banned via the EULA.

 

There's also business - $50/mo per computer - but I assume this is for non-server usage only.

 

They do have a commercial Server/NAS pricing option though - it's still pretty expensive, but much cheaper over a year than I think Linus might have realized.

 

I threw in some random figures:

100TB initial upload

50TB monthly upload

50GB monthly download

 

And it came to just under $8000 USD per year. That's a lot, but as a yearly business expense, it's not that much.

 

Granted, the Vault has almost 400TB, so you'd need to multiply the cost by 4, pretty much.
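
As a quick sanity check on that "multiply by 4", here is the scaling written out. It assumes the price scales linearly with the amount stored, which is only an approximation since the quoted figure also includes the monthly upload and download numbers. The variable names are illustrative.

```python
quoted_tb = 100               # initial upload used in the calculator
quoted_usd_per_year = 8000    # "just under $8000 USD per year"
vault_tb = 400                # "almost 400TB"

# Linear scaling by stored capacity (assumption, not the vendor's formula)
scale = vault_tb / quoted_tb
vault_usd_per_year = quoted_usd_per_year * scale
vault_usd_per_month = vault_usd_per_year / 12

print(f"~${vault_usd_per_year:,.0f} per year, ~${vault_usd_per_month:,.0f} per month")
```

So the rough upper bound is around $32,000 a year for the whole Vault.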


My guess is the 750GB/day limit is there to prevent DDoS-type attacks, where you'd create a bunch of accounts over time and sync your botnet to try to upload petabytes of random data per second, burning through the accounts as it goes.

 

Backups of stuff sound like a legit use case, and you're still paying a hefty chunk, so I don't know why anyone would be mad. On the contrary, it's actually a creative way to use the service. I wish you did a segment where you explain how folks should set up rclone, but maybe that's more for Wendell.
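
For anyone wondering what the 750GB/day limit means in practice, here is a small sketch. It assumes the cap applies per account, decimal units, and a single account uploading around the clock; the function names are made up for this example.

```python
DAILY_CAP_GB = 750  # upload limit per day mentioned above


def days_to_upload(total_tb: float, daily_cap_gb: float = DAILY_CAP_GB) -> float:
    """Days needed to push total_tb through one account at the daily cap."""
    return total_tb * 1000 / daily_cap_gb


def max_sustained_mb_per_s(daily_cap_gb: float = DAILY_CAP_GB) -> float:
    """Average upload rate (MB/s) that exactly uses up the daily cap."""
    return daily_cap_gb * 1000 / 86_400


print(f"400 TB: about {days_to_upload(400):.0f} days")
print(f"1 PB:   about {days_to_upload(1000):.0f} days")
print(f"Rate that fills the cap: about {max_sustained_mb_per_s():.2f} MB/s")
```

A 400TB archive would take about a year and a half through one account. The last number is the useful one for setup: throttling the uploader to a bit under that rate (rclone has a `--bwlimit` flag for this) should keep a continuous sync from hitting the cap partway through the day.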

 
