
Who has migrated HCI from Xeon to Epyc?

OCSmash

I'm currently running a 2+1 Simplivity infrastructure (each node has dual 16-core Xeons (32 cores total), 1TB of RAM and 20TB of usable storage) with VMware as my hypervisor. That is essentially an HCI with 64 usable cores and 2TB of RAM in the production environment. I'm planning an upgrade to 96+ cores and 2-4TB of RAM. I've mostly been looking at the new Simplivity (HP) and VxRail (Dell) equipment. Both Dell and HP have introduced Epyc gen 2 options. This would essentially mean building 2x servers with 1x 64-core processor and 2TB of RAM each. I've even toyed with the idea of running one chassis with dual Epyc 64-core processors and 4TB of RAM and dumping HCI altogether. With the savings on VMware and other licensing, it seems like a slam dunk.
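As a quick sanity check on those numbers, here is a minimal Python sketch that compares the three layouts mentioned above. The `Cluster` type and its method names are made up for illustration; the node counts, core counts and RAM sizes come from the post, and the only assumption is that losing a node removes that node's cores and RAM from the pool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """Hypothetical helper type; the figures passed in come from the post."""
    name: str
    nodes: int
    cores_per_node: int
    ram_tb_per_node: float

    def cores(self, failed_nodes: int = 0) -> int:
        # Cores still available after `failed_nodes` nodes are lost.
        return max(self.nodes - failed_nodes, 0) * self.cores_per_node

    def ram_tb(self, failed_nodes: int = 0) -> float:
        # RAM (in TB) still available after `failed_nodes` nodes are lost.
        return max(self.nodes - failed_nodes, 0) * self.ram_tb_per_node


options = [
    Cluster("current 2+1 Xeon", nodes=3, cores_per_node=32, ram_tb_per_node=1.0),
    Cluster("2x single-socket Epyc", nodes=2, cores_per_node=64, ram_tb_per_node=2.0),
    Cluster("1x dual-socket Epyc", nodes=1, cores_per_node=128, ram_tb_per_node=4.0),
]

for c in options:
    print(
        f"{c.name}: {c.cores()} cores / {c.ram_tb()} TB total, "
        f"{c.cores(1)} cores / {c.ram_tb(1)} TB with one node down"
    )
```

With one node down, the current cluster keeps 64 cores and 2TB, the two-node Epyc option keeps 64 cores and 2TB, and the single chassis keeps nothing, which is the trade-off to weigh against the licensing savings.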

 

I have read that migrating from Intel to AMD is not as simple as using vMotion to move everything from the old cluster to the new one, which makes sense. We run mostly Windows Server and some Linux.

 

My question is for anyone who has made this migration. How did it go? How did the OS images survive the migration from the Xeon to the Epyc platform? What was your performance like? Would you do it again?


I haven't touched the new Epyc gear, but keep in mind that nodes in a VMware cluster need to be from the same CPU vendor (i.e. Intel or AMD). By switching between the two, as you mentioned, you aren't going to be able to live vMotion the machines, because Enhanced vMotion Compatibility (EVC) mode isn't available across vendors. To make the switch you will need to shut down each VM and move it over, which means an outage. Otherwise I don't see any problem with the VM images themselves; I've switched between Intel Xeon and AMD Opteron just fine.
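To make the rule concrete, here is a rough Python sketch of the decision. It does not call any VMware API; `Host`, `migration_mode` and `plan_moves` are hypothetical names used only for illustration, and the host and VM names in the usage lines are invented. It assumes the only thing that matters is whether the source and target CPU vendors match; within one vendor, EVC may still be needed to bridge CPU generations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Host:
    """Hypothetical stand-in for an ESXi host; not a VMware API object."""
    name: str
    cpu_vendor: str  # e.g. "Intel" or "AMD"


def migration_mode(source: Host, target: Host) -> str:
    """Return "live" if the CPU vendors match, otherwise "cold"."""
    same_vendor = source.cpu_vendor.strip().lower() == target.cpu_vendor.strip().lower()
    return "live" if same_vendor else "cold"


def plan_moves(vm_names: list[str], source: Host, target: Host) -> list[str]:
    """Build a human-readable step list for moving each VM to the target host."""
    mode = migration_mode(source, target)
    steps = []
    for vm in vm_names:
        if mode == "live":
            steps.append(f"{vm}: vMotion {source.name} -> {target.name}")
        else:
            # Cross-vendor move: the VM has to be powered off first (an outage).
            steps.append(f"{vm}: power off, move {source.name} -> {target.name}, power on")
    return steps


old_host = Host("xeon-node-1", "Intel")   # invented example names
new_host = Host("epyc-node-1", "AMD")
for step in plan_moves(["win-sql-01", "linux-web-01"], old_host, new_host):
    print(step)
```

The point of the sketch is only that a cross-vendor move turns every VM into a scheduled power-off, so the plan is really a list of maintenance windows.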

 

As for running a single host, I'd keep in mind that while HP and Dell offer excellent service, a hardware failure will still leave you down for at least a few hours until a technician arrives with the replacement part. Also keep in mind that you won't be able to perform the regular recommended host maintenance, such as firmware and ESXi upgrades, without taking every VM offline.

 

You might have more luck getting advice at Level1Tech, as there are more people there who deal with SMEs, which would be more likely to make this switch. Most here are either enthusiasts (who typically don't deal with VMware) or enterprise (who can't afford the downtime for a switch).



Thanks for your insights, Jarsky. You make a solid point about the multi-host environment too. We've had ZERO downtime with our current multi-host environment, and there have been a number of updates performed over the years. HCI is a real life changer in the server room. I cross-posted to Spiceworks, but I'll check out Level1Tech as well. I've exhausted my personal and professional networks, and no one has attempted the leap yet :)


Level1Tech and ServeTheHome should have lots of info.

