I don’t know how anyone can afford these migrations especially for production on...

toast0 · 2026-06-18T02:09:39 1781748579

I don't work at this level, but there's lots of things you can do.

a) you migrate in increments, so even if your migration needs to run old and new to compare, you don't need to do it for everything at once.

b) you probably have some slack, and you can make slack by packing tighter during migration.

c) you probably have some amount of regular hardware refresh. Retaining the old hardware a bit longer can get you more headroom for migration.

d) some servers can probably take an extended maintenance outage during conversion.

e) depending on everything, you might be able to get short term capacity from cloud or short term leases.

There's almost certainly some automation around migration. Some of it might even work.

Have a plan, make progress... even if you don't migrate everything by the date, you'll have done a lot and reduce the broadcom bill.

rwmj · 2026-06-18T02:08:00 1781748480

We usually reuse the VMware hardware and (most importantly) file storage. Some additional hardware is required temporarily so you can build out initial Openshift nodes. The VMware nodes are decommissioned and converted to OSV nodes as the conversion goes along. With some kinds of file storage (cough NetApp) the conversion is zero copy, the VM literally stays where it is. With others we will copy to new NFS storage areas which will be provisioned on the same physical hardware.

zelon88 · 2026-06-18T03:46:28 1781754388

The 40k servers are probably made up of multiple redundant vSphere clusters with failover. You simply take one of those redundant clusters and migrate one half of it over. Then the other half. Then duplicate that process. As you build more compute in the new stack, you can decomission more and more of the old stack and convert it. The transition would progress like a cascade, with larger and larger groups of clusters being migrated at once until you're left with the one-off, ad-hoc, weirdo clusters at the end that need to be manually migrated (usually with great effort).

The actual hardware servers are clustered together into pools of resources. The pools are where the VMs live. The bigger the new pool becomes, the faster you can empty the old one. So the migration starts very slowly, ramps up quickly, and then tapers off.

warunsl · 2026-06-18T04:33:36 1781757216

> You simply take one of those redundant clusters and migrate one half of it over.

For that half you are migrating, you are essentially operating without redundancy. If these are serious production workloads, the tradeoff is not as simple as you make it seem.

burnte · 2026-06-18T01:38:47 1781746727

Hire me, I'll be more than happy to show you! :D I'm an expert at it!

firesteelrain · 2026-06-18T01:40:52 1781746852

Ha I have done migrations recently from vSphere to vSphere using vMotion and it was easy.

But it still took duplicate set of HW and I couldn’t imagine doing it without a lot of IaC and automation in place (plus physical space, power and cooling)

burnte · 2026-06-18T15:13:06 1781795586

> But it still took duplicate set of HW and I couldn’t imagine doing it without a lot of IaC and automation in place

Like I said, I specialize in that. My career has been doing hard infra stuff other people won't or can't.