Top
Best
New

Posted by ndhandala 10/29/2025

AWS to bare metal two years later: Answering your questions about leaving AWS(oneuptime.com)
727 points | 491 commentspage 4
tuhgdetzhh 10/29/2025|
Quite recently I made a TCO analysis between AWS and bare metal Hetzner including salary. https://beuke.org/hetzner-aws/
iLoveOncall 10/29/2025||
This is a completely meaningless article if they don't provide information about their technical stack, which AWS services they used to use, what TPS they are hitting, what storage size they're using, etc.

The story will be different for every business because every business has different needs.

Given the answer to "How much did migration and ongoing ops really cost?" it seems like they had an incredibly simple infrastructure on AWS, and it was really easy to move out. If you use a wider-range of services the cost savings are much more likely to cancel themselves.

globular-toast 10/29/2025|
TFA begins with a link to the original article with those details.
iLoveOncall 10/29/2025||
If you called "We used EKS" details, then yeah they provide those details.

Assuming this is indeed all they used, this was admittedly nonsense, they were essentially using cloud-based bare-metal.

StratusBen 10/29/2025||
Co-Founder and CEO of https://vantage.sh/ here - I've been pretty impressed by the rate that repatriation is happening off of public cloud. It rarely ever came up and in the last year it's been popping up more and more -- and especially just for getting access to GPU workloads.

I thought there would be a greater unbundling to AWS or to cheaper providers but it seems like a good-sized portion of the market is just going back to managing their own hardware.

ed_mercer 10/29/2025||
Talos is great until it's not. We ran into Ceph IO speed bottlenecks and found it was impossible to debug ("talosctl cgroups —preset=io" is a mess) because the devs didn't want to add an SSH escape hatch into their black box OS. Our Talos nodes would also randomly become unhealthy and you have no way of knowing why. Switched to PXE booted Alpine linux with vanille k8s, and we had a much more stable experience with no surprises, and the ability to SSH whenever we want has been hugely helpful.
nodesocket 10/30/2025||
“ $600/month for NAT gateways”

I build my own NAT instances from Debian Trixie with Packer. The configuration is literally a few lines:

    sudo iptables -t nat -A POSTROUTING -o ens5 -j MASQUERADE
    sudo iptables -F FORWARD
    sudo iptables -A FORWARD -i ens5 -m state --state RELATED,ESTABLISHED -j ACCEPT
    sudo iptables -A FORWARD -o ens5 -j ACCEPT

    sudo iptables-save | sudo tee /etc/iptables/rules.v4 > /dev/null
aetherspawn 10/29/2025||
Ok but what about a dedicated OVH for example? Those are about 70% cheaper than AWS, so is it still worth it to colo?
bilekas 10/29/2025|
Did you read the article ? The main point of this and the prior article is that YES colocation/baremetal IS a better option for this company (and I would argue the majority of AWS users)

reference : https://news.ycombinator.com/item?id=38294569

stack_framer 10/30/2025||
My question is: How did you get your entire team on board with this decision? My team disagrees on even the most trivial technicalities, so I can't imagine doing something on the scale you're doing (even though I wish we could leave AWS for our own hardware).
debarshri 10/29/2025||
Recently i learned that orgs these days want to show software and infrastructure spend as capex as they can shown it as depreciating asset for tax purposes.

I understand that with AWS you cannot do that as it is often seem as opex.

I guess thats a good enough motivation to move out of AWS at scale.

doctorpangloss 10/29/2025|
Microk8s has common, catastrophic performance bugs. There are also catastrophic problems with microk8s Ceph addons. So is this post true? Microk8s, for people who know stuff, is a canary for clusters / applications that don’t really work.
ndhandala 10/29/2025||
We havent found those bugs in our cluster, but we're also moving to Talos (but for diff reasons)
acejam 10/29/2025||
Source? Links?
doctorpangloss 10/29/2025||
https://www.google.com/search?q=microk8s+dqlite+site%3Agithu...

Click on the various catastrophic issues. Observe how many are closed with no resolution. Canonical is great but Microk8s is not.

More comments...