
Posted by tartieret 10/29/2025

Tell HN: Azure outage

Azure is down for us; we can't even access the Azure portal. Are others experiencing this? Our services are located in Canada/Central and US-East 2.

https://downdetector.ca/status/windows-azure/

https://azure.status.microsoft/en-gb/status

885 points | 806 comments
a_f 10/29/2025|
Looks like MyGet is impacted too. Seems like they use Azure:

>What is required to be able to use MyGet? ... MyGet runs its operations from the Microsoft Azure in the West Europe region, near Amsterdam, the Netherlands.

hypeatei 10/29/2025||
All of my employer's things are hosted on Azure and are running just fine; they didn't go down at all. Portal access has been fixed.

Doesn't seem to be too bad of an outage unless you were relying on Azure Front Door.

randomsofr 10/29/2025|
SSO is down, Azure Portal Down and more, seems like a major outage. Already a lot of services seem to be affected: banks, airlines, consumer apps, etc.
hypeatei 10/29/2025|||
The portal is up for me and their status page confirms they did a failover for it. Definitely not disputing that its reach is wide, but a lot of smaller setups probably aren't using Front Door.
rahkiin 10/29/2025|||
Both work for me in the Netherlands
_pdp_ 10/29/2025||
With all the recent outages considered, it is time to move off the cloud.
rcarmo 10/29/2025||
Not seeing it. I have VMs in US East and Netherlands and they're up.
tgv 10/29/2025|
I tried to look some things up on their support pages before 1600Z, and the requests timed out. The Dutch railways are also affected (they're an MS shop, IIRC).
LaserToy 10/29/2025||
Azure portal still insists the issue is just with Console.

We had to bypass Front Door.

8cvor6j844qw_d6 10/29/2025||
Quite close to the recent AWS outage. Let me see whether it's a major one similar to AWS.

Any guess on what's causing it?

In hindsight, I guess the foresight of some organizations to go multi-cloud was correct after all.

jcims 10/29/2025||
We're multi-cloud and it really saved a few workloads last week with the AWS issue.

It's not easy though.

sanskarix 10/30/2025||
This is the eternal tension for early-stage builders, isn't it? Multi-cloud gives you resilience, but adds so much complexity that it can actually slow down shipping features and iterating.

I'm curious—at what point did you decide the overhead was worth it? Was it after experiencing an outage, or did you architect for it from day one?

As someone launching a product soon (more on the builder/product side than infra-engineer), I keep wrestling with this. The pragmatist in me says "start simple, prove the concept, then layer in resilience." But then you see events like this week and think "what if this happens during launch?"

How did you handle the operational complexity? Did you need dedicated DevOps folks, or are there patterns/tools that made it manageable for a smaller team?

jcims 10/30/2025||
I don't think I would recommend multi-cloud right out of the gate unless you already have a lot of experience in the space or there is a strong demand from your customers. There's a tremendous amount of overhead with security/compliance, incident management, billing, tooling, entitlements, etc. There are a number of external factors that drove our decision to do it, resiliency is just one of them. But we are a pretty big shop, spending ~$10M/mo on cloud infra and have ~100 people in the platform management department.

I would recommend focusing on multi-region within a single CSP instead (both for workloads AND your tooling), which covers the vast majority of incidents and lays some of the architectural foundation for multi-cloud down the road. Develop failover plans for each service in your architecture (e.g., planned and tested runbooks to migrate to Traffic Manager in the event AFD goes down).
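To make the runbook idea concrete, here is a minimal sketch in Python of the decision step only: probe an ordered list of entry points and pick the first healthy one. Everything in it is an assumption for illustration (the `Endpoint` type, the `choose_endpoint` and `http_probe` helpers, and the example URLs are hypothetical); it does not call any Azure API, and actually repointing traffic (DNS, Traffic Manager profile, etc.) is left to whatever your runbook uses.

```python
import urllib.error
import urllib.request
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class Endpoint:
    """A hypothetical entry point for a service (e.g. a CDN/front-door URL or a fallback)."""
    name: str
    url: str


def http_probe(endpoint: Endpoint, timeout: float = 3.0) -> bool:
    """Return True if the endpoint answers a GET with a 2xx status within the timeout."""
    try:
        with urllib.request.urlopen(endpoint.url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failure, refused connection, timeout, HTTP error, or malformed URL.
        return False


def choose_endpoint(
    endpoints: Sequence[Endpoint],
    is_healthy: Callable[[Endpoint], bool] = http_probe,
) -> Optional[Endpoint]:
    """Return the first healthy endpoint in priority order, or None if all fail."""
    for endpoint in endpoints:
        if is_healthy(endpoint):
            return endpoint
    return None


# Hypothetical priority list: primary path first, fallback second.
ENDPOINTS = [
    Endpoint("front-door", "https://app.example.com/healthz"),
    Endpoint("traffic-manager", "https://app-fallback.example.net/healthz"),
]
```

Passing the probe in as a parameter keeps the decision logic testable without a network, which matters if the runbook is supposed to be rehearsed before an incident rather than during one.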

Also choose your provider wisely. We experience 3-5x the number of service-impacting incidents on Azure that we do on AWS. I'm sure others have different experiences, but I would never personally start a company on Azure. AWS has its own issues, of course, but reliability has not been a major one (relatively speaking) over the past 10 years. Last week's incident with DynamoDB in us-east-1 had zero impact on our AWS workloads in other regions.

stuff4ben 10/29/2025|||
It's always freakin DNS...
iAMkenough 10/29/2025|||
Trusting AI without sufficient review and oversight of changes to production.
whynotminot 10/29/2025|||
Yeah, these things never happened when humans were trusted without sufficient review and oversight of changes to production.
shepherdjerred 10/29/2025|||
Do you have any insight or do you just dislike AI? Incidents like this happened long before AI generated code
Capricorn2481 10/29/2025||
I don't think it's meant to be serious. It's a comment on Microsoft laying off their staff and stuffing their Azure and Dotnet teams with AI product managers.
conroydave 10/29/2025||
cost cutting attempts
avgDev 10/29/2025||
I am having a bunch of issues. It looks like their sites and azure are both affected.

I also got a weird notification in VS2022 that my license key was upgraded to Enterprise, but we did not purchase anything.

Mr_Bees69 10/29/2025||
Might be a failsafe: if you can't get a license status and you're aware that MS is down, just default to the highest tier.
ThatManulTheCat 10/29/2025||
Free upgrade
dlcarrier 10/29/2025||
Yesterday Amazon, today Microsoft. Are Google's cloud services going down tomorrow?
gtowey 10/29/2025||
This is because Azure just copies everything AWS does. Google is a bit more innovative, they will have something else unexpected happen.
dkdcio 10/29/2025||
throwback to when they deleted a customer's entire account! https://arstechnica.com/gadgets/2024/05/google-cloud-acciden...
Insanity 10/29/2025|||
Maybe they are and no one realized yet.. :P

That said, I don't hear about GCP outages all that often. I do think AWS might be leading in outages, but that's a gut feeling, I didn't look up numbers.

luhn 10/29/2025|||
They had a pretty massive one earlier this year. https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1S...

This isn't GCP's fault, but the outage ended up taking down Cloudflare too, so in total impact I think that takes the cake.

xenolithis 10/29/2025||||
fairly certain they had a significant multi region outage within the past few years. I'll try to find some details to link.

Few customers....few voices to complain as well.

Mr_Bees69 10/29/2025|||
as a victim of xbox, azure is down 'bout as often as its up
m_fayer 10/29/2025|||
And if they don't, we'll know who the culprit is.
shishcat 10/29/2025||
Who?
briffle 10/29/2025||
here's hoping it's Oracle's cloud instead....
CKMo 10/29/2025||
Reasons to not use hyperscalers, exhibit 654

There are a lot of outages this month!

anon025 10/29/2025|
It's the DNS: get.helm.sh is unreachable (https://dnschecker.org/#A/get.helm.sh)
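For anyone who wants to check A-record resolution from their own machine rather than through a web checker, a rough sketch using only Python's standard library is below. The `resolve_ipv4` helper is hypothetical, and the result only reflects whatever resolver your machine is configured to use, so it may differ from what other networks see.

```python
import socket
from typing import Callable, List


def resolve_ipv4(hostname: str, resolver: Callable = socket.getaddrinfo) -> List[str]:
    """Return the sorted, de-duplicated IPv4 addresses for hostname, or [] if lookup fails."""
    try:
        infos = resolver(hostname, None, socket.AF_INET, socket.SOCK_STREAM)
    except socket.gaierror:
        # The name could not be resolved by the local resolver.
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr is (ip, port).
    return sorted({info[4][0] for info in infos})


if __name__ == "__main__":
    addresses = resolve_ipv4("get.helm.sh")
    print(addresses if addresses else "no A records resolved")
```

An empty result means the lookup failed locally; it does not by itself say where in the DNS chain the failure is.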
I_am_tiberius 10/29/2025|
Why are Azure App Services still working?