NSA director: 'Mythos "broke into almost all of our classified systems in hours"

Posted by ricksunny 3 days ago

NSA director: 'Mythos "broke into almost all of our classified systems in hours"(www.economist.com)

113 points | 119 comments

mrandish 2 days ago|

This quote from TFA is highly likely to be a conflation, exaggeration or extrapolation of what actually happened:

> "On June 11th Mark Warner, the vice-chair of the Senate Intelligence Committee, said that General Joshua Rudd, who leads the National Security Agency and the Pentagon’s Cyber Command, had told him that Mythos “broke into almost all of our classified systems, not in weeks, but in hours”"

Why:

1. It's a paraphrase of a 2nd hand conversation and (at least) the last two 'telephone game' recipients are a U.S. Senator and a general, not security domain or IT experts.

2. Motivated communication: The Senator claimed this to justify the necessity of unprecedented restrictions that he agrees with.

3. The original testimony to the Intelligence Committee was almost certainly detailed, nuanced and highly classified, making this an extreme paraphrase.

In saying this, I'm not claiming Mythos may not be a security issue or that something directionally like this wasn't reported. But given the indirect, circuitous path, it's quite easy to imagine the original testimony was more like "Mythos identified a potential vulnerability we rated "Severe" in a critical system and we believe it could find similar vulnerabilities in any of our systems."

HillRat 2 days ago||

The journalist later admitted that he failed to provide the appropriate context and nuance, which comes down to "red team pen-testers who already had high-side network access were able to more quickly and effectively compromise systems when they were using Mythos as part of their workflow," which is a pretty crucial distinction to make between that and the spectre of Skynet that the article raises.

hnburnsy 1 day ago||

JFC, that is not even remotely close.

hnburnsy 1 day ago||

Here is the update from The Economist...

>An update. A US official tells me that Sen. Warner misunderstood the NSA director Gen. Rudd in this case. Rudd did use the 'hours, not weeks' wording, but the use of Mythos in this context was—as widely assumed—part of a red-teaming effort, i.e. testing the security of internal networks

https://x.com/shashj/status/2069078104941961293?s=20

dostick 2 days ago||

Why not use Mythos to hack them and see what the report was

zombot 2 days ago||

When you get your hands on it, let me know.

mirekrusin 3 days ago||

If mythos can break into almost all of their classified systems in hours then other models including opus, gpt, gemini and large open weight models can do so as well, maybe you'll have to double hours or it may become days, but they also will, there is no "maybe" in here.

State sponsored, non-public penetration fine tunes (of possibly public ones) likely can do it even faster.

Unsupervised penetration RL loop is ideal setup similar to optimization one – it's relatively easy to gain function on it.

dualvariable 2 days ago||

Also, this is just security through obscurity. The holes that mythos exploited still exist after you've tried to limit mythos accessibility.

And the fact that all our systems are riddled with security holes shouldn't be too much of a surprise given the way that we all know that software is developed and how tech debt / chores are constantly underbudgeted (plus I think this underscores that any one human's knowledge and attention are inherently limited, and even the best PR review is going to leak all kinds of security holes).

mirekrusin 2 days ago||

Yes, exactly, quite shocking, if something like this is true, as NSA (!!) director you keep it quiet, right?

dualvariable 2 days ago|||

That is literally just more security through obscurity.

And the threat actors that would find that information "useful" already know it.

All of our IT security is a mess, the NSA director is just confirming what should be common knowledge.

DANmode 2 days ago|||

Not if the existential threat (to your organization’s current setup) is uncontainable.

johndough 3 days ago||

I don't think that is necessarily true.

- With a weaker model, the time to break into the system might grow so large that it becomes infeasible, similar to how password hashes can be bruteforced, but if the password is long enough, that is not going to happen in our lifetime.

- There might be problems which are inherently unsolvable with a lower level of intelligence. For example, your dog won't derive calculus from scratch, even if it lived forever.

- LLMs might be biased in such a way that they never explore the entire solution space, no matter how many attempts are made. Some models are notorious for getting stuck in a loop, trying small variations of the same approach every time, even though it is doomed to fail. This can be counteracted somewhat with higher sampling temperature, but that hurts reasoning capabilities.
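
The brute-force analogy in the first bullet can be made concrete with a little arithmetic. This is a minimal sketch, assuming a 95-character printable alphabet and a hypothetical attacker rate of 10^12 guesses per second; both figures are illustrative assumptions, not numbers from the thread.

```python
# Illustrative only: the alphabet size and guess rate are assumptions.
ALPHABET_SIZE = 95            # printable ASCII characters
GUESSES_PER_SECOND = 10**12   # hypothetical attacker throughput
SECONDS_PER_YEAR = 365 * 24 * 3600


def years_to_exhaust(length: int) -> float:
    """Worst-case years needed to try every password of the given length."""
    keyspace = ALPHABET_SIZE ** length
    return keyspace / GUESSES_PER_SECOND / SECONDS_PER_YEAR


if __name__ == "__main__":
    for length in (8, 12, 16, 20):
        print(f"length {length:2d}: {years_to_exhaust(length):.3e} years")
```

Each extra character multiplies the work by the alphabet size, so a constant-factor slowdown in the attacker matters far less than the length of the password. Whether pentesting with a weaker model scales like this is exactly the point under dispute in the replies.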

BikDk 3 days ago|||

The concept of infinity claims that the dog eventually becomes Shakespeare. The same way we handled encryption, even before Alan Turing codes were broken and evolved. Last, it is a huge advantage to have the machine/mind and to evolve from there. P.S. Even if you go back to lemon juice on paper there may be a thief around that knows the trick.

jjk166 2 days ago|||

> The concept of infinity claims that the dog eventually becomes Shakespeare.

The ability to reproduce an exact copy of hamlet does not make one Shakespeare. A monkey on a typewriter may very well generate Shakespeare eventually, but it wouldn't understand Shakespeare then any more than it could immediately. Likewise a dog may put together some string of text that includes a derivation of calculus, but at no time will it be able to apply that derivation to solve mathematical problems.

gmerc 2 days ago||

And by dog you mean LLM

cyanydeez 2 days ago|||

People seem to think entropy can be overcome with proper focus. That's why we have things like "effective altruism", the idea that you can ignore all the harm you do on the way to some big grand altruistic act, as if the shattered glass can be reassembled if you just collect enough reverse entropy.

It's a line of reasoning meant to shut off empathy to the here and now. And while it sounds good, along the lines of Baywatch: If you're jumping into a life-saving situation and you have to choose between further harming your victim and you being harmed, you choose your victim because without you to save both of you, it's fatal; the difference is indirectly or directly pushing your victim into the water then claiming you're altruistically going to save them at a later date.

It's just delusions to keep moving forward.

awesomeusername 2 days ago||||

Dogs deriving calculus:

https://www.csun.edu/~dgray/BE528/Pennigs2003Dogs_Calculus.p...

Reubend 2 days ago||||

I think you're missing the point. Everything you said is theoretically correct, but the parent comment was talking about the concrete circumstance of pentesting with the top models today.

Let's just take GPT 5.5 and Opus 4.8 as an example. Both are worse than Mythos 5, but they're capable of quite a bit when the guardrails are lifted and they're paired with a skilled human operator. They're more than "good enough" to reach the same result with the addition of some human effort.

mirekrusin 3 days ago||||

Mythos and other models are not brute-forcing passwords (and with this analogy passwords, ie. systems are the same).

We're not talking about dogs, but LLM systems.

Mythos is not exploring entire solution space either.

Usually looping is solved by repetition/frequency/presence/n-gram penalties/DRY/min-p sampling, not temperature but we're not talking about small models that have those classes of issues here.

johndough 3 days ago||

> Mythos and other models are not brute-forcing passwords (and with this analogy passwords, ie. systems are the same).

I am not talking about literally bruteforcing passwords (although LLMs are being used for that, too), but bruteforcing passwords and solving verifiable domain tasks have quite a few similarities, especially when considering rule-based and probabilistic bruteforce methods.

> We're not talking about dogs, but LLM systems.

Well, clearly dogs are not LLM systems. It is an analogy. If there is an important point on your mind that makes the analogy break down, feel free to spell it out.

> Mythos is not exploring entire solution space either.

Yes, but weaker models do not find the solution right away, so they need to try more often. But if they only try the same thing every time, they will never succeed, so we need some kind of guarantee that they try something different every time.

> Usually looping is solved by repetition/frequency/presence/n-gram penalties/DRY/min-p sampling, not temperature but we're not talking about small models that have those classes of issues here.

Those might help to reduce looping (at the cost of biasing the generation), but to guarantee that a model can generate all possible generations, we need non-zero probabilities for all tokens, not lower probabilities for likely tokens.
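
The distinction drawn here, between flattening the distribution and truncating it, can be shown on a toy example. This is a minimal sketch with a made-up four-token vocabulary and made-up logits; it is not any particular model's or library's sampler. Min-p is implemented in its usual form: keep tokens whose probability is at least `min_p` times the top token's probability.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def min_p_filter(probs, min_p=0.1):
    """Zero out tokens below min_p * max(probs), then renormalize the rest."""
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]


logits = [5.0, 3.0, 1.0, -2.0]              # toy values, purely illustrative
plain = softmax(logits)                      # every token has non-zero probability
hot = softmax(logits, temperature=2.0)       # flatter, still all non-zero
truncated = min_p_filter(plain, min_p=0.1)   # unlikely tokens become impossible

for name, dist in (("plain", plain), ("hot", hot), ("min-p", truncated)):
    print(name, [round(p, 4) for p in dist])
```

With temperature alone, every token keeps a non-zero chance of being sampled, so any sequence remains reachable in principle. With a truncating sampler such as min-p, the two least likely tokens in this toy example can never be chosen at that step.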

1over137 2 days ago||

> I am not talking about literally bruteforcing passwords (although LLMs are being used for that, too)

They are? Seems like a much worse way to brute force than a tight loop written in a compiled language.

robocat 2 days ago||

PassGPT: Password Modeling and (Guided) Generation with Large Language Models

https://huggingface.co/papers/2306.01545

Although most activity is likely hidden (blackhat or state)

mikewarot 2 days ago||

It's sad that they did the research[1] and solved computer security about 40 years ago[2], and then proceeded to lose that hard won knowledge over time.

[1] https://csrc.nist.rip/publications/history/index_1.html

[2] https://en.wikipedia.org/wiki/KeyKOS

onjectic 2 days ago||

People will think you are exaggerating, you aren’t. They will also think I’m exaggerating that you aren’t, I’m not. Learning about capability-based microkernels and realizing this has been a solved problem for years, and is actually one of the rare easy freebie problems in computing, is a highly sobering experience!

Only thing I disagree on is that we lost that knowledge, we did not, there isn’t much to capabilities, they actually simplify OS design IMO.

tra3 2 days ago||

I’m not familiar with this, but what does “solved” mean in this case? Guaranteed inability to compromise systems?

mikewarot 2 days ago||

Pretty much. If you've got a microkernel / capabilities based OS, the amount of mischief that someone can cause is severely reduced.

It's my belief that we can have general purpose, easy to use, secure computing for everyone.

No UAC crap, or horrible systems like AppArmor, no virus scanners, etc... just computers that do what you want, and only what you want.

We could have had it decades ago, if things had happened in a slightly different sequence order, related to the flood of personal computers.

RetroTechie 2 days ago|||

Capabilities-based OSes aren't magic. Their robustness still depends on underlying assumptions, which may or may not hold. See eg relevant disclaimers in seL4 whitepaper(s).

And hardware glitches are a thing (edit: and supply chain attacks).

But I do agree that verified correct software can offer very strong guarantees that go well beyond those of commonly deployed software. We could have been in a much better place today.

saidnooneever 1 day ago|||

their robustness lives in hardware capabilities. amd64 and intel x86_64 have quite good features but people dont use them well. For example you can have your microkernel be at the hypervisor level and thoroughly isolate devices etc through IOMMU and have almost no attack surface to get access deep enough to make significant changes to the security posture.

still not immune to be hacked ofc. I think the last step would be making it common place again to build these things custom. that way they'd have to have more specific information available as threat actors to exploit you. It'd be harder to have generic methods affecting millions of systems.

regardless there are no silver bullets, and tradecraft/opsec will always be a thing. most compromises are because people hand out keys unwittingly rather than 0days and crazy sploits. (they do happen though, but it's more expensive than phishing and just logging on under some dude's credentials)

RetroTechie 2 days ago|||

To clarify: capabilities-based OS != verified correct software.

But there's much synergy there. Each enhances the other.

esperent 2 days ago||||

How much does this limit what a computer can do? E.g. if I converted an Ubuntu desktop into a "secure microkernel", what functionality would be lost?

marysol5 2 days ago||

Everything but the specific usage

esperent 2 days ago||

But what does that mean? Can I browse a webpage, open a doc, if those are listed as specific usage? And if not, what's the purpose of this and why are people talking about it with such import?

saidnooneever 1 day ago||

most people dont do a lot on their machines. they have specific tasks they want to do. The idea is to isolate by default and crack open gaps by policy. You can still do 'anything' but you wouldnt want to enable 'anything' to be possible in the policy..
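
"Isolate by default and crack open gaps by policy" can be illustrated with a toy default-deny model. This is a sketch under loud assumptions: the `Sandbox` class, resource names, and actions are hypothetical, and plain Python cannot enforce isolation the way a capability-based kernel or a hypervisor does. It only shows the shape of the policy, not how KeyKOS, seL4, or Qubes implement it.

```python
class Sandbox:
    """Toy default-deny compartment: nothing is allowed until explicitly granted."""

    def __init__(self, name):
        self.name = name
        self._grants = set()  # starts empty, so every request is denied

    def grant(self, resource, action):
        """Open one specific gap in the isolation, by policy."""
        self._grants.add((resource, action))

    def request(self, resource, action):
        """Allow the action only if that exact (resource, action) pair was granted."""
        if (resource, action) not in self._grants:
            raise PermissionError(f"{self.name} may not {action} {resource}")
        return f"{self.name}: {action} {resource} allowed"


# Hypothetical policy: the browser may use the network and read one document.
browser = Sandbox("browser")
browser.grant("network", "connect")
browser.grant("report.odt", "read")

print(browser.request("network", "connect"))
try:
    browser.request("passwords.db", "read")  # never granted, so denied
except PermissionError as err:
    print("denied:", err)
```

In this model, browsing a webpage or opening a document still works, but only because those specific grants were listed. Anything not listed fails, which is the trade-off being discussed: the system can do "anything", but the policy deliberately does not enable everything.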

fsflover 1 day ago||

Sounds like security through compartmentalization is more user-friendly: You can run whatever you want and how you want it in a dedicated VM, keeping sensitive things safely isolated, without much thinking of what to enable. Case in point: Qubes OS, my daily driver. Btw it already exists and is stable.

ethbr1 1 day ago|||

> security through compartmentalization is more user-friendly: You can run whatever you want and how you want it in a dedicated VM, keeping sensitive things safely isolated

My brain hurts. How is a system where you can run whatever you want, however you want, but still keep sensitive things safely isolated possible?

Either you have restrictions on what you can run or access (in which case those limit sandboxed capabilities) or you have a hypothetically secure system, the security features of which you never leverage (because sandboxes have absolute freedom).

Unless you were talking about the ability to guarantee a monitor-only hypervisor or resource slice a machine into multiple tenants? (i.e. no/light touch hypervisor situations)

saidnooneever 1 day ago|||

this relies wholly on user skill which most people will not be able to do. you need extreme tradecraft and opsec to keep really secure. any little mistaken copy between domains etc. might compromise.

This is the downside of isolation machines and their upside.

Hard to make a completely isolated machine for all workflows and keep all data at all times inaccessible for exploits. But because each user has their own ways its more potential that 'your particular way of breaking the model' is not known or exploitable (yet).

A lot of holes you open are one-time actions from within a restricted domain.

in qubes you have cross domains tools from domain0 for this, which is very hard to reach (but not impossible).

And then supply chain is also hard. Qubes have canaries, but i think most ISO people copy into their dom0 and spin VMs off of are not doing such rigorous things. (depends what u use ofc).

fsflover 1 day ago||

> this relies wholly on user skill which most people will not be able to do. you need extreme tradecraft and opsec to keep really secure.

This depends on the chosen level of compartmentalization. For most people, it might be sufficient to store passwords in a dedicated, offline VM and do everything else in another one. This will already be huge improvement.

fsflover 1 day ago|||

I'm not sure I understand your question. VMs run full operating systems on top of Xen hypervisor relying on hardware-assisted virtualization (VT-d or similar). You can run untrusted software in a dedicated VM and keep your sensitive data in another offline VM.

The dom0 has no network and doesn't manage, e.g., USB devices.

ethbr1 1 day ago||

You can't have full general purpose computing on a system and perfect isolation for free.

By definition, the latter implies limits on the former.

Either you have complete freedom to run whatever you want, however you want, or you enforce limits to guarantee system behavior and enforce isolation.

And if you do the latter... then you don't have the former.

fsflover 1 day ago||

Can you elaborate? I'm not a computer scientist. In my understanding, full VMs are practically equivalent to general purpose computers. What are their limitations? Malware escapes?

Last VM escape in VT-d was discovered in 2006 by the Qubes founder, so I really feel safe on Qubes, https://en.wikipedia.org/wiki/Blue_Pill_(software)

ethbr1 3 hours ago||

We're talking about apples and oranges.

I thought your original point above was that VMs freed you from having to come up with policy-based isolation rules (which have always been a UX weakness of policy-based isolation systems).

The point I was making is that VMs don't provide any security guarantees unless you also use the trusted hypervisor layer to enforce something.

At lightest touch, this might be time-slicing resources and ensuring they're evenly split between VMs, regardless of what individual VMs try to do.

But to provide policy-alike granular security control on VMs, you fundamentally have to generate similar rules. E.g. network can only be used by this VM in this way, etc.

Which gets you right back to having to define policies.

From an architecture security perspective, sure, having a trusted hypervisor enforcing the rules is nice. But it doesn't fundamentally fix the problem of getting policies right... if you're trying to guarantee the same level of control.

saidnooneever 1 day ago|||

QubesOS is not bad indeed. its not perfect (they are looking i think to replace Xen or make it much more thin layer). Its definitely the way i think if u want to retain compatibility with existing OSes/tools.

fsflover 1 day ago||

They are not looking to replace Xen. They plan to add support of KVM without breaking Xen: https://github.com/QubesOS/qubes-issues/issues/7051#issuecom...

They also plan to replace Fedora in dom0 with something minimized https://github.com/QubesOS/qubes-issues/issues/1919#issuecom.... Is this a problem for you?

saidnooneever 1 day ago||||

building such an OS for many years now..Qubes gets close enough but its super heavy, trying to support existing apps. I make my own so its super light weight, but no one will use it but me because their toolz arent supported (nothing is :D).

there are some BSD spinoffs like 5BSD which might end up with a good capability model but even there things like capsicum have their limits and IOMMU based isolation is still a dream. (because entire OS kernel is in one privilege level, accessible as root user, so DMA capable devices kill a lot of those securities).

(my os puts every subsystem, service, device driver, app etc. in their own hardware VM, likely there will be IPC bugs or hypercall bugs still tho in that case)

Nowadays with AI its getting more to a point where people can actually build these systems for themselves. Maybe that is a bigger threat to these big corporate tech companies than some security things. It will allow nations and companies to detach from their Tech...

snvzz 2 days ago||||

seL4[0] being the formally-proven modern representative.

0. https://sel4.systems/

saidnooneever 1 day ago||

sel4 is neat. and open source. there are many like it proprietary.

snvzz 1 day ago||

I am not aware of any microkernel actually competitive with seL4, open or else.

saidnooneever 1 day ago||

seen some, i could tell u but then i'd have to... :-) (go to jail)

wolvoleo 2 days ago||||

That would stop a lot of kernel-based exploits but you can also do a lot of damage just as a user of course.

port11 2 days ago||||

Can you recommend any modern systems that behave like that?

RetroTechie 2 days ago||

https://genode.org

saidnooneever 1 day ago||

this is a step forward but they need to lean into hw isolation more. definitely a very interesting project. inspiring :)

vsgherzi 3 days ago||

This is really making me raise an eyebrow. I’m sure mythos is an improvement for sure. I don’t think the framing of it hacked the entire NSA is fully truthful. I’d like a more in depth understanding of what actually happened. Excited to be proved wrong tho!

Epa095 3 days ago||

Yeah, this article cites someone saying that someone else said something. Maybe it was said, maybe not. Maybe it was an exaggeration, maybe not.

SirFatty 3 days ago||

Very insightful.

cyanydeez 2 days ago||

One has to assume they put Mythos behind the front lines and not in front of the front lines, so I'd agree almost any currently useful LLM could likely crack through security if you're already inside the perimeter.

stithpragya 2 days ago|||

From the outset, Mythos’s PR has been rather dodgy.

Gee101 1 day ago||

Marketing has been brilliant though.

DANmode 2 days ago||

They said “almost”, for starters.

AngryData 2 days ago||

Not surprised, our security systems are 95% security through obscurity these days. Mythos didn't find new ways to break security, it just went down the list of common security exploits and exposed them for being common even among government agencies.

protocolture 2 days ago|

Next Headline: Government bans Nmap.

protocolture 2 days ago||

>On June 11th Mark Warner, the vice-chair of the Senate Intelligence Committee, said that General Joshua Rudd, who leads the National Security Agency and the Pentagon’s Cyber Command, had told him that Mythos “broke into almost all of our classified systems, not in weeks, but in hours”.

From outside? Or did you have a shit ton of unpatched systems that only internal users could access?

throw1234567891 2 days ago|

“Only those who are inside can access”.

scotty79 2 days ago||

If I were to guess, internally they have as sloppy security as any other corp/organization. And those were the things Mythos effortlessly poked holes in. Other models would probably as well, but Anthropic hyping gave NSA the idea to try. The shell around those internal systems is probably as (im)penetrable as ever because it's just some flavor of hardened and bare bones linux.

ggm 3 days ago||

I made a point about this in relation to Anthropic last week: nobody inside the strategic information spaces is worried about AGI; they're worried about core strategic information leaking out. Either it's in the model, or the model exposes pathways to finding it in the core strategic systems.

Those "tapes" DOGE took away? Nothing on them can be considered private any more. That's how brute force risk happens. Mythos' risks are showing doorways to exfiltration surely? Why bother when you can walk out the door with a data dump?

The NSA is just a highly specific subclass of the problem. Their traditional publicly stated approach to security is "nothing electronic which enters our domain leaves" and yet somehow they have assessed these systems as capable of breaching their walls? That's super bad.

I suspect they ran an analogue/instance inside their protection rings. I doubt they ran a test outside in the global internet. If they have actually lost control of their boundary, that's a bigger story (which I doubt) and contextually he could have been referring to information systems in NSA's duty of care, not things inside Ft Meade.

CobaltFire 2 days ago||

Not a surprise. I got in a LOT of trouble for identifying and outlining a trivial privilege escalation attack that worked on both NIPR and SIPR.

In the end I got to help write up the issue but to my knowledge they never patched it as it would have caused major issues with maintenance by closing off access needed for some legacy software patches.

stubish 2 days ago||

It is very interesting the different reactions between your experience (and many whistleblowers), and how people react to software doing the same thing. Although in this case, maybe it isn't so different? They did essentially have the tool buried, out of sight out of mind for a little while at least.

mock-possum 2 days ago||

What did you get into trouble for?

chaitanyya 2 days ago|

How much of it is just exposing poor engineering practices people got away with because it was not economically viable earlier to spend human hours to exploit a system?

Not taking a dig at people, it was not a terrible choice earlier. Not like these models are inventing net new ways to exploit systems.

ActorNightly 2 days ago|

It's not that.

I would bet a large sum of money that Mythos was put on the same local network as the "systems" (i.e. you have access to services like UPnP brokers that were never meant for the outside internet), and the "broke into" is just a blanket term for finding some bug, which can range from simply crashing the program to actual remote code execution. And it's probably mostly the former. It used to be that cyber security research was all about finding ways to crash the program, which then implied that you can inject shell code, so the two became synonymous for vulnerability, but these days it's very much not the case.
