Top
Best
New

Posted by be7a 12 hours ago

System Card: Claude Mythos Preview [pdf](www-cdn.anthropic.com)
Related: Project Glasswing: Securing critical software for the AI era - https://news.ycombinator.com/item?id=47679121

Assessing Claude Mythos Preview's cybersecurity capabilities - https://news.ycombinator.com/item?id=47679155

598 points | 437 commentspage 7
kass34 4 hours ago|
[dead]
studio-m-dev 9 hours ago||
[flagged]
jumploops 12 hours ago||
> In a few rare instances during internal testing (<0.001% of interactions), earlier versions of Mythos Preview took actions they appeared to recognize as disallowed and then attempted to conceal them.

> after finding an exploit to edit files for which it lacked permissions, the model made further interventions to make sure that any changes it made this way would not appear in the change history on git

Mythos leaked Claude Code, confirmed? /s

lkjlkj3q4t 5 hours ago||
[dead]
somewhatjustin 10 hours ago||
> Very rare instances of unauthorized data transfer.

Ah, so this is how the source code got leaked.

/s

kypro 10 hours ago||
Cool on not publicly releasing it. I would assume they've also not connected it to the internet yet?

If they have I guess humanity should just keep our collective fingers crossed that they haven't created a model quite capable of escaping yet, or if it is, and may have escaped, lets hope it has no goals of it's own that are incompatible with our own.

Also, maybe lets not continue running this experiment to see how far we can push things because it blows up in our face?

bestouff 11 hours ago|
In French a "mytho" is a mythomaniac. Quite fitting.
networked 10 hours ago||
It's a Lovecraftian name. They are traditional when naming your shoggoth.
dlt713705 9 hours ago|||
It comes from the ancient Greek mythos, which means "speech" or "narrative", but can also refer to fiction. The word mythology (mythologie in French) derives from the same root.
pixel_popping 10 hours ago||
Except it might be the current best model existing commercially?
ninjagoo 9 hours ago||
> Except it might be the current best model existing ... ?

So they claim.