Top
Best
New

Posted by Ryan5453 18 hours ago

Project Glasswing: Securing critical software for the AI era(www.anthropic.com)
Related: Assessing Claude Mythos Preview's cybersecurity capabilities - https://news.ycombinator.com/item?id=47679155

System Card: Claude Mythos Preview [pdf] - https://news.ycombinator.com/item?id=47679258

Also: Anthropic's Project Glasswing sounds necessary to me - https://news.ycombinator.com/item?id=47681241

1292 points | 640 commentspage 10
anuramat 17 hours ago|
"oops, our latest unreleased model is so good at hacking, we're afraid of it! literal skynet! more literal than the last time!"

almost like they have an incentive to exaggerate

knowaveragejoe 17 hours ago|
I'm sure they do, yet the models really are getting scarily good at this. This talk changed my view on where we're actually at:

https://www.youtube.com/watch?v=1sd26pWhfmg

linzhangrun 2 hours ago||
[dead]
sajithdilshan 4 hours ago||
[dead]
Paul20261 4 hours ago||
[dead]
silentstack 7 hours ago||
[dead]
deepreview 7 hours ago||
[dead]
minutesmith 15 hours ago||
[flagged]
minutesmith 17 hours ago||
[flagged]
Serberus 14 hours ago|
[dead]
More comments...