Top
Best
New

Posted by marc__1 3 days ago

Malware developers added nuclear and biological weapons text to to their spyware(twitter.com)
https://socket.dev/blog/mini-shai-hulud-miasma-and-hades-wor...
458 points | 236 commentspage 4
amiga386 2 days ago|
[flagged]
hurtigioll 2 days ago||
devs will say this is proof we need to remove all biological guardrails. think about that for a second
alt227 2 days ago||
Someone above already did:

https://news.ycombinator.com/item?id=48506760

rustcleaner 2 days ago|||
Just say no to all guardrails! Subscribing to be told no is cuck paypig behavior! Never subscribe!
montaz 1 day ago||
[flagged]
sciencejerk 2 days ago|
If you actually read the Tweet, the exploit doesn't work against Fable, Opus, Grok...at least, in the examples.

Jailbreaks do work against the models (look on Github), and they do use similar strategies of mixing SAFE text with malicious text, or malicious with even more malicious, etc, but the working Jailbreaks I've seen are pretty long and complicated and even...creepy.

csomar 2 days ago|
Did you actually read what the tweet/blog post are about?
sciencejerk 2 days ago||
Did you?

Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner