Posted by _tk_ 4 days ago
If so, that’s expected, isn’t it? Is that not exactly what it’s for?
How do you protect yourself against this kind of misuse/jailbreak? Is it just a bunch of prompts? It seems like the fact that LLMs are so trivially jailbroken really limits how you can actually use them in products. How do you navigate these limitations?
As in worried about other countries/organizations using Fable 5 to actually do decent cyber security.
Huh? Presumably if it shipped without guardrails, then it would still have triggered an export control, would you make a plain shirt on the front which says this shirt is a munition on the back?
The munition is the exported good, not the bypass of its safety feature. If anything that the bypass is 3 words long should make the export restriction more justified, not less.
Maybe something like TSA PreCheck.
Of course, that will not stop adversaries from getting access to the model, but it would at least create some level of control.
Voting...