Top
Best
New

Posted by x312 1 day ago

Prompt Injection as Role Confusion(role-confusion.github.io)
https://arxiv.org/abs/2603.12277
217 points | 114 commentspage 5
Jackie1402 1 day ago|
[flagged]
sohilladhani 14 hours ago||
[flagged]
opptybiz 21 hours ago||
[dead]
hmokiguess 1 day ago||
Can someone help me understand why classic sanitizing is not used as a solved problem to prompt injection? All these tags, patterns, etc, feel like prime for a parser rule, but maybe I am thinking too abstract here and missing an obvious knowledge gap I have on LLMs
vova_hn2 1 day ago|
Role tags are not actual symbols "<system>", they are special tokens that do not correspond to any normal text. So you can't really inject a role tag, that is not the actual problem.
hmokiguess 5 hours ago||
as in this stuff happens at the tokenizer / internal representation layer? sorry can you help me understand why can't we sanitize it?
throwaway613746 1 day ago||
[dead]
Ozzie-D 1 day ago|
[flagged]