Posted by x312 1 day ago
I've recently switched from nearly 30 years in cybersecurity roles into a platform role and I can feel the switch in how I approach problems. They wind up being framed against different priorities and constraints, and it feels like something that's just part of how my mind works.
LLMs don't "perceive roles", and that is exactly the problem.
E.g. map <think> -> THINK <user> -> USER <tool> -> TOOL
If they learn something specific in the chat finetuning stage, this might show LLM its user input text not these tag references.
> It's worth pausing on what this means. LLMs identify roles from an insecure feature (style). This is like identifying a stranger's profession from how they talk and dress rather than by checking their ID.
The LLM is deducing the role of the text from not just the tags, but the style of writing
It might still be the way to go - but calling it 'the real solution' is overselling it.
Thanks for reading.