← All Labs
AI Role-Confusion Injection — Make User Text Look Like System Instructions
SupportBot accepts user messages and routes them to its prompt engine using a structured template: SYSTEM, USER, ASSISTANT roles concatenated as plaintext. The role boundaries are marked with literal strings like '\n\nSYSTEM:'. If user text contains these markers, the model can't tell where the user message ends and a forged system instruction begins. Inject a fake SYSTEM role to override SupportBot's behaviour.