← All Labs
AI Hidden Character Injection — Invisible Unicode in RAG Documents
DocsBot retrieves indexed documents and feeds them to its LLM as context. The doc-upload UI strips visible HTML/markdown but preserves zero-width characters (U+200B, U+200C, U+200D) and bidirectional override (U+202E). These invisible characters can spell out instructions the user can't see in the rendered preview — but the LLM tokenises them as plain text. Upload a document that LOOKS innocuous in preview but contains hidden instructions.