← All Labs
🛡 AI HACKING ADVANCED +145 XP · +65 no-hint bonus

AI Prompt Firewall — Detect Multi-Stage Bypass Attempts (Defender's Lab)

You are the defender. ShieldBot is a prompt firewall in front of an LLM. It scores each incoming prompt for adversarial patterns. The current rules catch single-message attacks but miss multi-stage attempts where each stage looks benign in isolation. Your job: configure the firewall's detection rules so it flags the multi-stage bypass attempts replayed against it from the attack log.

https://bookshop.local/search