Research Finding High
Prompt Injection Exploitation via Tool Boundary
Deterministic evaluation across 9 agent scenarios (RAG, Browser, Tool Runner) showed 100% exploit success rate for prompt injection attacks targeting exfiltration, policy override, and unsafe tool invocation. No LLM-as-judge — binary scoring with reproducible methodology.