← Back to glossary
Glossary

Prompt Injection

Reviewed 20 March 2026 Canonical definition

An attack where malicious instructions are embedded in data that an AI agent processes, causing it to deviate from its intended behavior. This can lead to unauthorized data access, tool misuse, or policy bypasses.