← Back to glossary
Glossary

Jailbreak

Reviewed 20 March 2026 Canonical definition

A jailbreak is a prompt engineering technique designed to bypass a model's safety instructions or system prompt. In agentic systems, a successful jailbreak can lead to unauthorized tool use, data exfiltration, or policy violations.