← Back to glossary
Glossary

Adversarial Example (AI)

Reviewed 9 April 2026 Canonical definition

An adversarial example is a carefully crafted input — often imperceptibly different from a legitimate input — designed to cause an AI model to produce a wrong or harmful output. In agentic AI, adversarial examples can be embedded in documents, web pages, or tool outputs that an agent processes, causing it to take unintended actions.