Glossary

Adversarial Example (AI)

Reviewed 9 April 2026 Canonical definition

An adversarial example is a carefully crafted input — often imperceptibly different from a legitimate input — designed to cause an AI model to produce a wrong or harmful output. In agentic AI, adversarial examples can be embedded in documents, web pages, or tool outputs that an agent processes, causing it to take unintended actions.

Adversarial Example (AI)

Related articles

Related terms