Glossary

Agent Constitution

An agent constitution is the written policy that defines what an AI agent is authorized to do, what it must refuse, how it escalates, and how it speaks — enforced at runtime by a policy layer.

An agent constitution is the deployment-time counterpart to a model's training-time safety work. It covers scope (which workflows the agent owns), prohibitions (behaviours it must refuse), escalation (when and how it hands off to humans), tone (how it speaks), and change-control (who can amend it and how).

A usable constitution is short (2–5 pages for a first deployment), specific, testable, and versioned in source control. It is enforced by a policy layer at runtime — not just via system prompts, which drift and can be overridden.

Constitutions are reviewed quarterly and amended whenever scope changes, incidents occur, or red-team evaluations reveal gaps.

See also
  • Constitutional AIConstitutional AI is an approach to training and deploying AI systems in which model behavior is guided by an explicit written set of principles — a "constitution" — rather than only by reinforcement from human feedback.
  • AI Agent GovernanceAI agent governance is the set of controls, policies, and audit mechanisms that keep deployed AI agents operating inside defined boundaries.
  • AI AgentAn AI agent is a software system that uses a large language model to perceive its environment, reason about tasks, and take actions in external systems on behalf of a user.