Glossary

Constitutional AI

Constitutional AI is an approach to training and deploying AI systems in which model behavior is guided by an explicit written set of principles — a "constitution" — rather than only by reinforcement from human feedback.

The term was popularized by Anthropic, whose training method uses a constitution to guide model self-critique and improvement. In enterprise deployment, the concept generalizes: a deployed agent's behavior is bounded by a written policy document that defines scope, prohibitions, escalation rules, and tone.

Training-time constitutional AI and deployment-time agent constitutions are complementary. The model provides a safety floor; the deployment-time constitution adds application-specific rules that the base model cannot know about — your product policies, your compliance posture, your brand voice.

In practice, the deployment-time constitution is enforced at runtime by a policy layer, not just via system prompts, because prompts drift while policies can be audited and versioned.

See also
  • Agent ConstitutionAn agent constitution is the written policy that defines what an AI agent is authorized to do, what it must refuse, how it escalates, and how it speaks — enforced at runtime by a policy layer.
  • AI Agent GovernanceAI agent governance is the set of controls, policies, and audit mechanisms that keep deployed AI agents operating inside defined boundaries.
  • AI AgentAn AI agent is a software system that uses a large language model to perceive its environment, reason about tasks, and take actions in external systems on behalf of a user.