Governance of Agency

The Governance of Agency

Humans must explicitly define the scope, authority, escalation paths, and risk boundaries for every agent. No agent can safely or reliably determine its own limits. Without a clear, written constitution of delegation, agentic systems drift into misalignment, overreach, or collapse.

The Principle of Delegated Agency Scaling

Autonomy is not a binary setting but a variable bounded by verification capability: you cannot delegate authority where you cannot automate accountability. Agency granted beyond the system's capacity to automatically validate output does not produce more throughput—it produces unverified risk. Low-risk or easily verifiable tasks therefore admit high autonomy; high-risk or subjective tasks admit only restricted agency (consultant mode). This bound operationalizes The Corollary of Graduated Agency by Structure and Risk and is constrained by The Principle of Verification Asymmetry.

Failure Scenario: Delegating complex build optimization to AI leads to short-term gains but introduces critical errors, increasing rework and risk.

The Corollary of Automated Guardrail Prerequisite

Before granting full autonomy to AI agents for a task, ensure robust automated safety nets (CI/CD, test suites) are in place. Automation must be checked by automation to prevent catastrophic failures.

The Corollary of Trust-Gated Orchestration

The velocity and scale of agent orchestration are strictly limited by the "Trust Latency" of the human operator. If a human must verify every intermediate "real task" within a workflow, the system degrades from an autonomous fleet into a synchronous, manual approval queue. True orchestration is only possible when the cost of verification is significantly lower than the cost of execution. Therefore, trust—built on robust provenance and observability—is not a sentiment, but a functional requirement for scaling.

Failure Scenario: A manager deploys a team of five agents to optimize marketing campaigns but requires manual approval for every keyword selection and ad copy variant. The "orchestration" becomes a bottleneck where the manager spends more time reviewing low-stakes decisions than if they had done the work themselves, stalling the entire workflow.

The Principle of Asymmetric Risk

The economics of automation are governed by convexity: the cost of verification is often linear (time spent reviewing), but the cost of failure can be non-linear (catastrophic data loss, security breaches). Agency must be capped not by the capability of the model, but by the bounds of the downside risk. When the "blast radius" of an error is infinite (e.g., production database access), autonomy must be zero, regardless of the model's intelligence.

Failure Scenario: An autonomous agent is given write access to the production environment to "fix a small bug." It hallucinates a command that drops a critical table. The $5 saved in developer time results in a \$ 500,000 outage.

The Corollary of Bounded Edit Radius

An agent given write access to a region treats the entire region as eligible for modification; edits diffuse outward from the requested target because each local change is locally plausible. Because blast radius is what governs risk, the agent's editable surface—not just its executable actions—must be constrained: scoped to specific files, specific functions, or change budgets enforced by tooling. Without this, a one-line bug fix can return as a 200-line PR that is individually defensible and collectively unreviewable, silently expanding the asymmetric downside this principle exists to bound.

The Corollary of Graduated Agency by Structure and Risk

Agency—the degree of autonomous decision-making granted to an agent—must scale inversely with problem structure and consequence severity. Grant high autonomy only to probabilistic components operating in ill-structured, low-risk domains where variability is tolerable and exploration adds value. In well-structured or high-stakes contexts, constrain agency through deterministic rules, mandatory verification steps, or human-in-the-loop escalation. This follows necessarily from Asymmetric Risk combined with The Principle of Problem Structure Allocation: given convex downside, and variability that adds value only in ill-structured domains, no other distribution of autonomy is coherent.

Failure Scenario: An enterprise deploys a fully autonomous agent for customer refund processing (a partially structured task with high financial risk). The LLM probabilistically interprets ambiguous return policies, occasionally approving ineligible claims and causing significant revenue leakage—because agency was not calibrated to the mixed structure and elevated risk profile.

The Corollary of Risk-Structured Delegation

Map every agent workflow to a risk-structure matrix: low-risk/ill-structured → maximal LLM agency; high-risk/well-structured → minimal agency with deterministic overrides. Intermediate cases require hybrid escalation paths, ensuring probabilistic flexibility never bypasses non-negotiable constraints. This calibration aligns with asymmetric risk tolerance: accept variability where upside outweighs downside, enforce certainty elsewhere.

The Principle of Contextual Authority

An AI agent's effective capability is capped by the operator's ownership and mental model of the system. When the operator has deep knowledge (ownership), AI acts as an extension of will, amplifying intent. When the operator lacks deep knowledge (contracting), AI acts as a temporary shield against complexity, hiding implementation details the operator cannot evaluate.

You cannot safely delegate authority to an agent when you cannot predict the side effects of its output. This bounds agency even when the model is capable, because the operator cannot supply reliable review, escalation, or rollback judgment. This constraint interacts with The Principle of Delegated Agency Scaling, The Principle of Asymmetric Risk, and The Principle of Context Compressibility.

Failure Scenario: A contractor uses an AI agent to close a ticket in a legacy codebase they do not understand. The AI suggests a solution that works perfectly in isolation but relies on an internal API scheduled for deprecation. Because the operator lacks the Contextual Authority to know the API history, they accept the solution, solving the ticket today but creating a guaranteed failure for the next release.

The Corollary of Ownership Declaration

Before delegating meaningful agency, explicitly declare whether the operator is acting as an owner (has the mental model to predict side effects) or a contractor (does not). In contracting mode, restrict the agent to low-risk exploration and require an owner to review decisions with irreversible consequences.

The Corollary of Side-Effect Predictability Gates

Only allow an agent to execute irreversible actions (writes, merges, deployments) when the operator can articulate the likely side effects and the rollback path. If the operator cannot, force isolation (e.g., staging/sandbox, read-only tools), aligning with The Principle of Execution Isolation.

The Governance of Agency​

The Principle of Delegated Agency Scaling​

The Corollary of Automated Guardrail Prerequisite​

The Corollary of Trust-Gated Orchestration​

The Principle of Asymmetric Risk​

The Corollary of Bounded Edit Radius​

The Corollary of Graduated Agency by Structure and Risk​

The Corollary of Risk-Structured Delegation​

The Principle of Contextual Authority​

The Corollary of Ownership Declaration​

The Corollary of Side-Effect Predictability Gates​