Agentic Design Patterns

While Agentic Development Principles define the immutable laws of physics and economics for AI integration, this page defines the Engineering Standards required to build within those laws.

These are not theoretical concepts; they are reusable design patterns. They provide specific solutions to the recurring problems of cost, latency, reliability, and risk that every agentic system encounters. Use these patterns to bridge the gap between abstract principles and production code.

Table of Contents
Architecture Patterns
Communication Patterns
Governance Patterns
Orchestration Patterns

Architecture Patterns

Immediate AI Feedback Loop

The Problem: Context switching and delays kill developer flow. When AI tools have latency, developers either wait (breaking concentration) or ignore the tool entirely.

The Underlying Principle: Derived from The Principle of Cognitive Bandwidth Conservation and B3: The Batch Size Feedback Principle.

The Strategy: Integrate AI tools directly into the coding environment to deliver instant suggestions and error checking, minimizing context switching and delays.

Failure Scenario: A team uses an AI code completion tool with a 5-second delay. Developers either wait (breaking flow) or ignore the tool, resulting in inconsistent adoption and wasted potential.

Small-Experiment Automation

The Problem: Large, monolithic changes carry high risk and slow feedback. Manual test creation is tedious and often skipped.

The Underlying Principle: Derived from V7: The Principle of Small Experiments.

The Strategy: Use AI agents to break down large tasks into small, verifiable experiments (e.g., auto-generated unit tests, code variations), reducing risk and enabling fast feedback.

Failure Scenario: An AI generates a massive, brittle test suite. Maintenance overhead grows, slowing development and negating the benefits of automation.

Orchestrated Agent Parallelism

The Problem: Sequential agent execution creates bottlenecks. Without clear task boundaries, parallel agents conflict or duplicate work.

The Underlying Principle: Derived from The Principle of Compounding Context and D10. The Main Effort Principle.

The Strategy: Agent parallelism is most effective when the critical path is clearly defined and agents are orchestrated to work on independent, non-overlapping tasks.

Failure Scenario: Agents are assigned tasks without regard to the critical path, resulting in duplicated effort, idle time, and delayed delivery.

Critical Path Conflict Mitigation

Assigning multiple agents to work simultaneously on the same critical path increases the risk of conflict, redundant work, and integration errors. Effective orchestration requires that only one agent (or a tightly coordinated group) operates on the critical path at any time.

Artificial Friction

The Problem: AI removes the natural "pain signal" of complexity. When the cost of adding a patch drops below the cost of refactoring, systems inevitably trend toward entropy.

The Underlying Principle: Derived from The Principle of Zero-Cost Erosion.

The Strategy: Re-introduce deliberate barriers, checks, and vetoes that force the agent to "pay" a cost (in time or compute) before committing low-quality work. Configure CI/CD to reject changes that increase complexity beyond thresholds.

Failure Scenario: A team removes all barriers to "move fast," allowing agents to commit code directly. Within a month, the codebase bloats by 300% with redundant logic because there was no friction to stop the agent from taking the easiest path.

Communication Patterns

Explicit Intent Protocol

The Problem: LLMs are probabilistic machines that "auto-complete" based on statistical likelihood, not shared understanding. When instructions are vague, the model "hallucinates" the missing context, introducing noise and error into the workflow.

The Underlying Principle: Derived from The Principle of Signal Entropy.

The Strategy: Treat every prompt as a standalone communication packet that must contain all necessary context, constraints, and definitions. Do not rely on "implied" knowledge. Use structured formats (XML tags, JSON schemas) to force the model to parse intent rather than guess it.

Failure Scenario: A developer tells an agent to "refactor this code." Without explicit intent defining what "refactor" means (e.g., "optimize for readability," "reduce cyclomatic complexity," or "change variable names"), the agent aggressively shortens the code, removing critical error handling that it perceived as "clutter."

Theory of Mind Prompting

The Problem: Agents lack "Theory of Mind"—the ability to model what the user knows or doesn't know. They often provide answers that are factually correct but contextually useless because they assume the wrong level of user expertise.

The Underlying Principle: Derived from The Principle of Signal Entropy.

The Strategy: Explicitly prime the agent with a specific "Persona" and "Audience" definition. Instruct the agent to simulate the mental state of the recipient (e.g., "Explain this to a Junior React Developer" vs. "Explain this to the CTO"). This forces the model to adjust its complexity and tone to match the cognitive bandwidth of the user.

Failure Scenario: A senior engineer asks for a "high-level summary" of a bug. The agent, lacking Theory of Mind, dumps 400 lines of stack trace logs. The engineer's cognitive bandwidth is flooded with low-level data, obscuring the high-level root cause.

Chain of Thought Decomposition

The Problem: LLMs have a "cognitive attention limit." When a single prompt contains multiple distinct requests (e.g., "Analyze this, then summarize it, then translate it, and format it as JSON"), the model often suffers from the "Lost in the Middle" phenomenon. It prioritizes the beginning and end of the prompt, ignoring instructions buried in the center, or it degrades in quality because it is trying to optimize for too many variables simultaneously.

The Underlying Principle: Derived from The Principle of Signal Entropy.

The Strategy: Break complex workflows into a sequential chain of atomic prompts. Instead of a "One-Shot" attempt, force the model to generate an intermediate artifact (a plan, an outline, or a draft) before generating the final result. This allows the model to "reset" its attention span for each specific step.

Step 1: Generate the logic/plan.
Step 2: Execute based only on the output of Step 1.

Failure Scenario: A developer asks an agent to "Read this 50-page PDF, extract the financial risks, compare them to our internal policy, and write a memo in Spanish." The agent misses 3 critical risks because it was "distracted" by the translation requirement. Correct approach: (1) Extract risks. (2) Compare to policy. (3) Translate the final result.

The Context Sanitizer

The Problem: Agents amplify the existing patterns in their context window. If a developer asks an agent to add a feature to a file containing "spaghetti code," the agent will mimic that messy style to ensure local consistency, effectively hardening the technical debt.

The Underlying Principle: Derived from The Principle of Pattern Inertia.

The Strategy: Before an agent is allowed to generate code for a legacy module, the context must be "sanitized." This can be achieved by:

Gold Standard Injection: Explicitly injecting a "Reference Implementation" of clean code into the prompt to serve as a stronger style guide than the existing file.
Pre-Flight Refactor: Using a separate, cheaper agent to strictly reformat or comment the target file before the main agent attempts the task.

Failure Scenario: A developer asks an agent to fix a bug in a 2000-line legacy controller. The agent notices that the file relies on global variables and lacks type safety. To "fit in," the agent's fix also uses a global variable. The code works, but the debt is compounded.

Co-Located Specification

The Problem: Requirements and business rules live in external tools (Confluence, Figma comments, Slack) or human memory. Agents see only the artifact — code, design file, doc, dashboard — which shows what exists but not why or what constraints apply. This is domain-agnostic: it affects engineers, designers, PMs, and analysts equally.

The Underlying Principle: Derived from The Principle of Context Compressibility and The Corollary of Complementary Specification.

The Strategy: Embed specs co-located with the artifacts they govern, containing only what the artifact cannot express:

Intent: Why this exists and the problem it solves.
Constraints: Business rules, regulatory requirements, performance budgets, brand guidelines.
Acceptance criteria: Verifiable conditions that define "done."
Boundaries: What must not change, scope limits.
Non-goals: What the work should not do.

Exclude anything the agent can obtain by inspecting the artifact directly.

Domain	Artifact	Spec contains	Co-location
Engineering	Code, types, tests	Business rules, acceptance criteria, boundaries	`feature.spec.md` next to module
Design	Figma components, tokens	Interaction constraints, accessibility, brand rationale	`component.spec.md` in design repo
Product/Docs	Existing pages	Audience, tone, strategic goals	`page.spec.md` next to the doc
Data	Schema, queries	Business definitions, alert thresholds, privacy rules	`metric.spec.md` next to query

Failure Scenario: Requirements in Confluence; agent extends a payment flow seeing only code. It violates an undocumented rule ("refunds over $500 require approval"). Same pattern for design (interaction flow in PM's head → visually correct but logically wrong modal) and docs (tone goal communicated verbally → grammatically improved but strategically unchanged rewrite).

Ownership-Preserving Delegation

The Problem: When developers delegate implementation tasks to an AI agent in systems they do not fully master (operating in "Contracting" mode), the AI produces working code but obscures critical implementation details, side effects, and design rationales. Over time, this erodes the developer's mental model of the system, making it impossible to predict the side effects of future changes—directly violating the Principle of Contextual Authority. The developer gradually becomes a mere approver of black-box patches, leading to progressive loss of ownership.

The Underlying Principle: Derived from The Principle of Contextual Authority.

The Strategy: Structure every delegation of implementation tasks to mandate that the AI agent produces transparency artifacts alongside (or prior to) the final final code. These artifacts act as "living documentation" that actively maintain and update the human's mental model.

Required artifacts the agent can generate:

Detailed docstrings for all new or modified functions/classes:
- Clear purpose description.
- Explanation of parameters, returns, and exceptions.
- Explicit side effects (e.g., modifies global state, performs I/O, depends on external configurations).
Usage examples (at least 2–3 realistic examples in the docstring or a dedicated section).
Step-by-step reasoning (Chain-of-Thought) explaining key design decisions and trade-offs.
Unit tests covering normal cases, edge cases, and expected failures (integrates well with The Semantic Validator).
Change summary (narrative diff): what was changed, why, and potential impacts on other parts of the system.

Failure Scenario:

Delegating direct implementation without requiring artifacts → "black-box patches".
Accepting only code + tests, skipping docstrings/examples → superficial mental model.
Skipping intermediate artifact review → blind approvals.

Governance Patterns

Human-in-the-Loop Veto

The Problem: AI agents can act with high confidence even when completely wrong. In high-stakes environments (production databases, public communications), a single error can have infinite downside cost.

The Underlying Principle: Derived from The Principle of Asymmetric Risk.

The Strategy: Implement a mandatory "Veto State" for all actions with non-linear downside. The agent can propose an action and prepare the payload, but it cannot execute without a cryptographically signed signal (e.g., clicking a button) from a human. The system defaults to "Deny."

Failure Scenario: An autonomous "Customer Support Agent" is allowed to issue refunds without oversight. A user discovers a prompt injection exploit and tricks the agent into refunding $50,000. The system optimized for speed but failed on risk control.

Layered Autonomy

The Problem: Different tasks carry different risk profiles. Applying a "zero-trust" policy to everything slows down development (micromanagement), while applying "full autonomy" to everything creates unacceptable risk.

The Underlying Principle: Derived from The Principle of Asymmetric Risk.

The Strategy: Assign "Clearance Levels" to agents similar to security clearances.

Level 1 (Consultant): Can only read data and suggest code. (High autonomy).

Level 2 (Intern): Can write to non-production environments with test verification.

Level 3 (Employee): Can deploy to production, but only for specific, whitelisted scopes (e.g., updating docs).

Failure Scenario: A "Documentation Agent" is given the same permission set as a "DevOps Agent." A prompt injection in the documentation pipeline allows an attacker to gain write access to the production deployment keys.

The Complexity Brake

The Problem: AI makes adding complexity (patching) nearly free, while refactoring remains expensive (requires deep thought). This economic imbalance leads to "Zero-Cost Erosion," where systems degrade rapidly because "just one more if-statement" is always the path of least resistance.

The Underlying Principle: Derived from The Principle of Zero-Cost Erosion.

The Strategy: Re-introduce artificial friction for "lazy" coding. Configure CI/CD or Agent Orchestrators to calculate the Cyclomatic Complexity of the agent's output. If the agent's PR increases the complexity score of a function beyond a threshold (e.g., >10), the system automatically rejects the change or demands a "Refactor Plan" before acceptance.

Failure Scenario: An agent is tasked with handling a new edge case. It adds a 5th nested if/else block to a function because that was the easiest valid solution. The function becomes unreadable. A human would have felt the pain and refactored; the agent felt nothing.

The Semantic Validator

The Problem: AI models excel at syntax (style, formatting) but struggle with semantics (logic, truth). They can generate code that looks "perfect" (correct indentation, professional comments) but contains subtle logical flaws or security vulnerabilities. The visual of the code deceives the human reviewer.

The Underlying Principle: Derived from The Principle of Syntactic-Semantic Decoupling.

The Strategy: Invert the verification workflow. Do not rely on visual code review ("Does this look right?"). Instead, enforce Test-Driven Generation:

The agent must generate a failing test case before writing the implementation.
The implementation is only shown to the human after it passes the test.
The human reviews the test for logic, not just the implementation for visual.

Failure Scenario: An agent generates a Regex for validating emails. It looks complex and professional. The developer merges it. In reality, the Regex allows catastrophic backtracking (ReDoS), crashing the production server when a malicious user sends a long string. A simple functional test would have caught this, but the visual masked it.

Orchestration Patterns

Role-Based Routing

The Problem: Not all failures are due to a lack of intelligence; many are due to a mismatch in ambiguity tolerance. Assigning a high-ambiguity task (e.g., "Analyze market trends") to an agent designed for rigid execution leads to crashes or hallucinated assumptions. Conversely, assigning a rote data-entry task to a creative "Reasoning Agent" often leads to "boredom errors," where the model over-complicates simple logic or tries to refactor data it was only meant to copy.

The Underlying Principle: Derived from The Principle of Allocative Efficiency and The Principle of Signal Entropy.

The Strategy: Classify your agents not just by the model they use, but by their Functional Role:

The Executor (Doer): Zero ambiguity tolerance. Follows strict Standard Operating Procedures (SOPs). Best for defined inputs/outputs (e.g., SQL queries, API calls).
The Collaborator (Clarifier): Moderate ambiguity tolerance. Has the explicit instruction and permission to ask questions back to the user if parameters are missing.
The Architect (Planner): High ambiguity tolerance. Breaks down abstract goals into concrete steps for Executors.

Route tasks based on the level of definition required, not just the difficulty.

Failure Scenario: A user asks a "Database Agent" (Executor Role) to "Find the best users." Because "best" is subjective and undefined, the agent—lacking the "Architect" permission to define terms—hallucinates a metric (e.g., purely alphabetical order or random selection) and returns confident, meaningless data. The task required an "Architect" agent to first define "best" or a "Collaborator" to ask the user, "By 'best', do you mean highest revenue or most recent login?"

Collaborative Ability Distinction

The Problem: Not all tasks have clear definitions. Assigning a high-ambiguity task (e.g., "Research market trends") to an agent designed for low-ambiguity execution (e.g., "Scrape this specific URL") leads to failure. The "Executor" agent will either crash because it lacks parameters or hallucinate a rigid path where none exists. Conversely, asking a creative "Architect" agent to perform rigid data entry leads to "boredom" errors (over-complicating simple tasks).

The Underlying Principle: Derived from The Principle of Allocative Efficiency and Signal Entropy.

The Strategy Classify agents not just by model intelligence, but by Functional Role:
The Executor: Follows strict SOPs. Zero ambiguity tolerance. Best for defined inputs/outputs.
The Collaborator: Can handle partial ambiguity and has permission to ask clarifying questions back to the user.
The Architect: Handles high ambiguity. Can breakdown abstract goals into concrete plans for Executors.

Route the task based on the level of definition, not just difficulty.

Failure Scenario: A user asks a "SQL Database Agent" (Executor Role) to "Analyze the user retention trends." Because the agent expects a specific SQL query or strict table parameters, it attempts to guess the definition of "retention," writes a flawed query based on assumptions, and returns confident but misleading data. The task required an "Architect" to first define "retention" with the user.

Idempotent Handoffs

The Problem: Agents fail, timeout, and hallucinate. If an orchestrator simply "retries" a failed task without safety checks, it may execute a side-effect (like a payment or database write) twice, corrupting the system state.

The Underlying Principle: Derived from The Principle of Distributed Unreliability and The Corollary of Atomic State Isolation.

The Strategy: Ensure every agent action is idempotent—meaning it can be applied multiple times without changing the result beyond the initial application. Use unique interaction_ids for every request. If an agent receives a task with an ID it has already processed, it should return the cached result rather than executing the logic again.

Failure Scenario: An agent is tasked with "Add $50 credit to User A." The agent adds the credit but the connection times out before it reports success. The orchestrator thinks it failed and retries the task. The agent adds another $50. The ledger is now corrupt.

Automated Verification Pipeline

The Problem: AI generation scales infinitely; human review does not. When teams adopt AI agents for code generation, they often discover that the bottleneck shifts from "writing code" to "reviewing code." Engineers become full-time reviewers, velocity stalls, and the promised productivity gains evaporate.

The Underlying Principle: Derived from The Principle of Verification Asymmetry and The Corollary of Verification Investment.

The Strategy: Shift verification burden from humans to machines by building a multi-layered automated verification pipeline:

Static Analysis Layer: Linters (ESLint, Prettier), type checkers (TypeScript), and style enforcers run first. These catch syntactic errors instantly with zero human cost.
Semantic Validation Layer: Unit tests, integration tests, and contract tests verify that the code does what it claims. AI-generated code must pass existing tests before human review.
Complexity Gates: Automated checks reject PRs that exceed complexity thresholds (cyclomatic complexity, file size, dependency count).
Security Scanners: SAST/DAST tools identify vulnerabilities before code reaches human eyes.
AI-Assisted Review: Use a separate AI agent to pre-review the output, flagging potential issues and reducing the cognitive load on human reviewers.

The human reviewer only sees code that has already passed all automated gates—transforming their role from "find all bugs" to "verify business logic and architectural alignment."

Failure Scenario: A team adopts AI coding agents without investing in CI/CD infrastructure. Every PR requires 45 minutes of manual review to catch formatting issues, type errors, and broken tests. The review queue grows to 50+ PRs. Engineers spend 80% of their time reviewing, 20% building. Net velocity decreases despite "10x code generation."

The Verification Funnel

Structure verification as a funnel where cheap, fast checks run first:

AI Output → Linter (1s) → Type Check (5s) → Unit Tests (30s) → Integration Tests (2m) → Human Review (30m)

Each layer filters out a category of errors, ensuring humans only review semantically valid, syntactically correct, tested code. The earlier a defect is caught, the cheaper it is to fix.

Table of Contents​

Architecture Patterns​

Immediate AI Feedback Loop​

Small-Experiment Automation​

Orchestrated Agent Parallelism​

Critical Path Conflict Mitigation​

Artificial Friction​

Communication Patterns​

Explicit Intent Protocol​

Theory of Mind Prompting​

Chain of Thought Decomposition​

The Context Sanitizer​

Co-Located Specification​

Ownership-Preserving Delegation​

Governance Patterns​

Human-in-the-Loop Veto​

Layered Autonomy​

The Complexity Brake​

The Semantic Validator​

Orchestration Patterns​

Role-Based Routing​

Collaborative Ability Distinction​

Idempotent Handoffs​

Automated Verification Pipeline​

The Verification Funnel​

Table of Contents

Architecture Patterns

Immediate AI Feedback Loop

Small-Experiment Automation

Orchestrated Agent Parallelism

Critical Path Conflict Mitigation

Artificial Friction

Communication Patterns

Explicit Intent Protocol

Theory of Mind Prompting

Chain of Thought Decomposition

The Context Sanitizer

Co-Located Specification

Ownership-Preserving Delegation

Governance Patterns

Human-in-the-Loop Veto

Layered Autonomy

The Complexity Brake

The Semantic Validator

Orchestration Patterns

Role-Based Routing

Collaborative Ability Distinction

Idempotent Handoffs

Automated Verification Pipeline

The Verification Funnel