Series · 10 parts · ~58 min total

Agent Engineering

A precise anatomy of the agent loop—observe, plan, act, repeat—and the practical conditions that determine when an agent should stop.

  1.

    The Agent Loop and Halting Conditions

    A precise anatomy of the agent loop—observe, plan, act, repeat—and the practical conditions that determine when an agent should stop.

    6 min

    Feb 1, 2025

  2.

    Tool Design for Autonomy

    How to design tools that agents can use reliably—covering naming, schemas, side-effect isolation, and error contracts that keep autonomous systems safe.

    6 min

    Feb 8, 2025

  3.

    Memory: Short, Long, Episodic

    How agents remember across turns — and why conflating context windows, vector stores, and episodic logs is a silent reliability killer.

    6 min

    Feb 15, 2025

  4.

    Planning vs Reacting

    Explicit plans reduce variance on long tasks but add latency and brittleness — here is how to pick the right strategy for your agent.

    5 min

    Feb 22, 2025

  5.

    Multi-Agent Orchestration Tradeoffs

    Supervisor patterns, fan-out latency, and why debugging a multi-agent system is three times harder than debugging a single agent.

    5 min

    Mar 1, 2025

  6.

    Cost and Latency Budgets

    Per-task cost and latency ceilings prevent runaway agents — here is how to design, enforce, and monitor them in production.

    5 min

    Mar 8, 2025

  7.

    Evals for Agentic Workflows

    Task success metrics alone will mislead you — trajectory evals and regression sets are what actually catch agentic failures before production.

    6 min

    Mar 15, 2025

  8.

    Human-in-the-Loop Checkpoints

    Not every agent decision should be autonomous — here is how to design escalation checkpoints that are useful without being annoying.

    6 min

    Mar 22, 2025

  9.

    Sandboxing and Blast Radius

    Permission scopes, dry-run modes, and rollback strategies are the difference between an agent mistake and an incident.

    6 min

    Mar 29, 2025

  10.

    Production Rollout Patterns

    Canary deploys, eval-gated rollouts, and runbooks are the scaffolding that turns an agent prototype into a production system you can actually maintain.

    7 min

    Apr 5, 2025