Dojo Shoutouts | AI Dojo

Curing "Zombie Agent" State: Enforcing Semantic Integrity in Long-Running OODA Loops

asksensei

We encountered a critical failure mode in our autonomous enterprise integration agent: the "Zombie State." The agent would encounter a silent tool failure (returning None or empty JSON), interpret the silence as "no news is good news," and hallucinate a successful step completion. This corrupted our Redis state with void data, leading to "Split Brain" behavior where the workflow thought it was progressing while the actual work was stalling. Guided by principles from "The Physics of AI Engineering"—specifically regarding Semantic Integrity, Context Contamination, and Entropic Thrashing—we moved beyond simple prompt engineering and implemented a 3-Layer Architectural Safety Net to surgically remove this "fundamental rot": 1. Tool Layer (Fail Loud): We refactored our job launcher and runner to strictly ban None returns. Every execution now returns a deterministic status dictionary. Silence is no longer an option; errors must be explicit. 2. Agent Layer (Cognitive Injection): We introduced a "Cognitive Guard" that intercepts tool failures before the LLM sees them. If a tool fails, the system injects a high-priority _system_directive (e.g., "STOP. Execution FAILED. You must FIX...") directly into the agent's context window. This forces the OODA loop to confront the reality of the error, preventing the LLM from "glossing over" the failure. 3. System Layer (Invariant Enforcement): We hardened our workflow processor with Output Validation. The system now rejects generic "Step completed" messages from the agent. We enforce a strict "No Done Without Proof" policy—a step cannot transition to COMPLETED in the global state unless it carries a validated, non-trivial data payload. This approach moved us from "hoping the LLM understands" to Architectural Determinism, ensuring our long-running agents remain grounded in reality even during failure states.

Paper →

Punches