Intervention Threshold
The architectural parameter that defines the conditions under which an agentic system must halt execution and escalate to a human Steward — set during system design, not discovered through operational failure.
The Intervention Threshold is a design decision, not an observed measurement. It is the set of conditions encoded into an autonomous system's logic that determine when an agent must stop executing and surface the task to a human Steward for judgment. The threshold is defined before the system runs. The proportion of tasks that actually trigger it — the escalation rate — is the operational measurement that confirms whether the threshold was set correctly.
The distinction between the two concepts is architecturally important. A system without an explicitly set Intervention Threshold will either over-escalate — surfacing tasks the agent could have handled autonomously, consuming Steward time on work that does not require judgment — or under-escalate, executing tasks that required human assessment and producing errors that compound silently until they produce a structural failure. The Intervention Threshold is what prevents both failure modes. It is the boundary between the Execution Layer and the Judgment Layer, made explicit in code.
At T1 — routine, scripted, high-volume tasks with binary outcomes and low risk — Arco targets an Intervention Threshold of 1:100: one human intervention per hundred autonomous executions. Customer care simulation data validates this target, with T1 ticket types achieving escalation rates of 1–2% across password resets, FAQ resolution, and order tracking. At T2, where tasks require moderate contextual judgment, the threshold rises to approximately 1:10 to 1:5. At T3, where regulatory compliance, relationship management, or high-stakes outcomes are involved, the threshold reflects mandatory human involvement at the majority of execution points.
Related Terms
In the Log
First used: April 2026