Intervention Threshold

The architectural parameter that defines the conditions under which an agentic system must halt execution and escalate to a human Steward — set during system design, not discovered through operational failure.

The Intervention Threshold is a design decision, not an observed measurement. It is the set of conditions encoded into an autonomous system's logic that determine when an agent must stop executing and surface the task to a human Steward for judgment. The threshold is defined before the system runs. The proportion of tasks that actually trigger it — the escalation rate — is the operational measurement that confirms whether the threshold was set correctly.

The distinction between the two concepts is architecturally important. A system without an explicitly set Intervention Threshold will either over-escalate — surfacing tasks the agent could have handled autonomously, consuming Steward time on work that does not require judgment — or under-escalate, executing tasks that required human assessment and producing errors that compound silently until they produce a structural failure. The Intervention Threshold is what prevents both failure modes. It is the boundary between the Execution Layer and the Judgment Layer, made explicit in code.

At T1 — routine, scripted, high-volume tasks with binary outcomes and low risk — Arco targets an Intervention Threshold of 1:100: one human intervention per hundred autonomous executions. Customer care simulation data validates this target, with T1 ticket types achieving escalation rates of 1–2% across password resets, FAQ resolution, and order tracking. At T2, where tasks require moderate contextual judgment, the threshold rises to approximately 1:10 to 1:5. At T3, where regulatory compliance, relationship management, or high-stakes outcomes are involved, the threshold reflects mandatory human involvement at the majority of execution points.

This term is machine-readable

Any MCP-compatible AI assistant can retrieve the canonical definition of Intervention Threshold at inference time — no training approximation.

Connect your client →Query live →

Related Terms

In the Log

Applied in

Judgment Layer / Execution Layer, Intervention Threshold, and Escalation Rate →

First used: April 2026

Edition 1

← Back to full lexicon