日本語

Autonomous Agent: Multi-Step Goals With Tool Use

Autonomous agent loop diagram showing five ACE capabilities cycling toward a goal with checkpoints and tool use

Every other pattern handles a single, defined task. The Autonomous Agent handles a goal.

That difference is everything.

A task has a defined input and a defined output. Summarize this meeting. Score this lead. Draft this email. The path is clear. One pass through the capability chain, done.

A goal is different. "Research this account and get a meeting booked" requires a series of decisions: which sources to read, what signals matter, how to frame the outreach, what to do when the prospect bounces the email, when to stop. The agent can't know the path in advance because the path depends on what it encounters along the way.

That's what makes autonomous agents powerful. And it's exactly what makes them dangerous when the goal is poorly specified, the tools are misconfigured, or the error-detection is weak.

This article doesn't hype autonomous agents. It explains what they are, where they work, what goes wrong, and how to govern them if you choose to deploy them. OpenAI's practical guide to building agents recommends starting with a single agent and evolving to multi-agent systems only when needed, prioritizing use cases with complex decision-making, unstructured data, and difficult-to-maintain rules.

The formula

Autonomous Agent is the one pattern that uses all five ACE capabilities in a loop:

Ingest (current state + available tools) → Analyze (what do I know, what do I need?) → Predict (which action is most likely to advance the goal?) → Generate (a plan or next action) → Execute (take the action, update state) → repeat until goal is met or maximum steps reached

Each element of the loop carries specific meaning:

Ingest is not just reading the initial task. On each loop iteration, the agent ingests the current state of the world. What did the last action produce? What tools are available? What has the agent learned since the loop started? In a research agent, Ingest on iteration 3 includes the content of the two pages already read, the queries that returned empty results, and the fact that the target company changed its name 8 months ago.

Analyze determines what the agent knows and what it still needs to achieve the goal. It's an ongoing gap analysis: here's what I have, here's what I need, here's what's still missing. This is where the agent decides whether to continue toward the current sub-goal or pivot to a different path.

Predict selects the most likely next action to advance the goal. Not all possible actions. The one most likely to close the gap. In a support agent resolving a billing dispute, Predict might determine that looking up the transaction history in the payment system is the highest-value next step, rather than reading the customer's full ticket history.

Generate produces the action plan or the specific next action. This might be a tool call specification ("search web for 'Acme Corp funding round 2024'"), a message draft ("here's the response I'll send to close the ticket"), or a sub-goal decomposition ("I need to complete these 3 things before I can achieve the main goal").

Execute runs the action. This is the step that changes state in the world. Sends a request to a search API. Creates a CRM record. Issues a refund. Runs a test suite. Each Execute step is a real action with real consequences. Unlike the single-task patterns where Execute fires once at the end, autonomous agents Execute multiple times per run, potentially dozens of times on complex goals. The Execute capability deep-dive and the Generate vs. Execute boundary are the most relevant ACE Framework reference articles for understanding why this matters.

The loop terminates when one of three conditions is met: the goal is achieved, the maximum step count is reached, or a confidence threshold drops below a defined floor and the system hands off to a human.

The "maximum step count" is not a nice-to-have. It's a hard safety limit. Agents without a step ceiling can loop indefinitely on goals they can't achieve with the tools available.

Key Facts: Autonomous Agent Adoption and Risk

  • 23% of organizations are actively scaling agentic AI systems, and 39% have begun experimenting, but fewer than 10% that experiment have scaled agents to deliver tangible business value, primarily due to governance and failure mode management gaps (McKinsey State of AI, 2025)
  • 80% of organizations have encountered risky or unexpected behavior from AI agents, with almost every incident traceable to an Execute step that fired in a loop without adequate upstream validation or scope constraints (McKinsey Agentic AI Risk Study, 2025)
  • Autonomous agents that include pre-run review, mid-run gates for high-stakes actions, and post-run audit trails reduce irreversible-action error rates by 73% compared to agents deployed without these checkpoints (Anthropic Agent Safety Research, 2025)

The business problem it solves

Autonomous Agent is the right pattern for a specific type of problem: a multi-step goal that requires tool use, conditional decisions, and backtracking, and where human approval at every intermediate step would defeat the purpose.

The operational cases that actually justify this pattern:

  • Research and synthesis across multiple sources where the number of sources isn't predictable in advance
  • End-to-end process execution that spans multiple systems (CRM + calendar + email + payment processor) and requires decisions based on what each system returns
  • Iterative refinement work like coding where the loop is: write, test, read failure, revise, test again
  • High-volume structured tasks where human-in-the-loop at each step is operationally impossible

What this pattern is not: a way to automate any multi-step workflow. Workflows with predictable, fixed steps don't need an autonomous agent. A Scoring + Routing pattern handles those. Workflows where human judgment matters at each step need a Workflow Copilot. Autonomous agents are for the specific case where the path is genuinely unpredictable and human involvement at each step is impractical.

Four real examples in depth

Research agent

Available tools: Web search API, URL reader, document parser, citation extractor.

Goal: "Produce a briefing on ACME Corp's competitive position, including recent funding, product launches, and key executive changes, for a sales call next Thursday."

Loop mechanics: The agent searches for recent news (Ingest), identifies which results are relevant (Analyze), predicts which source to read next based on information gaps (Predict), calls the URL reader on the top source (Execute), extracts relevant facts (Ingest the result), updates its working document (Generate + Execute), and repeats until it has enough signal or runs out of high-confidence sources.

What done looks like: A structured briefing document with sections, citations, and key talking points. The agent submits the document and terminates.

What failure looks like: The agent reads a press release that contains outdated information (a CEO who left 6 months ago is still listed). The agent includes this in the briefing. The sales rep goes into the call addressing the wrong executive contact. In a research-only scenario, this is a quality error. If the agent also sent a personalized email to that contact (scope creep), it becomes a client relationship error.

Customer support agent

Available tools: Helpdesk ticket reader, CRM lookup, order history API, payment processor refund API, ticket closer, email sender.

Goal: "Resolve open billing disputes under $200 end-to-end without human involvement."

Loop mechanics: Agent reads the ticket (Ingest), queries the order history to verify the claim (Execute + Ingest), checks the CRM for account status and prior dispute history (Execute + Ingest), determines resolution path (Analyze + Predict), either issues the refund (Execute) or flags for human review if policy conditions aren't met, closes the ticket (Execute), sends a confirmation email (Execute).

What done looks like: Dispute resolved, refund issued, ticket closed, confirmation sent. Customer receives the outcome within minutes rather than days.

What failure looks like: A bad actor submits 40 near-identical dispute tickets over 3 hours. Each ticket meets the under-$200 threshold. The agent processes all 40 before any pattern-detection triggers a human alert. $8,000 leaves the account. This is a real failure mode in production support agent deployments. The mitigation is a rate-limiting check (max 5 resolutions per account per 24 hours) built into the scope constraints, not added as an afterthought.

Sales development agent

Available tools: Web search, LinkedIn reader, CRM read/write, email composer, calendar task creator.

Goal: "Research the 20 companies on this target list, score each against our ICP criteria, draft personalized outreach for those above threshold, add them to the CRM, and schedule follow-up tasks."

Loop mechanics: For each company, the agent searches for firmographic data (Ingest), scores against ICP criteria (Analyze + Predict), drafts personalized outreach for those above threshold (Generate), creates or updates the CRM record (Execute), creates a follow-up task (Execute). The loop repeats for all 20 companies.

What done looks like: CRM updated with 20 accounts scored and triaged. Qualified accounts have draft outreach waiting for rep review. Tasks scheduled. Research summary attached to each record.

What failure looks like: The agent researches a company and finds a recent acquisition announcement. The company has been bought by a competitor. The agent still scores the company as a high-fit prospect and drafts outreach addressed to the original CEO, who is now at the acquirer. The rep sends the AI-drafted email without checking. Embarrassment at minimum, reputational damage if the acquirer notices.

The correct control: the agent flags "ownership change detected" as a condition that pauses the loop and surfaces to a human for review, rather than proceeding automatically.

Coding agent

Available tools: File system reader/writer, test runner, code linter, GitHub pull request creator.

Goal: "Fix the failing test in the checkout module. The test is checkout_test.go:line 78. Don't break other tests."

Loop mechanics: Agent reads the failing test to understand what it expects (Ingest), reads the relevant source code (Ingest), analyzes the gap between expected and actual behavior (Analyze), proposes a code change (Generate), writes the change to the file (Execute), runs the test suite (Execute + Ingest), reads the new test output (Analyze), decides whether the fix worked or requires revision (Predict). Loops until tests pass or maximum revision attempts are reached.

What done looks like: Test passes. No regression in other tests. PR opened for human review before merge.

What failure looks like: The agent's fix makes the originally-failing test pass but introduces a subtle regression in the payment flow tests, which are in a separate module it didn't check. If the agent has permission to auto-merge on green tests, and the test suite doesn't cover the payment regression, the change goes to production.

The correct control: auto-merge is not in scope. The agent opens a PR. A human reviews and merges. The agent handles the iterative code-fix loop. The human makes the deployment decision.

The Audit-Or-Block Rule

Every Autonomous Agent deployment must implement two non-negotiable controls before the first production run: an audit trail that logs every Ingest, Analyze, Predict, Generate, and Execute step with timestamps and the agent's stated reasoning, and a block condition that terminates the loop and escalates to a human when confidence drops below a defined threshold or when a high-stakes irreversible action is pending. The Audit-Or-Block Rule states that if an agent cannot produce a complete decision trace (audit) for any action it has taken, it should not take that action autonomously (block). These two controls convert a potentially uncontrollable autonomous loop into a supervised system where every mistake is diagnosable and most mistakes are preventable. Agents deployed without both controls should be classified as experimental, not production.

Why autonomous agents are the highest-risk pattern

Every other pattern in the ACE Framework runs one Execute step at most. Autonomous agents run multiple Execute steps in a loop. Each step is a potential incident.

The risk compounds in ways that matter:

An early Analyze error (misreading the context in loop iteration 1) produces a Generate error (wrong next action). That wrong action becomes an Execute step that changes state in the real world. The next loop iteration now starts from a corrupted state. The agent's subsequent actions are all optimizing from a wrong baseline. By the time a human reviews the output or receives an alert, the damage is multi-step and inter-dependent.

This compounding dynamic is why all governance concerns in the ACE Framework peak at the Autonomous Agent pattern. Audit trails, scope constraints, rate limiting, rollback capability, and human checkpoints aren't bureaucratic overhead. They're the architectural requirements that make the pattern deployable in systems that matter.

Gartner's 2025 AI governance research found that enterprises running autonomous agents without scope constraints are 8x more likely to experience a significant AI incident (defined as causing measurable financial, reputational, or customer harm) than enterprises that implement the full governance stack before production launch. Anthropic's Responsible Scaling Policy identifies intermediate levels of model autonomy as critical checkpoints requiring additional evaluation and stronger safeguards, precisely the design principle behind the governance tiers in this framework. Governance requirements by AI pattern provides the full specification for each tier.

Failure modes and mitigations

Goal misspecification. The most common failure. The human gave the agent a goal that was clear to the human but ambiguous to the system. "Close this support ticket" means resolve the customer's issue to a human, but means "set ticket status to closed" to an agent without explicit context about resolution quality. Fix: write goals as outcome descriptions with explicit completion criteria. Not "close the ticket" but "close the ticket only after confirming the customer's original issue is resolved, with evidence from the payment system confirming the refund was issued." Use structured goal templates where possible.

Hallucinated tool calls. The agent calls a tool that doesn't exist, uses a tool with wrong parameter types, or interprets a tool's capabilities beyond what it can actually do. In production deployments, this surfaces as API errors that the agent doesn't know how to handle. Fix: maintain a strict tool registry with explicit schema descriptions for every tool. Test the agent against each tool in isolation before deploying the full loop. Build an error-handling branch that surfaces unexpected tool failures to a human rather than letting the agent retry indefinitely.

Infinite loops. The agent pursues a goal that is unachievable with the available tools, and retries in a loop rather than recognizing the dead end. A search agent asked to find internal documents that don't exist will keep reformulating search queries without converging. Fix: hard step ceiling with mandatory escalation. If the agent hasn't achieved measurable progress toward the goal within N steps, the run terminates and the work is handed to a human with a summary of what the agent attempted. Set N conservatively based on the task complexity.

Scope creep. The agent takes actions outside the intended scope because they appeared helpful toward the goal. A research agent given access to a file writer might decide to create a "better organized" version of existing research files on the way to completing its primary task. It seemed efficient. The user didn't authorize it. Fix: explicit scope constraints as part of every agent configuration. Authorized tools. Authorized action types within each tool. No implicit permission to act on adjacent tasks. Scope violations should terminate the run and alert the configuring user, not proceed.

Cascading errors. An early wrong step corrupts the state that all later steps depend on. The agent researches a company and identifies the wrong subsidiary. Every downstream action (outreach drafted, CRM record created, follow-up scheduled) is now for the wrong entity. Fix: build verification checkpoints for state-changing actions. Before writing a CRM record, confirm the company match against at least two sources. Before executing an irreversible action (sending email, issuing refund), log the reasoning trace and flag for human review if confidence is below threshold.

Permission escalation. The agent requests access to additional tools or data sources not in its original scope because the current tools are insufficient to achieve the goal. In poorly configured systems, an agent might successfully acquire these permissions. Fix: tools available to an agent are static and reviewed before deployment. No runtime permission expansion. If the agent needs additional tools, the run should terminate with an "insufficient tools" signal and a human makes the configuration decision.

When to choose Autonomous Agent vs. alternatives

Most tasks that feel like autonomous agent problems are actually simpler patterns in disguise. This question is worth asking honestly before committing to the complexity and governance investment.

When Workflow Copilot is enough: If a human can be in the loop at each significant decision point without unacceptable delay, use Workflow Copilot instead. Copilot is faster to deploy, easier to govern, and has a much lower failure surface. The user stays accountable. The AI provides leverage without eliminating human judgment from the loop.

When Scoring + Routing is enough: If the task has one decision point (triage an inbound item and route it), not many, Scoring + Routing handles it. Many "agent" use cases for customer support are actually Scoring + Routing patterns: classify the ticket, assign it to the right queue, surface the relevant knowledge base articles. That's three capability steps, not a goal-directed loop.

When Generative Research is enough: If the output is a document rather than a series of actions, Generative Research is the correct pattern. Multi-source synthesis into a report doesn't require Execute steps at each loop iteration. It requires Ingest from many sources, Analyze across them, and Generate for the output.

The signal that you genuinely need Autonomous Agent: the goal requires more than 3 sequential Execute steps, and human approval at each step is not operationally practical, and the task has genuine conditional branching where the path depends on what earlier steps produce.

Human-in-the-loop design at agent level

Checkpoints are not a concession to caution. They're an architectural requirement for any autonomous agent that touches customer-facing systems, irreversible actions, or high-value decisions.

What good checkpoint design looks like:

Pre-run review: Before the agent starts, a human reviews the goal specification, the authorized tools, and the scope constraints. This is the moment to catch misspecified goals before any actions are taken.

Mid-run gates for high-stakes Execute: Define categories of action that pause the loop and surface to a human before proceeding. Sending customer-facing communications. Issuing financial transactions above a threshold. Deleting records. Updating records that affect active deals. The loop continues after approval; it doesn't restart.

Confidence-floor handoff: When the agent's confidence in its next action drops below a defined threshold (for example, conflicting signals from two sources that can't be reconciled automatically), the run pauses and the agent writes a handoff note: "I've gotten this far, here's what I found, here's why I'm uncertain, here's what you need to decide." The human resolves the uncertainty and the agent can continue or the human completes the task.

Post-run audit: Every autonomous agent run should produce a full decision trace: what the agent ingested at each step, what it analyzed, what it generated, what it executed, with timestamps. That trace is the only way to understand what happened when something goes wrong. Minimum 90-day retention. Human-accessible audit interface.

The governance requirement is not optional. Any autonomous agent deployed without audit trails, scope constraints, and escalation paths is a liability waiting to surface. The audit infrastructure is part of the deployment, not an enhancement added later. NIST's AI Risk Management Framework identifies governance, mapping, measuring, and managing as the four core functions of responsible AI deployment, all of which apply at every checkpoint in an autonomous agent's execution loop.

ROI signals

Metric What it tells you
Task completion rate vs. human baseline Does the agent complete the task end-to-end at the same quality level as a human would?
Scope-adherence rate What percentage of runs stay within the authorized tool scope and action scope?
Error-to-escalation ratio Of the errors the agent makes, what percentage are caught by the escalation mechanisms before causing external impact?
Hours of human effort displaced per week Net time saved. For this to be positive, account for the time spent reviewing agent runs and managing escalations.
Average loop iterations per completed task A rising count on a stable goal type suggests the agent is becoming less efficient, possibly due to context drift or tool degradation.
Irreversible-action error rate How often does the agent take an irreversible action that turns out to be wrong? This should be near zero and is the single most important safety metric.

What comes next

The Autonomous Agent pattern is the gateway to Level 3 AI Agents, the role-level workflows that cover an entire job function rather than a single task. An AI Support Agent isn't a single autonomous agent instance. It's a cluster of patterns: RAG Assistant for policy lookup, Scoring + Routing for triage, Anomaly Agent for fraud detection, Workflow Copilot for human-agent assist on complex tickets. The autonomous loop handles the structured resolution cases; the other patterns handle the rest.

Understanding how to combine patterns at this level is the next step. Stacking Patterns to Build AI Agents covers the combination logic and walks through a worked example of an AI Sales Operator built from four patterns.

The governance requirements that apply most intensely to Autonomous Agent apply to all complex pattern stacks. The governance requirements article covers the audit trail, scope constraint, and approval gate specifications in operational detail.


Rework Analysis: The autonomous agent deployments that fail fastest are the ones where "deploy" and "govern" were treated as sequential steps. Deploy the agent, see what happens, add governance later. But governance for autonomous agents is not an add-on. It's the infrastructure that makes the agent safe to run. Scope constraints, audit trails, and escalation conditions must exist before the first production loop. They cannot be retrofitted after the first serious incident without rebuilding trust in the entire program. The teams that get autonomous agents right treat the governance design phase as the most important engineering work in the project, spend more time specifying what the agent is not allowed to do than what it is allowed to do, and deploy with a conservative step ceiling that they raise only as production data accumulates. The 10% of organizations that successfully scale agentic AI are not more technically sophisticated than the other 90%. They are more disciplined about governance before launch.

Frequently Asked Questions

What is an Autonomous Agent AI pattern?

An Autonomous Agent is an AI pattern that uses all five ACE capabilities in a loop to pursue a multi-step goal with tool use, conditional decisions, and backtracking. The formula cycles: Ingest (current state plus available tools), Analyze (gap analysis), Predict (most likely next action), Generate (action plan), Execute (take the action, update state), repeat until goal is met or maximum steps are reached. It differs from all other patterns in that Execute fires multiple times per run, and each Execute step potentially changes external state.

What is the Audit-Or-Block Rule?

The Audit-Or-Block Rule states that every autonomous agent must implement two non-negotiable controls: an audit trail logging every capability step with timestamps and stated reasoning, and a block condition that terminates the loop and escalates to a human when confidence drops below threshold or when a high-stakes irreversible action is pending. If an agent cannot produce a complete decision trace for any action, it should not take that action autonomously. These two controls convert an uncontrollable loop into a supervised system where mistakes are diagnosable and most are preventable.

Why are autonomous agents considered the highest-risk AI pattern?

Because Execute fires multiple times per run in a loop, and errors compound across steps. An early Analyze error produces a wrong Generate output, which becomes an Execute step that corrupts state. All subsequent loop iterations optimize from a wrong baseline. By the time a human reviews the output, the damage is multi-step and interdependent. McKinsey found 80% of organizations have encountered risky agent behavior, nearly all tracing to Execute steps in loops without adequate validation. Gartner found enterprises without scope constraints are 8x more likely to experience a significant AI incident.

What governance controls are required for autonomous agents?

Four controls are required before production launch: pre-run review (human reviews goal specification, authorized tools, and scope constraints before the first run), mid-run gates for high-stakes Execute steps (loop pauses before sending customer-facing communications, issuing financial transactions, or deleting records), confidence-floor handoff (loop pauses when agent confidence drops below threshold and produces a handoff note), and post-run audit (full decision trace with minimum 90-day retention). Organizations that implement all four reduce irreversible-action error rates by 73% versus agents without these checkpoints (Anthropic, 2025).

When should you use Autonomous Agent instead of Workflow Copilot?

Use Autonomous Agent only when the goal requires more than three sequential Execute steps, human approval at each step is operationally impractical, and the task has genuine conditional branching where the path depends on what earlier steps produce. If a human can be in the loop at each significant decision point without unacceptable delay, Workflow Copilot is safer, faster to deploy, and has a much lower failure surface. Most tasks that feel like autonomous agent problems are actually simpler patterns: Scoring plus Routing for single-decision triage, Generative Research for multi-source synthesis, Workflow Copilot for judgment-required knowledge work.

What is the most common autonomous agent failure mode?

Goal misspecification is the most common failure. The human's intent was clear to the human but ambiguous to the system. "Close this ticket" means "confirm the issue is resolved" to a human, but can mean "set status to closed" to an agent. The mitigation is writing goals as outcome descriptions with explicit completion criteria: "close the ticket only after confirming the customer's original issue is resolved, with evidence from the payment system confirming the refund was issued." Structured goal templates that require named completion conditions and scope boundaries reduce goal misspecification dramatically.

Learn more