Agents

ReAct Pattern

ReAct (Reasoning + Acting) is a prompting framework for AI agents that interleaves chain-of-thought reasoning steps with discrete tool-use actions, allowing the model to observe each action's result and update its reasoning before proceeding.

ReAct is an agent architecture that alternates between two operation types: Reasoning (generating a natural-language thought about the current situation and what to do next) and Acting (executing a concrete operation such as a web search, database query, or code execution). The framework was introduced by Yao et al. in a 2022 paper, 'ReAct: Synergizing Reasoning and Acting in Language Models,' presented at ICLR 2023, and has since become one of the most widely adopted templates for constructing AI agents.

In a ReAct loop, the model produces a 'Thought' — its internal reasoning about the task state — followed by an 'Action' that invokes an external tool. The environment returns an 'Observation,' which is appended to the context window. The model then generates the next thought, informed by that observation, and the cycle repeats until the model judges the task complete. This interleaved structure allows the agent to correct course dynamically rather than committing upfront to a static plan that cannot incorporate new information.

The pattern addresses a core limitation of static chain-of-thought prompting: the model cannot acquire new external information mid-response. By coupling reasoning to real tool calls, ReAct enables tasks that require live data retrieval, interactive computation, or multi-step navigation through an external environment. The thought-action-observation trace is also human-readable, making it substantially easier to debug agent failures than opaque tool-calling pipelines where internal reasoning is hidden.

By 2026, ReAct-style loops underpin most production agent frameworks, including LangChain and LangGraph agents, Anthropic's Claude tool-use API, and OpenAI's Assistants API with code interpreter and file search. Extensions such as Reflexion add a self-critique and memory update step after task completion. Multi-agent variants allow one ReAct agent to spawn specialized sub-agents as its 'Actions.' The primary practical limitations are latency — each reasoning-acting cycle adds at least one full inference round trip — and context-window growth as traces accumulate over long tasks.

Example

A customer support agent receives a question about order status: it generates a Thought ('I need to retrieve order #5892 from the database'), calls the order lookup tool as its Action, observes the result ('shipped 2026-06-30, expected delivery 2026-07-03'), generates a new Thought ('I have sufficient information to answer'), and replies to the customer — all within a single automated ReAct loop requiring no human intervention.

Related terms

← Glossary