Reasoning WithOut Observation
Reasoning WithOut Observation (ReWOO), proposed by Xu et al. (https://arxiv.org/abs/2305.18323), is a framework that combines a multi-step planner and variable substitution for effective tool use. It aims to improve upon ReAct-style agents by reducing token consumption and execution time by generating the full chain of tool usage in a single pass, thus minimizing redundant LLM calls. It also aims to simplify the fine-tuning process by enabling fine-tuning without actually invoking tools as planning data doesn’t depend on tool outputs (in theory).
ReAct operates through a cyclical “think-act-observe” pattern, where an AI engages in reasoning, performs an action, examines the resulting feedback, and then adjusts its subsequent actions accordingly, facilitating a dynamic and responsive problem-solving strategy. In contrast, ReWOO emphasizes comprehensive upfront planning, generating a complete sequence of actions before any execution...