用 LangGraph 构建智能对话系统：基于消息序列的 RAG 状态管理实践

最新推荐文章于 2025-06-25 14:43:38 发布

佑瞻

最新推荐文章于 2025-06-25 14:43:38 发布

阅读量1k

点赞数 33

CC 4.0 BY-SA版权

分类专栏： LangChain 文章标签： python langchain langgraph

本文链接：https://blog.csdn.net/The_Thieves/article/details/147318831

LangChain 专栏收录该内容

92 篇文章

订阅专栏

在构建基于检索增强生成（RAG）的对话系统时，如何高效管理对话状态并整合工具调用，一直是开发者关注的核心问题。LangGraph 框架通过消息序列模型为我们提供了一套优雅的解决方案，让我们一起来看看如何利用它构建灵活的状态管理体系。

一、消息序列：定义 RAG 应用的状态语言

在传统 RAG 系统中，对话状态往往分散在不同模块中，难以统一管理。LangGraph 提出了一种标准化的消息序列模型，将整个交互过程拆解为四种核心消息类型：

HumanMessage：用户输入的原始查询，是对话的起点
AIMessage：包含工具调用的模型生成消息，用于触发检索等操作
ToolMessage：工具执行后返回的结果，携带检索到的文档内容
SystemMessage：用于设定模型行为的系统指令，贯穿对话始终

这种模型的优势在于将复杂的交互流程转化为可追溯的消息链。例如当用户连续提问时：

python

Human: "什么是任务分解？"
AI: "任务分解涉及...管理。"
Human: "常见做法有哪些？"

模型可以基于历史消息生成更精准的检索查询 "common approaches to task decomposition"，这种上下文感知能力在对话场景中至关重要。

二、工具调用：让模型自主驱动检索流程

我们通过@tool装饰器将检索功能转化为可调用工具：

python

from langchain_core.tools import tool

@tool(response_format="content_and_artifact")
def retrieve(query: str):
    """Retrieve information related to a query."""
    retrieved_docs = vector_store.similarity_search(query, k=2)
    serialized = "\n\n".join(
        (f"Source: {doc.metadata}\n" f"Content: {doc.page_content}")
        for doc in retrieved_docs
    )
    return serialized, retrieved_docs

这里response_format="content_and_artifact"非常关键，它让工具返回包含两部分内容：

content：供模型直接使用的序列化文档内容
artifact：保留原始文档对象，方便后续扩展处理

这种设计让模型能够像人类一样自主决定何时需要检索信息，比如遇到需要专业知识的问题时触发检索，而面对简单问候则直接回应，极大提升了系统的灵活性。

三、节点构建：搭建对话流程的逻辑单元

整个对话流程被拆解为三个核心节点：

1. 输入处理节点（query_or_respond）

python

def query_or_respond(state: MessagesState):
    """Generate tool call for retrieval or respond."""
    llm_with_tools = llm.bind_tools([retrieve])
    response = llm_with_tools.invoke(state["messages"])
    return {"messages": [response]}

这个节点就像对话的 "大脑"，它做两件事：

分析用户输入，决定是直接生成回答还是触发检索工具
通过llm.bind_tools将检索工具注入模型，让模型具备调用能力

2. 工具执行节点（ToolNode）

python

tools = ToolNode([retrieve])

这是一个预构建的通用节点，专门负责执行工具调用，并将返回结果包装为ToolMessage添加到消息序列中。当模型决定需要检索时，这个节点就会被激活，完成从查询生成到结果获取的整个流程。

3. 回答生成节点（generate）

python

def generate(state: MessagesState):
    """Generate answer using retrieved content."""
    # 提取最新的工具消息
    recent_tool_messages = []
    for message in reversed(state["messages"]):
        if message.type == "tool":
            recent_tool_messages.append(message)
        else:
            break
    tool_messages = recent_tool_messages[::-1]  # 恢复消息顺序
    
    # 构建提示信息
    docs_content = "\n\n".join(doc.content for doc in tool_messages)
    system_prompt = f"""你是问答助手，使用以下检索内容回答问题。
    不知道答案就说不知道，最多三句话，保持简洁：
    \n\n{docs_content}
    """
    
    # 过滤有效对话历史
    conversation_messages = [
        msg for msg in state["messages"]
        if msg.type in ("human", "system") or 
           (msg.type == "ai" and not msg.tool_calls)
    ]
    
    # 生成最终回答
    response = llm.invoke([SystemMessage(system_prompt)] + conversation_messages)
    return {"messages": [response]}

这个节点的核心逻辑是：

从消息序列末尾向前提取最近的工具消息（确保优先使用最新检索结果）
过滤掉包含工具调用的 AI 消息，只保留有效对话历史
将系统指令、检索内容和对话历史组合成完整提示，让模型生成最终回答

四、图结构：将节点编织成智能流程

通过StateGraph我们将三个节点编织成可执行的对话流程：

python

from langgraph.graph import END
from langgraph.prebuilt import tools_condition

graph_builder = StateGraph(MessagesState)
graph_builder.add_node(query_or_respond)
graph_builder.add_node(tools)
graph_builder.add_node(generate)

# 设置入口节点
graph_builder.set_entry_point("query_or_respond")

# 添加条件边：根据是否有工具调用决定流向
graph_builder.add_conditional_edges(
    "query_or_respond",
    tools_condition,  # 内置条件函数，判断是否需要工具调用
    {END: END, "tools": "tools"}  # 无工具调用则结束，有则进入工具节点
)

# 连接工具节点和回答节点
graph_builder.add_edge("tools", "generate")
graph_builder.add_edge("generate", END)

graph = graph_builder.compile()

这里的tools_condition是关键的流程控制器，它会检查最新的 AI 消息是否包含工具调用：