Model Context Protocol (MCP) 是一个开放协议,用于标准化应用向 LLM 提供工具和上下文的方式。LangChain 代理可以使用 langchain-mcp-adapters 库调用 MCP 服务器上定义的工具。

快速开始

安装 langchain-mcp-adapters 库:
pip install langchain-mcp-adapters
langchain-mcp-adapters 使代理能够使用一个或多个 MCP 服务器中定义的工具。
MultiServerMCPClient 默认是无状态的。每次工具调用都会创建新的 MCP ClientSession,执行工具,然后清理资源。更多详情请参阅 stateful sessions 部分。
访问多个 MCP 服务器
import asyncio
from langchain_mcp_adapters.client import MultiServerMCPClient  
from langchain.agents import create_agent

async def main():
    client = MultiServerMCPClient(
        {
            "math": {
                "transport": "stdio",  # Local subprocess communication
                "command": "python",
                # Absolute path to your math_server.py file
                "args": ["/path/to/math_server.py"],
            },
            "weather": {
                "transport": "http",  # HTTP-based remote server
                # Ensure you start your weather server on port 8000
                "url": "http://localhost:8000/mcp",
            }
        }
    )

    tools = await client.get_tools()
    agent = create_agent(
        "claude-sonnet-4-6",
        tools  
    )
    math_response = await agent.ainvoke(
        {"messages": [{"role": "user", "content": "what's (3 + 5) x 12?"}]}
    )
    weather_response = await agent.ainvoke(
        {"messages": [{"role": "user", "content": "what is the weather in nyc?"}]}
    )
    print(math_response)
    print(weather_response)

if __name__ == "__main__":
    asyncio.run(main())
使用 LangSmith 将 MCP 工具调用与代理的推理步骤一起追踪。按照 tracing quickstart 完成设置。

自定义服务器

如需创建自定义 MCP 服务器,请使用 FastMCP 库:
pip install fastmcp
如需使用 MCP 工具服务器测试代理,请使用以下示例:
from fastmcp import FastMCP

mcp = FastMCP("Math")

@mcp.tool()
def add(a: int, b: int) -> int:
    """Add two numbers"""
    return a + b

@mcp.tool()
def multiply(a: int, b: int) -> int:
    """Multiply two numbers"""
    return a * b

if __name__ == "__main__":
    mcp.run(transport="stdio")

传输方式

MCP 支持用于客户端和服务器通信的不同传输机制。

HTTP

http 传输方式(也称为 streamable-http)使用 HTTP 请求进行客户端和服务器通信。更多详情请参阅 MCP HTTP transport specification
client = MultiServerMCPClient(
    {
        "weather": {
            "transport": "http",
            "url": "http://localhost:8000/mcp",
        }
    }
)

传递 headers

通过 HTTP 连接到 MCP 服务器时,可以在连接配置中使用 headers 字段包含自定义 headers,例如用于认证或追踪。sse(已被 MCP 规范弃用)和 streamable_http 传输都支持此能力。
使用 MultiServerMCPClient 传递 headers
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain.agents import create_agent

client = MultiServerMCPClient(
    {
        "weather": {
            "transport": "http",
            "url": "http://localhost:8000/mcp",
            "headers": {
                "Authorization": "Bearer YOUR_TOKEN",
                "X-Custom-Header": "custom-value"
            },
        }
    }
)
tools = await client.get_tools()
agent = create_agent("openai:gpt-5.4", tools)
response = await agent.ainvoke({"messages": "what is the weather in nyc?"})

认证

langchain-mcp-adapters 库底层使用官方 MCP SDK,因此你可以通过实现 httpx.Auth 接口来提供自定义认证机制。
from langchain_mcp_adapters.client import MultiServerMCPClient

client = MultiServerMCPClient(
    {
        "weather": {
            "transport": "http",
            "url": "http://localhost:8000/mcp",
            "auth": auth,
        }
    }
)

stdio

客户端将服务器作为子进程启动,并通过标准输入/输出通信。该方式最适合本地工具和简单设置。
与 HTTP 传输不同,stdio 连接本质上是有状态的:子进程会在客户端连接生命周期内持续存在。但是,如果使用 MultiServerMCPClient 且没有显式会话管理,每次工具调用仍会创建新会话。如需管理持久连接,请参阅 stateful sessions
client = MultiServerMCPClient(
    {
        "math": {
            "transport": "stdio",
            "command": "python",
            "args": ["/path/to/math_server.py"],
        }
    }
)

有状态会话

默认情况下,MultiServerMCPClient无状态的:每次工具调用都会创建新的 MCP 会话,执行工具,然后清理资源。 如果需要控制 MCP 会话的 lifecycle,例如使用会在工具调用之间维护上下文的有状态服务器,可以使用 client.session() 创建持久 ClientSession
使用 MCP ClientSession 进行有状态工具调用
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.tools import load_mcp_tools
from langchain.agents import create_agent

client = MultiServerMCPClient({...})

# Create a session explicitly
async with client.session("server_name") as session:
    # Pass the session to load tools, resources, or prompts
    tools = await load_mcp_tools(session)
    agent = create_agent(
        "google_genai:gemini-3.5-flash",
        tools
    )

核心功能

Tools

Tools 允许 MCP 服务器暴露可执行函数,LLM 可以调用这些函数来执行操作,例如查询数据库、调用 API 或与外部系统交互。LangChain 会将 MCP tools 转换为 LangChain tools,使其可直接用于任何 LangChain 代理或工作流。

加载 tools

使用 client.get_tools() 从 MCP 服务器获取 tools,并将其传给代理:
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain.agents import create_agent

client = MultiServerMCPClient({...})
tools = await client.get_tools()
agent = create_agent("claude-sonnet-4-6", tools)

结构化内容

MCP tools 可以在面向人类可读的文本响应之外返回 structured content。当工具需要返回机器可解析数据(例如 JSON),并同时向模型展示文本时,这很有用。 当 MCP tool 返回 structuredContent 时,adapter 会将其包装为 MCPToolArtifact,并作为工具的 artifact 返回。你可以通过 ToolMessage 上的 artifact 字段访问它。也可以使用 interceptors 自动处理或转换结构化内容。 从 artifact 中提取结构化内容 调用代理后,可以从响应中的工具消息访问结构化内容:
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain.agents import create_agent
from langchain.messages import ToolMessage

client = MultiServerMCPClient({...})
tools = await client.get_tools()
agent = create_agent("claude-sonnet-4-6", tools)

result = await agent.ainvoke(
    {"messages": [{"role": "user", "content": "Get data from the server"}]}
)

# Extract structured content from tool messages
for message in result["messages"]:
    if isinstance(message, ToolMessage) and message.artifact:
        structured_content = message.artifact["structured_content"]
通过 interceptor 追加结构化内容 如果希望结构化内容显示在对话历史中(即对模型可见),可以使用 interceptor 自动将结构化内容追加到工具结果中:
import json

from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.interceptors import MCPToolCallRequest
from mcp.types import TextContent

async def append_structured_content(request: MCPToolCallRequest, handler):
    """Append structured content from artifact to tool message."""
    result = await handler(request)
    if result.structuredContent:
        result.content += [
            TextContent(type="text", text=json.dumps(result.structuredContent)),
        ]
    return result

client = MultiServerMCPClient({...}, tool_interceptors=[append_structured_content])

多模态工具内容

MCP tools 可以在响应中返回 multimodal content,例如图像和文本。当 MCP 服务器返回包含多个部分的内容(例如文本和图像)时,adapter 会将其转换为 LangChain 的 standard content blocks。你可以通过 ToolMessage 上的 content_blocks 属性访问标准化表示:
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain.agents import create_agent

client = MultiServerMCPClient({...})
tools = await client.get_tools()
agent = create_agent("claude-sonnet-4-6", tools)

result = await agent.ainvoke(
    {"messages": [{"role": "user", "content": "Take a screenshot of the current page"}]}
)

# Access multimodal content from tool messages
for message in result["messages"]:
    if message.type == "tool":
        # Raw content in provider-native format
        print(f"Raw content: {message.content}")

        # Standardized content blocks  #
        for block in message.content_blocks:
            if block["type"] == "text":
                print(f"Text: {block['text']}")
            elif block["type"] == "image":
                print(f"Image URL: {block.get('url')}")
                print(f"Image base64: {block.get('base64', '')[:50]}...")
这使你能够以 provider 无关的方式处理多模态工具响应,无论底层 MCP 服务器如何格式化其内容。

Resources

Resources 允许 MCP 服务器暴露客户端可读取的数据,例如文件、数据库记录或 API 响应。LangChain 会将 MCP resources 转换为 Blob 对象,该对象为处理文本和二进制内容提供统一接口。

加载 resources

使用 client.get_resources() 从 MCP 服务器加载 resources:
from langchain_mcp_adapters.client import MultiServerMCPClient

client = MultiServerMCPClient({...})

# Load all resources from a server
blobs = await client.get_resources("server_name")

# Or load specific resources by URI
blobs = await client.get_resources("server_name", uris=["file:///path/to/file.txt"])

for blob in blobs:
    print(f"URI: {blob.metadata['uri']}, MIME type: {blob.mimetype}")
    print(blob.as_string())  # For text content
也可以直接结合 session 使用 load_mcp_resources,以获得更多控制:
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.resources import load_mcp_resources

client = MultiServerMCPClient({...})

async with client.session("server_name") as session:
    # Load all resources
    blobs = await load_mcp_resources(session)

    # Or load specific resources by URI
    blobs = await load_mcp_resources(session, uris=["file:///path/to/file.txt"])

Prompts

Prompts 允许 MCP 服务器暴露可由客户端获取和使用的可复用 prompt 模板。LangChain 会将 MCP prompts 转换为 messages,便于集成到基于聊天的工作流中。

加载 prompts

使用 client.get_prompt() 从 MCP 服务器加载 prompt:
from langchain_mcp_adapters.client import MultiServerMCPClient

client = MultiServerMCPClient({...})

# Load a prompt by name
messages = await client.get_prompt("server_name", "summarize")

# Load a prompt with arguments
messages = await client.get_prompt(
    "server_name",
    "code_review",
    arguments={"language": "python", "focus": "security"}
)

# Use the messages in your workflow
for message in messages:
    print(f"{message.type}: {message.content}")
也可以直接结合 session 使用 load_mcp_prompt,以获得更多控制:
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.prompts import load_mcp_prompt

client = MultiServerMCPClient({...})

async with client.session("server_name") as session:
    # Load a prompt by name
    messages = await load_mcp_prompt(session, "summarize")

    # Load a prompt with arguments
    messages = await load_mcp_prompt(
        session,
        "code_review",
        arguments={"language": "python", "focus": "security"}
    )

高级功能

Tool interceptors

MCP 服务器作为独立进程运行,无法访问 LangGraph runtime 信息,例如 storecontext 或代理状态。Interceptors 会弥合这一差距,让你在 MCP 工具执行期间访问这些 runtime context。 Interceptors 还提供类似 middleware 的工具调用控制能力:可以修改请求、实现重试、动态添加 headers,或完全短路执行。
SectionDescription
Accessing runtime context读取用户 ID、API keys、store 数据和代理状态
State updates and commands使用 Command 更新代理状态或控制图流
Writing interceptors修改请求、组合 interceptors 和错误处理的模式

访问 runtime context

在 LangChain 代理中(通过 create_agent)使用 MCP tools 时,interceptors 可以访问 ToolRuntime context。这会提供 tool call ID、state、config 和 store 访问能力,从而支持访问用户数据、持久化信息和控制代理行为等强大模式。
访问调用时传入的用户专属配置,例如用户 ID、API keys 或权限:
Inject user context into MCP tool calls
from dataclasses import dataclass
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.interceptors import MCPToolCallRequest
from langchain.agents import create_agent

@dataclass
class Context:
    user_id: str
    api_key: str

async def inject_user_context(
    request: MCPToolCallRequest,
    handler,
):
    """Inject user credentials into MCP tool calls."""
    runtime = request.runtime
    user_id = runtime.context.user_id  
    api_key = runtime.context.api_key  

    # Add user context to tool arguments
    modified_request = request.override(
        args={**request.args, "user_id": user_id}
    )
    return await handler(modified_request)

client = MultiServerMCPClient(
    {...},
    tool_interceptors=[inject_user_context],
)
tools = await client.get_tools()
agent = create_agent("gpt-5.4", tools, context_schema=Context)

# Invoke with user context
result = await agent.ainvoke(
    {"messages": [{"role": "user", "content": "Search my orders"}]},
    context={"user_id": "user_123", "api_key": "sk-..."}
)
如需更多 context engineering 模式,请参阅 Context engineeringTools

状态更新和 commands

Interceptors 可以返回 Command 对象来更新代理状态或控制图执行流。这对于追踪任务进度、在代理之间切换或提前结束执行很有用。
将任务标记为完成并切换代理
from langchain.agents import AgentState, create_agent
from langchain_mcp_adapters.interceptors import MCPToolCallRequest
from langchain.messages import ToolMessage
from langgraph.types import Command

async def handle_task_completion(
    request: MCPToolCallRequest,
    handler,
):
    """Mark task complete and hand off to summary agent."""
    result = await handler(request)

    if request.name == "submit_order":
        return Command(
            update={
                "messages": [result] if isinstance(result, ToolMessage) else [],
                "task_status": "completed",
            },
            goto="summary_agent",
        )

    return result
使用带有 goto="__end__"Command 可以提前结束执行:
完成后结束代理运行
async def end_on_success(
    request: MCPToolCallRequest,
    handler,
):
    """End agent run when task is marked complete."""
    result = await handler(request)

    if request.name == "mark_complete":
        return Command(
            update={"messages": [result], "status": "done"},
            goto="__end__",
        )

    return result

自定义 interceptors

Interceptors 是包装工具执行的异步函数,可用于修改请求/响应、实现重试逻辑,以及处理其他横切关注点。它们遵循“洋葱”模式,列表中的第一个 interceptor 是最外层。 基础模式 Interceptor 是一个接收 request 和 handler 的异步函数。你可以在调用 handler 前修改请求,在调用后修改响应,或完全跳过 handler。
基础 interceptor 模式
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.interceptors import MCPToolCallRequest

async def logging_interceptor(
    request: MCPToolCallRequest,
    handler,
):
    """Log tool calls before and after execution."""
    print(f"Calling tool: {request.name} with args: {request.args}")
    result = await handler(request)
    print(f"Tool {request.name} returned: {result}")
    return result

client = MultiServerMCPClient(
    {"math": {"transport": "stdio", "command": "python", "args": ["/path/to/server.py"]}},
    tool_interceptors=[logging_interceptor],
)
修改请求 使用 request.override() 创建修改后的请求。这遵循不可变模式,不会改变原始请求。
修改工具参数
async def double_args_interceptor(
    request: MCPToolCallRequest,
    handler,
):
    """Double all numeric arguments before execution."""
    modified_args = {k: v * 2 for k, v in request.args.items()}
    modified_request = request.override(args=modified_args)
    return await handler(modified_request)

# Original call: add(a=2, b=3) becomes add(a=4, b=6)
在 runtime 修改 headers Interceptors 可以根据请求上下文动态修改 HTTP headers:
动态修改 header
async def auth_header_interceptor(
    request: MCPToolCallRequest,
    handler,
):
    """Add authentication headers based on the tool being called."""
    token = get_token_for_tool(request.name)
    modified_request = request.override(
        headers={"Authorization": f"Bearer {token}"}
    )
    return await handler(modified_request)
组合 interceptors 多个 interceptors 会按“洋葱”顺序组合,列表中的第一个 interceptor 是最外层:
组合多个 interceptors
async def outer_interceptor(request, handler):
    print("outer: before")
    result = await handler(request)
    print("outer: after")
    return result

async def inner_interceptor(request, handler):
    print("inner: before")
    result = await handler(request)
    print("inner: after")
    return result

client = MultiServerMCPClient(
    {...},
    tool_interceptors=[outer_interceptor, inner_interceptor],
)

# Execution order:
# outer: before -> inner: before -> tool execution -> inner: after -> outer: after
错误处理 使用 interceptors 捕获工具执行错误并实现重试逻辑:
出错时重试
import asyncio

async def retry_interceptor(
    request: MCPToolCallRequest,
    handler,
    max_retries: int = 3,
    delay: float = 1.0,
):
    """Retry failed tool calls with exponential backoff."""
    last_error = None
    for attempt in range(max_retries):
        try:
            return await handler(request)
        except Exception as e:
            last_error = e
            if attempt < max_retries - 1:
                wait_time = delay * (2 ** attempt)  # Exponential backoff
                print(f"Tool {request.name} failed (attempt {attempt + 1}), retrying in {wait_time}s...")
                await asyncio.sleep(wait_time)
    raise last_error

client = MultiServerMCPClient(
    {...},
    tool_interceptors=[retry_interceptor],
)
也可以捕获特定错误类型并返回 fallback 值:
使用 fallback 处理错误
async def fallback_interceptor(
    request: MCPToolCallRequest,
    handler,
):
    """Return a fallback value if tool execution fails."""
    try:
        return await handler(request)
    except TimeoutError:
        return f"Tool {request.name} timed out. Please try again later."
    except ConnectionError:
        return f"Could not connect to {request.name} service. Using cached data."

进度通知

订阅长时间运行工具执行的进度更新:
进度 callback
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.callbacks import Callbacks, CallbackContext

async def on_progress(
    progress: float,
    total: float | None,
    message: str | None,
    context: CallbackContext,
):
    """Handle progress updates from MCP servers."""
    percent = (progress / total * 100) if total else progress
    tool_info = f" ({context.tool_name})" if context.tool_name else ""
    print(f"[{context.server_name}{tool_info}] Progress: {percent:.1f}% - {message}")

client = MultiServerMCPClient(
    {...},
    callbacks=Callbacks(on_progress=on_progress),
)
CallbackContext 提供:
  • server_name:MCP 服务器名称
  • tool_name:正在执行的工具名称(在工具调用期间可用)

Logging

MCP 协议支持来自服务器的 logging 通知。使用 Callbacks 类订阅这些事件。
Logging callback
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.callbacks import Callbacks, CallbackContext
from mcp.types import LoggingMessageNotificationParams

async def on_logging_message(
    params: LoggingMessageNotificationParams,
    context: CallbackContext,
):
    """Handle log messages from MCP servers."""
    print(f"[{context.server_name}] {params.level}: {params.data}")

client = MultiServerMCPClient(
    {...},
    callbacks=Callbacks(on_logging_message=on_logging_message),
)

Elicitation

Elicitation 允许 MCP 服务器在工具执行期间向用户请求额外输入。服务器不必预先要求所有输入,而是可以按需交互式请求信息。

服务器设置

定义一个使用 ctx.elicit() 按 schema 请求用户输入的工具:
带 elicitation 的 MCP 服务器
from pydantic import BaseModel
from mcp.server.fastmcp import Context, FastMCP

server = FastMCP("Profile")

class UserDetails(BaseModel):
    email: str
    age: int

@server.tool()
async def create_profile(name: str, ctx: Context) -> str:
    """Create a user profile, requesting details via elicitation."""
    result = await ctx.elicit(
        message=f"Please provide details for {name}'s profile:",
        schema=UserDetails,
    )
    if result.action == "accept" and result.data:
        return f"Created profile for {name}: email={result.data.email}, age={result.data.age}"
    if result.action == "decline":
        return f"User declined. Created minimal profile for {name}."
    return "Profile creation cancelled."

if __name__ == "__main__":
    server.run(transport="http")

客户端设置

通过向 MultiServerMCPClient 提供 callback 来处理 elicitation 请求:
处理 elicitation 请求
from langchain_mcp_adapters.client import MultiServerMCPClient
from langchain_mcp_adapters.callbacks import Callbacks, CallbackContext
from mcp.shared.context import RequestContext
from mcp.types import ElicitRequestParams, ElicitResult

async def on_elicitation(
    mcp_context: RequestContext,
    params: ElicitRequestParams,
    context: CallbackContext,
) -> ElicitResult:
    """Handle elicitation requests from MCP servers."""
    # In a real application, you would prompt the user for input
    # based on params.message and params.requestedSchema
    return ElicitResult(
        action="accept",
        content={"email": "user@example.com", "age": 25},
    )

client = MultiServerMCPClient(
    {
        "profile": {
            "url": "http://localhost:8000/mcp",
            "transport": "http",
        }
    },
    callbacks=Callbacks(on_elicitation=on_elicitation),
)

响应动作

Elicitation callback 可以返回三种动作之一:
ActionDescription
accept用户提供了有效输入。请在 content 字段中包含数据。
decline用户选择不提供所请求的信息。
cancel用户完全取消了操作。
响应动作示例
# Accept with data
ElicitResult(action="accept", content={"email": "user@example.com", "age": 25})

# Decline (user doesn't want to provide info)
ElicitResult(action="decline")

# Cancel (abort the operation)
ElicitResult(action="cancel")

其他资源