Messages - Docs by LangChain中文

Messages 是 LangChain 中模型上下文的基本单元。它们表示模型的输入和输出，并携带与 LLM 交互时表示对话状态所需的内容和 metadata。 Messages 是包含以下内容的对象：

Role：标识 message 类型，例如 system、user
Content：表示 message 的实际内容，例如文本、图像、音频、文档等
Metadata：可选字段，例如响应信息、message IDs 和 token 用量

LangChain 提供可跨所有模型 provider 使用的标准 message 类型，确保无论调用哪个模型都具有一致行为。

基本用法

使用 messages 最简单的方式，是创建 message 对象，并在 invoking 模型时将其传给模型。

from langchain.chat_models import init_chat_model
from langchain.messages import HumanMessage, AIMessage, SystemMessage

model = init_chat_model("gpt-5-nano")

system_msg = SystemMessage("You are a helpful assistant.")
human_msg = HumanMessage("Hello, how are you?")

# Use with chat models
messages = [system_msg, human_msg]
response = model.invoke(messages)  # Returns AIMessage

多轮 agents 会累积很长的 message 历史。LangSmith 会记录每一轮、工具结果和模型响应，便于你检查完整对话。按照 tracing quickstart 启用 tracing。建议同时设置 LangSmith Engine，它会监控 traces、检测问题并提出修复建议。

Text prompts

Text prompts 是字符串，适合不需要保留对话历史的简单生成任务。

response = model.invoke("Write a haiku about spring")

在以下情况下使用 text prompts：

你有单个独立请求
你不需要对话历史
你希望代码复杂度最低

Message prompts

也可以通过提供 message 对象列表，将一组 messages 传给模型。

from langchain.messages import SystemMessage, HumanMessage, AIMessage

messages = [
    SystemMessage("You are a poetry expert"),
    HumanMessage("Write a haiku about spring"),
    AIMessage("Cherry blossoms bloom...")
]
response = model.invoke(messages)

在以下情况下使用 message prompts：

管理多轮对话
处理多模态内容（图像、音频、文件）
包含 system instructions

Dictionary format

也可以直接使用 OpenAI chat completions 格式指定 messages。

messages = [
    {"role": "system", "content": "You are a poetry expert"},
    {"role": "user", "content": "Write a haiku about spring"},
    {"role": "assistant", "content": "Cherry blossoms bloom..."}
]
response = model.invoke(messages)

Message types

System message：告诉模型如何行动，并为交互提供上下文
Human message：表示用户输入以及与模型的交互
AI message：模型生成的响应，包括文本内容、工具调用和 metadata
Tool message：表示 tool calls 的输出

System message

SystemMessage 表示一组用于引导模型行为的初始指令。可以使用 system message 设置语气、定义模型角色，并建立响应准则。

Basic instructions

system_msg = SystemMessage("You are a helpful coding assistant.")

messages = [
    system_msg,
    HumanMessage("How do I create a REST API?")
]
response = model.invoke(messages)

Detailed persona

from langchain.messages import SystemMessage, HumanMessage

system_msg = SystemMessage("""
You are a senior Python developer with expertise in web frameworks.
Always provide code examples and explain your reasoning.
Be concise but thorough in your explanations.
""")

messages = [
    system_msg,
    HumanMessage("How do I create a REST API?")
]
response = model.invoke(messages)

Human message

HumanMessage 表示用户输入和交互。它可以包含文本、图像、音频、文件，以及任意数量的多模态 content。

Text content

response = model.invoke([
  HumanMessage("What is machine learning?")
])

Message metadata

Add metadata

human_msg = HumanMessage(
    content="Hello!",
    name="alice",  # Optional: identify different users
    id="msg_123",  # Optional: unique identifier for tracing
)

name 字段的行为因 provider 而异，有些 provider 会用它识别用户，其他 provider 会忽略它。如需确认，请参阅模型 provider 的 reference。

AI message

AIMessage 表示模型调用的输出。它可以包含多模态数据、工具调用，以及稍后可访问的 provider-specific metadata。

response = model.invoke("Explain AI")
print(type(response))  # <class 'langchain.messages.AIMessage'>

调用模型时，模型会返回 AIMessage 对象，其中包含响应中的所有相关 metadata。不同 providers 会以不同方式权衡和理解不同类型的 messages，因此有时手动创建新的 AIMessage 对象并将其插入 message 历史会很有用，就像它来自模型一样。

from langchain.messages import AIMessage, SystemMessage, HumanMessage

# Create an AI message manually (e.g., for conversation history)
ai_msg = AIMessage("I'd be happy to help you with that question!")

# Add to conversation history
messages = [
    SystemMessage("You are a helpful assistant"),
    HumanMessage("Can you help me?"),
    ai_msg,  # Insert as if it came from the model
    HumanMessage("Great! What's 2+2?")
]

response = model.invoke(messages)

Attributes

text

string

Message 的文本内容。

content

string | dict[]

Message 的原始内容。

content_blocks

ContentBlock[]

Message 的标准化 content blocks。

tool_calls

dict[] | None

模型发出的工具调用。如果没有调用 tools，则为空。

string

Message 的唯一标识符，由 LangChain 自动生成或在 provider 响应中返回。

usage_metadata

dict | None

Message 的 usage metadata，可在可用时包含 token 计数。

response_metadata

ResponseMetadata | None

Message 的 response metadata。

Tool calls

当模型发出 tool calls 时，这些调用会包含在 AIMessage 中：

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-5-nano")

def get_weather(location: str) -> str:
    """Get the weather at a location."""
    ...

model_with_tools = model.bind_tools([get_weather])
response = model_with_tools.invoke("What's the weather in Paris?")

for tool_call in response.tool_calls:
    print(f"Tool: {tool_call['name']}")
    print(f"Args: {tool_call['args']}")
    print(f"ID: {tool_call['id']}")

其他结构化数据，例如 reasoning 或 citations，也可以出现在 message content 中。

Token usage

AIMessage 可以在其 usage_metadata 字段中保存 token 计数和其他 usage metadata：

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-5-nano")

response = model.invoke("Hello!")
response.usage_metadata

{'input_tokens': 8,
 'output_tokens': 304,
 'total_tokens': 312,
 'input_token_details': {'audio': 0, 'cache_read': 0},
 'output_token_details': {'audio': 0, 'reasoning': 256}}

详情请参阅 UsageMetadata。

Streaming and chunks

在 streaming 期间，你会收到 AIMessageChunk 对象，它们可以合并成完整的 message 对象：

chunks = []
full_message = None
for chunk in model.stream("Hi"):
    chunks.append(chunk)
    print(chunk.text)
    full_message = chunk if full_message is None else full_message + chunk

了解更多：

Tool message

对于支持 tool calling 的模型，AI messages 可以包含工具调用。Tool messages 用于将单个工具执行的结果传回模型。 Tools 可以直接生成 ToolMessage 对象。下面展示一个简单示例。更多内容请阅读 tools guide。

from langchain.messages import AIMessage
from langchain.messages import ToolMessage

# After a model makes a tool call
# (Here, we demonstrate manually creating the messages for brevity)
ai_message = AIMessage(
    content=[],
    tool_calls=[{
        "name": "get_weather",
        "args": {"location": "San Francisco"},
        "id": "call_123"
    }]
)

# Execute tool and create result message
weather_result = "Sunny, 72°F"
tool_message = ToolMessage(
    content=weather_result,
    tool_call_id="call_123"  # Must match the call ID
)

# Continue conversation
messages = [
    HumanMessage("What's the weather in San Francisco?"),
    ai_message,  # Model's tool call
    tool_message,  # Tool execution result
]
response = model.invoke(messages)  # Model processes the result

Attributes

content

string

required

工具调用的字符串化输出。

tool_call_id

string

required

此 message 响应的工具调用 ID。必须与 AIMessage 中的工具调用 ID 匹配。

name

string

required

被调用工具的名称。

artifact

dict

不发送给模型、但可通过程序访问的额外数据。

artifact 字段存储不会发送给模型、但可通过程序访问的补充数据。这适合存储原始结果、调试信息或用于下游处理的数据，同时避免污染模型上下文。

示例：使用 artifact 保存 retrieval metadata

例如，retrieval 工具可以从文档中检索一段文本，供模型参考。Message content 包含模型会引用的文本，而 artifact 可以包含文档标识符或应用可使用的其他 metadata，例如用于渲染页面。请参阅下面的示例：

from langchain.messages import ToolMessage

# Sent to model
message_content = "It was the best of times, it was the worst of times."

# Artifact available downstream
artifact = {"document_id": "doc_123", "page": 0}

tool_message = ToolMessage(
    content=message_content,
    tool_call_id="call_123",
    name="search_books",
    artifact=artifact,
)

如需查看使用 LangChain 构建 retrieval agents 的端到端示例，请参阅 RAG tutorial。

Message content

可以将 message 的 content 理解为发送给模型的数据负载。Messages 具有一个弱类型的 content 属性，支持字符串和无类型对象列表（例如字典）。这让 LangChain chat models 能够直接支持 provider-native 结构，例如 multimodal 内容和其他数据。此外，LangChain 为文本、reasoning、citations、多模态数据、server-side tool calls 以及其他 message content 提供专用 content 类型。请参阅下面的 content blocks。 LangChain chat models 通过 content 属性接收 message content。它可以包含以下任一内容：

字符串
provider-native 格式的 content blocks 列表
LangChain’s standard content blocks 列表

下面是使用 multimodal 输入的示例：

from langchain.messages import HumanMessage

# String content
human_message = HumanMessage("Hello, how are you?")

# Provider-native format (e.g., OpenAI)
human_message = HumanMessage(content=[
    {"type": "text", "text": "Hello, how are you?"},
    {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}}
])

# List of standard content blocks
human_message = HumanMessage(content_blocks=[
    {"type": "text", "text": "Hello, how are you?"},
    {"type": "image", "url": "https://example.com/image.jpg"},
])

初始化 message 时指定 content_blocks 仍会填充 message content，但会提供类型安全接口。

Standard content blocks

LangChain 为 message content 提供可跨 providers 使用的标准表示。 Message 对象实现了 content_blocks 属性，会将 content 属性惰性解析为标准、类型安全的表示。例如，由 ChatAnthropic 或 ChatOpenAI 生成的 messages 会包含对应 provider 格式的 thinking 或 reasoning blocks，但可以惰性解析为一致的 ReasoningContentBlock 表示：

Anthropic
OpenAI

from langchain.messages import AIMessage

message = AIMessage(
    content=[
        {"type": "thinking", "thinking": "...", "signature": "WaUjzkyp..."},
        {"type": "text", "text": "..."},
    ],
    response_metadata={"model_provider": "anthropic"}
)
message.content_blocks

[{'type': 'reasoning',
  'reasoning': '...',
  'extras': {'signature': 'WaUjzkyp...'}},
 {'type': 'text', 'text': '...'}]

from langchain.messages import AIMessage

message = AIMessage(
    content=[
        {
            "type": "reasoning",
            "id": "rs_abc123",
            "summary": [
                {"type": "summary_text", "text": "summary 1"},
                {"type": "summary_text", "text": "summary 2"},
            ],
        },
        {"type": "text", "text": "...", "id": "msg_abc123"},
    ],
    response_metadata={"model_provider": "openai"}
)
message.content_blocks

[{'type': 'reasoning', 'id': 'rs_abc123', 'reasoning': 'summary 1'},
 {'type': 'reasoning', 'id': 'rs_abc123', 'reasoning': 'summary 2'},
 {'type': 'text', 'text': '...', 'id': 'msg_abc123'}]

请参阅 integrations guides，开始使用你选择的 inference provider。

序列化标准 content如果 LangChain 外部的应用需要访问标准 content block 表示，可以选择将 content blocks 存储在 message content 中。为此，可以将 LC_OUTPUT_VERSION 环境变量设置为 v1。也可以使用 output_version="v1" 初始化任意 chat model：

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-5-nano", output_version="v1")

Multimodal

Multimodality 指处理不同形式数据的能力，例如文本、音频、图像和视频。LangChain 为这些数据提供可跨 providers 使用的标准类型。 Chat models 可以接受多模态数据作为输入，并生成多模态数据作为输出。下面展示包含多模态数据的输入 messages 简短示例。

额外 keys 可以包含在 content block 顶层，或嵌套在 "extras": {"key": value} 中。OpenAI, 例如，PDF 需要 filename。具体要求请参阅所选模型的 provider page。

# From URL
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe the content of this image."},
        {"type": "image", "url": "https://example.com/path/to/image.jpg"},
    ]
}

# From base64 data
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe the content of this image."},
        {
            "type": "image",
            "base64": "AAAAIGZ0eXBtcDQyAAAAAGlzb21tcDQyAAACAGlzb2...",
            "mime_type": "image/jpeg",
        },
    ]
}

# From provider-managed File ID
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe the content of this image."},
        {"type": "image", "file_id": "file-abc123"},
    ]
}

并非所有模型都支持所有文件类型。请查看模型 provider 的 reference，了解支持的格式和大小限制。

Content block reference

Content blocks 表示为类型化字典列表，无论是在创建 message 时，还是访问 content_blocks 属性时都是如此。列表中的每一项都必须符合以下 block 类型之一：

Core

TextContentBlock

用途： 标准文本输出

type

string

required

始终为 "text"

text

string

required

文本内容

annotations

object[]

文本的 annotations 列表

extras

object

额外的 provider-specific 数据

示例：

{
    "type": "text",
    "text": "Hello world",
    "annotations": []
}

ReasoningContentBlock

用途： 模型推理步骤

type

string

required

始终为 "reasoning"

reasoning

string

推理内容

extras

object

额外的 provider-specific 数据

示例：

{
    "type": "reasoning",
    "reasoning": "The user is asking about...",
    "extras": {"signature": "abc123"},
}

Multimodal

ImageContentBlock

用途： 图像数据

type

string

required

始终为 "image"

url

string

指向图像位置的 URL。

base64

string

Base64 编码的图像数据。

string

此 content block 的唯一标识符，由 provider 或 LangChain 生成。

mime_type

string

图像 MIME type，例如 image/jpeg、image/png。base64 数据必须提供。

AudioContentBlock

用途： 音频数据

type

string

required

始终为 "audio"

url

string

指向音频位置的 URL。

base64

string

Base64 编码的音频数据。

string

此 content block 的唯一标识符，由 provider 或 LangChain 生成。

mime_type

string

音频 MIME type，例如 audio/mpeg、audio/wav。base64 数据必须提供。

VideoContentBlock

用途： 视频数据

type

string

required

始终为 "video"

url

string

指向视频位置的 URL。

base64

string

Base64 编码的视频数据。

string

此 content block 的唯一标识符，由 provider 或 LangChain 生成。

mime_type

string

视频 MIME type，例如 video/mp4、video/webm。base64 数据必须提供。

FileContentBlock

用途： 通用文件（PDF 等）

type

string

required

始终为 "file"

url

string

指向文件位置的 URL。

base64

string

Base64 编码的文件数据。

string

此 content block 的唯一标识符，由 provider 或 LangChain 生成。

mime_type

string

文件 MIME type，例如 application/pdf。base64 数据必须提供。

PlainTextContentBlock

用途： 文档文本（.txt、.md）

type

string

required

始终为 "text-plain"

text

string

文本内容

mime_type

string

文本的 MIME type，例如 text/plain、text/markdown

Tool Calling

ToolCall

用途： 函数调用

type

string

required

始终为 "tool_call"

name

string

required

要调用的工具名称

args

object

required

传给工具的参数

string

required

此工具调用的唯一标识符

示例：

{
    "type": "tool_call",
    "name": "search",
    "args": {"query": "weather"},
    "id": "call_123"
}

ToolCallChunk

用途： 流式工具调用片段

type

string

required

始终为 "tool_call_chunk"

name

string

正在调用的工具名称

args

string

部分工具参数（可能是不完整 JSON）

string

工具调用标识符

index

number | string

此 chunk 在 stream 中的位置

InvalidToolCall

用途： 格式错误的调用，用于捕获 JSON 解析错误。

type

string

required

始终为 "invalid_tool_call"

name

string

调用失败的工具名称

args

object

传给工具的参数

error

string

错误说明

Server-Side Tool Execution

ServerToolCall

用途： 在服务器端执行的工具调用。

type

string

required

始终为 "server_tool_call"

string

required

与工具调用关联的标识符。

name

string

required

要调用的工具名称。

args

string

required

部分工具参数（可能是不完整 JSON）

ServerToolCallChunk

用途： 流式 server-side 工具调用片段

type

string

required

始终为 "server_tool_call_chunk"

string

与工具调用关联的标识符。

name

string

正在调用的工具名称

args

string

部分工具参数（可能是不完整 JSON）

index

number | string

此 chunk 在 stream 中的位置

ServerToolResult

用途： 搜索结果

type

string

required

始终为 "server_tool_result"

tool_call_id

string

required

对应 server tool call 的标识符。

string

与 server tool result 关联的标识符。

status

string

required

Server-side tool 的执行状态。"success" 或 "error"。

output

已执行工具的输出。

Provider-Specific Blocks

NonStandardContentBlock

用途： Provider-specific escape hatch

type

string

required

始终为 "non_standard"

value

object

required

Provider-specific 数据结构

用法： 用于实验性功能或 provider 独有功能

其他 provider-specific content 类型可在各模型 provider 的 reference documentation 中找到。

请在 API reference 中查看规范类型定义。

Content blocks 是 LangChain v1 在 messages 上引入的新属性，用于在保持现有代码向后兼容的同时，跨 providers 标准化 content 格式。Content blocks 并不是 content 属性的替代品，而是一个新属性，可用于以标准化格式访问 message content。

Use with chat models

Chat models 接受一组 message 对象作为输入，并返回 AIMessage 作为输出。交互通常是无状态的，因此简单的对话循环会使用不断增长的 messages 列表调用模型。请参阅以下指南了解更多：

用于 persisting and managing conversation histories 的内置功能
管理上下文窗口的策略，包括 trimming and summarizing messages

Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

Edit this page on GitHub or file an issue.

​基本用法

​Text prompts

​Message prompts

​Dictionary format

​Message types

​System message

​Human message

​Text content

​Message metadata

​AI message

​Tool calls

​Token usage

​Streaming and chunks

​Tool message

​Message content

​Standard content blocks

​Multimodal

​Content block reference

​Use with chat models

基本用法

Text prompts

Message prompts

Dictionary format

Message types

System message

Human message

Text content

Message metadata

AI message

Tool calls

Token usage

Streaming and chunks

Tool message

Message content

Standard content blocks

Multimodal

Content block reference

Use with chat models