【AI Updates】DeepSeek-V3.1开启智能体新时代

最新推荐文章于 2025-08-22 20:23:00 发布

带你去吃小豆花

最新推荐文章于 2025-08-22 20:23:00 发布

阅读量637

点赞数 6

CC 4.0 BY-SA版权

文章标签：人工智能 deepseek deepseek v3.1

DeepSeek-V3.1 的发布标志着 DeepSeek 在“智能体时代”迈出的第一步，其核心在于引入了独特的“混合推理”模式，并显著提升了工具使用和多步智能体任务的能力。

DeepSeek-V3.1 最重要的创新是其混合推理能力，即“一个模型，两种模式”——“思”（Thinking）与“非思”（Non-Thinking）。

特点：用户可以通过 DeepSeek Chat 界面上的“DeepThink”按钮（https://chat.deepseek.com/）切换这两种模式。
API 也提供了对应的端点：
deepseek-chat → 非思考模式 (non-thinking mode)
deepseek-reasoner → 思考模式 (thinking mode)
优势： “思考模式”显著提升了效率。“DeepSeek-V3.1-Think 在更短的时间内给出答案，相比 DeepSeek-R1-0528。”（“Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528”）

V3.1 版本在智能体任务和工具使用方面取得了显著的后训练提升：

工具使用与多步任务： “后训练提升了工具使用和多步智能体任务。”（“Post-training boosts tool use and multi-step agent tasks”）
实际应用表现：在 SWE / Terminal-Bench 上取得了“更好的结果”（“Better results on SWE / Terminal-Bench”）。
在复杂搜索任务中，多步推理能力“更强”（“Stronger multi-step reasoning for complex search tasks”）。
思考效率： 在思考效率方面实现了“巨大提升”（“Big gains in thinking efficiency”）。

DeepSeek-V3.1 在 API 和模型底层技术上也进行了重要更新：

上下文长度： 两种模式（思与非思）均支持 128K 上下文。
API 兼容性： “支持 Anthropic API 格式”（“Anthropic API format supported”），方便开发者集成。
严格函数调用： 在 Beta API 中“支持严格函数调用”（“Strict Function Calling supported in Beta API”）。
基础模型： V3.1 Base 是在 V3 的基础上，“为长上下文扩展进行了 840B token 的持续预训练”（“840B tokens continued pretraining for long context extension on top of V3”）。
分词器与聊天模板： 更新了分词器和聊天模板。
开源权重： V3.1 Base 和 V3.1 的开源权重已在 Hugging Face 上发布。

新定价生效： 新定价将从 2025 年 9 月 5 日 16:00 (UTC 时间) 开始生效，同时非高峰期折扣将结束。
当前定价： 在此日期之前，API 将遵循当前定价。
详细信息请参考定价页面：https://api-docs.deepseek.com/quick_start/pricing/

版本名称： DeepSeek-V3.1
核心创新： 混合推理 (Hybrid inference: Think & Non-Think)
两种模式：deepseek-chat → 非思考模式
deepseek-reasoner → 思考模式
思考模式优势： “Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528.”
智能体能力： “Post-training boosts tool use and multi-step agent tasks.”
上下文长度： 128K (两种模式均支持)
API 兼容性： Anthropic API format supported.
严格函数调用： Strict Function Calling supported in Beta API.
基础模型： V3.1 Base: “840B tokens continued pretraining for long context extension on top of V3.”
定价变化时间： 2025 年 9 月 5 日 16:00 (UTC Time)