DeepSeek v4 （深求 v4）

DeepSeek v4 （深求 v4）
DeepSeek v4

原始链接: https://api-docs.deepseek.com/news/news260424

DeepSeek-V4现已开源，提供具有突破性100万上下文长度的性价比高的语言模型。有两个版本：**DeepSeek-V4-Pro**（总参数1.6T/活跃参数49B）在推理、编码和世界知识方面与顶级闭源模型相媲美，并在代理任务中表现出色。**DeepSeek-V4-Flash**（总参数284B/活跃参数13B）提供更快、更经济的选择，在较简单的任务中具有可比的推理能力。两种模型都利用创新的注意力机制（token-wise压缩 & DeepSeek稀疏注意力）来实现最高的效率和降低的计算成本。DeepSeek-V4可以无缝集成到流行的AI代理中，如Claude Code和OpenClaw。提供了一个更新的API，支持OpenAI和Anthropic APIs，并提供“思考”/“非思考”模式。现有的DeepSeek聊天和推理模型将于2026年7月退役。通过Hugging Face访问模型和技术报告，并在chat.deepseek.com上试用它们。

原文

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.

🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!

📄 Tech Report: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf

🤗 Open Weights: https://huggingface.co/collections/deepseek-ai/deepseek-v4

DeepSeek-V4-Pro

🔹 Enhanced Agentic Capabilities: Open-source SOTA in Agentic Coding benchmarks.
🔹 Rich World Knowledge: Leads all current open models, trailing only Gemini-3.1-Pro.
🔹 World-Class Reasoning: Beats all current open models in Math/STEM/Coding, rivaling top closed-source models.

DeepSeek-V4-Flash

🔹 Reasoning capabilities closely approach V4-Pro.
🔹 Performs on par with V4-Pro on simple Agent tasks.
🔹 Smaller parameter size, faster response times, and highly cost-effective API pricing.

Structural Innovation & Ultra-High Context Efficiency

🔹 Novel Attention: Token-wise compression + DSA (DeepSeek Sparse Attention).
🔹 Peak Efficiency: World-leading long context with drastically reduced compute & memory costs.
🔹 1M Standard: 1M context is now the default across all official DeepSeek services.

Dedicated Optimizations for Agent Capabilities

🔹 DeepSeek-V4 is seamlessly integrated with leading AI agents like Claude Code, OpenClaw & OpenCode.
🔹 Already driving our in-house agentic coding at DeepSeek.

The figure below showcases a sample PDF generated by DeepSeek-V4-Pro.

API is Available Today!

🔹 Keep base_url, just update model to deepseek-v4-pro or deepseek-v4-flash.
🔹 Supports OpenAI ChatCompletions & Anthropic APIs.
🔹 Both models support 1M context & dual modes (Thinking / Non-Thinking): https://api-docs.deepseek.com/guides/thinking_mode

⚠️ Note: deepseek-chat & deepseek-reasoner will be fully retired and inaccessible after Jul 24th, 2026, 15:59 (UTC Time). (Currently routing to deepseek-v4-flash non-thinking/thinking).

🔹 Amid recent attention, a quick reminder: please rely only on our official accounts for DeepSeek news. Statements from other channels do not reflect our views.
🔹 Thank you for your continued trust. We remain committed to longtermism, advancing steadily toward our ultimate goal of AGI.