Claude Code 的“扩展思考”只是一个摘要,并非真正的思考。
The text in Claude Code’s “Extended Thinking” output

原始链接: https://patrickmccanna.net/the-text-in-claude-codes-extended-thinking-output-is-not-authentic/

以下是内容摘要: Claude Code 用户需注意,本地记录的“思考区块”并非模型实际的推理过程。它们实际上是作为占位符的加密签名。由于加密密钥由 Anthropic 持有,API 仅返回模型逻辑的高级摘要,而非完整的执行轨迹。目前,获取原始且完整的推理输出需要企业级协议。 对于依赖这些日志进行审计追踪的用户而言,这引发了严重的透明度问题。由于本地日志仅是经过信息丢失处理的摘要,而非底层的决策数据,用户无法独立验证或重构其智能体所采用的具体逻辑。作者对 Anthropic 的间接文档表达了不满,指出这种行为往往被刻意模糊处理。对于需要智能体行为可验证、透明日志的开发者而言,目前的 Claude Code 本地文件尚不足以满足需求,这也凸显了对更透明的开源替代方案日益增长的迫切性。

Hacker News 上的一场讨论对 Claude Code 中“扩展思考”(extended thinking)的透明度提出了质疑。批评者认为,所提供的推理日志仅仅是摘要,而非模型真实的原始思维过程。 辩论的要点包括: * **数据丢失**:反对者认为,总结推理链会导致信息丢失,从而增加评估性能或调试模型行为的难度。 * **策略性掩盖**:一些用户指出,模型提供商故意隐藏这些日志是为了建立“经济护城河”,在保持专有控制权的同时增强用户依赖。 * **实用性与透明度**:另一些人则认为,完整的推理日志往往过于杂乱,摘要对于上下文管理更有帮助。此外,有人认为大型语言模型中的“思考”本质上是抽象的;即使是完整的日志也只是对复杂神经活动的解读,而非洞察机器认知的透明窗口。 归根结底,该讨论反映了人工智能推理工具的实用性与高级用户对深入获取原始日志以进行审计和性能评估的需求之间日益增长的矛盾。
相关文章

原文

Claude Code records each session to disk. Those logs include “thinking blocks” — the model’s own reasoning as it works.

I went to inspect that reasoning this weekend and found a signature (600 characters long) and no text.

So I read the docs: https://platform.claude.com/docs/en/build-with-claude/extended-thinking

Some details worth being aware of:

  • Claude encrypts its reasoning into that signature.
  • Anthropic holds the key. Your machine doesn’t receive it.
  • The API hands back a SUMMARY of reasoning, NOT the reasoning itself.
  • Getting the full thinking output requires an enterprise agreement.

Matt Green looked into this and has some more detailed observations on the signature blocks.

This is worth knowing before you promise anyone an audit trail. Also- BEWARE: The “extended-thinking” output from ctrl+o is a summary of Fable/Opus’ thinking. It isn’t the actual thinking that drove the model’s actions in a session- but a summary of the thinking logic. This is like using saving a jpeg as a .bmp and then editing the .bmp and presenting it as a .jpeg. The conversion produces data loss.

I’m underwhelmed by how Anthropic is presenting the behavior of their application. If you ever need a record of the logic a used by YOUR AGENT during a session:

  • you can’t produce one using the local files. The reasoning logs on your system are not accessible to you.
  • You can log the inputs, the outputs, and the actions of a running Claude code with some scrappy scraping- but even then- it’s not the actual reasoning that drove the agent’s behavior.

And the language in the docs is awfully indirect. If you haven’t had you’re coffee, you might miss that “extended thinking returns a summary of Claude’s full thinking process”

Screenshot

Performance improvements in Open Source models need to come faster.

联系我们 contact @ memedata.com