GPT-5-Codex 和 Codex 的升级
GPT‑5-Codex and upgrades to Codex

原始链接: https://simonwillison.net/2025/Sep/15/gpt-5-codex/

OpenAI “半发布”了GPT-5-Codex,这是一款专门为编码任务设计的模型,已集成到他们的VS Code扩展、Codex CLI和Codex Cloud等工具中。虽然目前尚未通过API提供,但OpenAI计划很快开放。 GPT-5-Codex在代码审查方面表现出色——Codex Cloud中的一项新功能允许针对GitHub仓库进行自动化审查——并且在代码重构方面表现出改进(在OpenAI的评估中从33.9%提高到51.3%)。它会根据任务的复杂程度动态调整处理时间,加速简单的请求,同时为复杂的任务投入更多时间(在某些情况下甚至长达七小时)。 改进还包括更相关的代码注释和更好的移动网站创建。然而,初步测试显示Codex CLI搜索工具存在潜在问题。尽管如此,初步评价积极,表明GPT-5-Codex是OpenAI编码模型家族的重要进步。

Hacker News新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录GPT‑5-Codex 和 Codex 的升级 (simonwillison.net)38 分,amrrs 发表于 5 小时前 | 隐藏 | 过去 | 收藏 | 4 条评论 sanxiyn 发表于 1 分钟前 | 下一个 [–] > “我们发现 GPT‑5-Codex 的评论不太可能不正确或不重要”——代码中减少不重要的评论绝对是一项改进!这似乎是个误解。在 OpenAI 的原始文章中,这里的评论指的是代码审查评论,而不是代码中的注释。回复lostmsu 发表于 2 小时前 | 上一个 [–] 鹈鹕不太好回复TiredOfLife 发表于 2 小时前 | 父级 [–] 但可能很快回复AstroBen 发表于 10 分钟前 | 根 | 父级 [–] 如果它骑上自行车会更快回复 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系 搜索:
相关文章

原文

GPT‑5-Codex and upgrades to Codex. OpenAI half-released a new model today: GPT‑5-Codex, a fine-tuned GPT-5 variant explicitly designed for their various AI-assisted programming tools.

I say half-released because it's not yet available via their API, but they "plan to make GPT‑5-Codex available in the API soon".

I wrote about the confusing array of OpenAI products that share the name Codex a few months ago. This new model adds yet another, though at least "GPT-5-Codex" (using two hyphens) is unambiguous enough not to add to much more to the confusion.

At this point it's best to think of Codex as OpenAI's brand name for their coding family of models and tools.

The new model is already integrated into their VS Code extension, the Codex CLI and their Codex Cloud asynchronous coding agent. I'd been calling that last one "Codex Web" but I think Codex Cloud is a better name since it can also be accessed directly from their iPhone app.

Codex Cloud also a new feature: you can configure it to automatically run code review against specific GitHub repositories (I found that option on chatgpt.com/codex/settings/code-review) and it will create a temporary container to use as part of those reviews. Here's the relevant documentation.

Some documented features of the new GPT-5-Codex model:

  • Specifically trained for code review, which directly supports their new code review feature.
  • "GPT‑5-Codex adapts how much time it spends thinking more dynamically based on the complexity of the task." Simple tasks (like "list files in this directory") should run faster. Large, complex tasks should use run for much longer - OpenAI report Codex crunching for seven hours in some cases!
  • Increased score on their proprietary "code refactoring evaluation" from 33.9% for GPT-5 (high) to 51.3% for GPT-5-Codex (high). It's hard to evaluate this without seeing the details of the eval but it does at least illustrate that refactoring performance is something they've focused on here.
  • "GPT‑5-Codex also shows significant improvements in human preference evaluations when creating mobile websites" - in the past I've habitually prompted models to "make it mobile-friendly", maybe I don't need to do that any more.
  • "We find that comments by GPT‑5-Codex are less likely to be incorrect or unimportant" - less unimportant comments in code is definitely an improvement!

The system prompt for GPT-5-Codex in Codex CLI is worth a read. It's notably shorter than the system prompt for other models - here's a diff.

Theo Browne has a video review of the model and accompanying features. He was generally impressed but noted that it was surprisingly bad at using the Codex CLI search tool to navigate code. Hopefully that's something that can fix with a system prompt update.

Finally, can it drew a pelican riding a bicycle? Without API access I instead got Codex Cloud to have a go by prompting:

Generate an SVG of a pelican riding a bicycle, save as pelican.svg

Here's the result:

it's a bit messy - the pelican is quite good and the bicycle is quite good but the pelican is stood overlapping the bicycle not riding it.

联系我们 contact @ memedata.com