Gemini 2.5 Pro Preview: even better coding performance


Original link: https://developers.googleblog.com/en/gemini-2-5-pro-io-improved-coding-performance/

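Google has released an early preview of Gemini 2.5 Pro (I/O edition) with significantly stronger coding capabilities, especially for front-end and UI development. The update is meant to give developers something to build with ahead of Google I/O. Replit and Cognition are already using the model for agentic workflows and complex code refactoring, respectively. Gemini 2.5 Pro now ranks #1 on the WebDev Arena leaderboard, reflecting its ability to build functional and aesthetically pleasing web apps. Key improvements include state-of-the-art video understanding, for example creating an interactive learning app from a YouTube video, and simpler feature development through automated UI generation. The new dictation starter app shows how the model turns a concept into a working app with a polished UI. Developers can access Gemini 2.5 Pro through the Gemini API in Google AI Studio, and enterprise customers can use Vertex AI. The update responds to earlier feedback by reducing function calling errors and improving function calling trigger rates, and pricing is unchanged. The previous version automatically points to the improved model. Google encourages developers to explore the tool and build innovative apps with it.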

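This Hacker News thread focuses on the Gemini 2.5 Pro model and its code generation and design capabilities. Users share their experience with it on programming tasks, noting improvements in API hallucination and its potential to replace Stack Overflow searches. Some, however, raise concerns about how well it handles abstraction, code design, and architectural considerations, and some note that it still needs human oversight. The discussion explores whether AI might eventually surpass human programmers and designers. Others compare it to power tools, arguing that such tools improve efficiency rather than replace workers. Whether its capabilities are truly extraordinary is debated, with some arguing that more advanced training data or architectural breakthroughs are needed. Some users report excessive comments, unnecessary refactoring, and difficulties with less popular programming languages. Overall, while the model is seen as a valuable tool, users stress that it still requires human oversight and that its capabilities vary by tech stack. Price and user interface also shape the user experience.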

Original text

We’ve seen developers doing amazing things with Gemini 2.5 Pro, so we decided to release an updated version a couple of weeks early to get it into developers’ hands sooner.

Today we’re excited to release Gemini 2.5 Pro Preview (I/O edition). This update features even stronger coding capabilities, for you to start building with before Google I/O later this month. Expect meaningful improvements for front-end and UI development, alongside improvements in fundamental coding tasks such as transforming and editing code, and creating sophisticated agentic workflows.

“We found Gemini 2.5 Pro to be the best frontier model when it comes to ‘capability over latency’ ratio. I look forward to rolling it out on Replit Agent whenever a latency-sensitive task needs to be accomplished with a high degree of reliability.”
– Michele Catasta, President, Replit

Best-in-class frontend web development

Gemini 2.5 Pro now ranks #1 on the WebDev Arena leaderboard, which measures human preference for a model’s ability to build aesthetically pleasing and functional web apps. Drawing on this leading capability, Gemini 2.5 Pro powers Cursor’s innovative code agent and empowers our collaborations with companies like Cognition and Replit. Together, we're pushing the frontiers of agentic programming to unlock new possibilities for developers.

“The updated Gemini 2.5 Pro achieves leading performance on our junior-dev evals. It was the first-ever model that solved one of our evals involving a larger refactor of a request routing backend. It felt like a more senior developer because it was able to make correct judgement calls and choose good abstractions.”
– Silas Alberti, Founding Team, Cognition

Gemini 2.5 Pro in action

Gemini 2.5 Pro’s deep understanding of code, combined with powerful reasoning, continues to make it the go-to model for developers. We’re particularly excited about how this model can be used in the following cases.

Video to code

Gemini 2.5 Pro delivers state-of-the-art video understanding, scoring 84.8% on the VideoMME benchmark. Combining this with coding enables new flows that were not possible with previous versions. For example, the Video to Learning App in Google AI Studio demonstrates how Gemini 2.5 Pro creates an interactive learning app based on a single YouTube video. With improved video understanding and a complete UI, the updated Gemini 2.5 Pro model delivers a more functional experience than the previous simple example.

Easier feature development

Gemini 2.5 Pro is strong at front-end web development, helping you get more done. Implementing new features typically means manually diving into design files and inspecting components to match style properties like colors, fonts, padding, margins, and borders, then manually writing the CSS code needed to replicate those visual properties accurately. Now imagine using Gemini 2.5 Pro in an IDE and having the model generate new features, like adding a video player in the style of the other apps in the Gemini 95 starter app.

Quick concepts to working apps

Bringing ideas to life with both functionality and a beautiful UI is made easier with Gemini 2.5 Pro. The new dictation starter app, built using the updated model, is a great example of this in action. Pay attention to some of the details like the wavelength animations, responsive design, and subtle button hover effects. By default, the model has a real taste for aesthetic web development while maintaining its steerability, helping developers quickly take a concept to a working web app. Gemini 2.5 Pro was able to design and code the microphone UI animation for the dictation starter app.

Start building with Gemini 2.5 Pro

You can build with Gemini 2.5 Pro using the Gemini API in Google AI Studio, and enterprise customers can use Vertex AI.
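As a rough illustration of the API route, the sketch below builds a `generateContent` request for the Gemini API's REST endpoint using only the Python standard library. The model ID `gemini-2.5-pro-preview-05-06` is inferred from the 05-06 version label in this post, and the helper names `build_request` and `extract_text` are hypothetical; treat the endpoint and field names as assumptions to check against the current API reference.

```python
import json
import urllib.request

# Assumed values: the model ID is inferred from the "05-06" version label,
# and the base URL is the public v1beta REST endpoint of the Gemini API.
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"
MODEL_ID = "gemini-2.5-pro-preview-05-06"


def build_request(prompt: str, api_key: str, model: str = MODEL_ID) -> urllib.request.Request:
    """Build (but do not send) a generateContent request for a text prompt."""
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url=f"{BASE_URL}/models/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-goog-api-key": api_key,  # API key created in Google AI Studio
        },
        method="POST",
    )


def extract_text(response: dict) -> str:
    """Join the text parts of the first candidate in a decoded response."""
    parts = response["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts)
```

Passing the result of `build_request(...)` to `urllib.request.urlopen` and decoding the body with `json.load` would send the prompt and yield a dict suitable for `extract_text`. Keeping request construction separate from the network call makes the payload easy to inspect before any quota is spent.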

For developers already using Gemini 2.5 Pro, this new version will not only improve coding performance but will also address key developer feedback, including reducing errors in function calling and improving function calling trigger rates. The previous iteration (03-25) now points to the most recent version (05-06), so no action is required to use the improved model, and it continues to be available at the same price. We have also updated the model card with the new version of 2.5 Pro.
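To make the function calling feedback more concrete, the following sketch shows one way a client might declare a tool and defensively handle a function call returned by the model. The `get_weather` tool, the `dispatch` helper, and the sample call dicts are hypothetical, and the `functionDeclarations` and `name`/`args` shapes are assumed from the Gemini API's REST format rather than taken from this post.

```python
from typing import Any, Callable

# Hypothetical tool declaration in the REST "tools" shape (assumed field names).
WEATHER_TOOL = {
    "functionDeclarations": [{
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    }]
}


def get_weather(city: str) -> dict:
    """Stand-in implementation; a real app would query a weather service."""
    return {"city": city, "forecast": "sunny"}


# Local functions the model is allowed to call, keyed by declared name.
HANDLERS: dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch(function_call: dict) -> dict:
    """Validate a model-issued call against the declaration, then run it."""
    name = function_call.get("name")
    args = function_call.get("args", {})
    declared = {d["name"]: d for d in WEATHER_TOOL["functionDeclarations"]}
    if name not in declared or name not in HANDLERS:
        return {"error": f"unknown function: {name}"}
    missing = [p for p in declared[name]["parameters"]["required"] if p not in args]
    if missing:
        return {"error": f"missing arguments: {missing}"}
    return {"result": HANDLERS[name](**args)}
```

For example, `dispatch({"name": "get_weather", "args": {"city": "Paris"}})` runs the stand-in handler, while a call with an undeclared name or a missing `city` argument returns an error dict instead of raising. Guards like these remain useful on the client side even as the model's own function calling error rate drops.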

We can’t wait to see the amazing apps you build!