Anthropic 发布了 Claude 的语音模式。
Anthropic launches a voice mode for Claude

原始链接: https://techcrunch.com/2025/05/27/anthropic-launches-a-voice-mode-for-claude/

Anthropic正在为其Claude聊天机器人应用程序推出语音模式,允许用户与AI进行语音对话。该功能目前处于测试阶段,正在英语地区推出,用户可以使用语音与Claude互动,使其更方便免提使用。语音模式由Claude Sonnet 4模型驱动,提供五种不同的语音选项、屏幕上的要点以及在文本和语音交互之间切换的能力。 与OpenAI、谷歌(Gemini Live)和xAI(Grok)的语音聊天选项类似,它旨在创造更自然的对话体验。使用Claude的语音模式,用户可以讨论文档和图像,并接收对话的文字记录和摘要。免费用户有对话限制(20-30次),付费订阅者可以访问Google Workspace连接器,以实现日历和Gmail集成。Anthropic在今年早些时候证实了其在语音功能方面的工作,并可能与亚马逊和ElevenLabs合作以增强未来的语音功能。

Anthropic has launched a voice mode for Claude, potentially leveraging ElevenLabs for text-to-speech, confirmed via Anthropic's Trust Center. Users are sharing their experiences, highlighting pros like explicit start/stop control, file uploading during voice chats, and real-time text display. However, transcription accuracy is criticized, and some users reported initial buggy moments. Some are suggesting improvements such as automatic interjection detection and structured output. Users also express frustrations with Anthropic's rate limits, auto-ban policies, and API uptime. Feature requests include "download all files" functionality and a "Deep Research" mode. Some find the paid tiers too expensive for the value.

原文

Anthropic has begun to roll out a “voice mode” for its Claude chatbot apps.

The voice mode (in beta for now) allows Claude mobile app users to have “complete spoken conversations with Claude,” and will arrive in English over the next few weeks, according to Anthropic’s official account on X and updated documentation on the company’s website.

At least one user on X reports having gained access to voice mode late Tuesday. By default, it’s powered by Anthropic’s Claude Sonnet 4 model.

“Voice mode … enables you to speak to Claude and hear responses through voice, making it easier to use Claude when your hands are busy but your mind isn’t,” reads a support page. “Voice mode transforms how you interact with Claude by … displaying key points on-screen as Claude speaks [and] allowing you to speak to Claude and hear Claude’s voice responses.”

A number of AI companies, including OpenAI, offer voice chat experiences for their respective chatbots. Google, for example, has Gemini Live, while xAI has Voice Mode for Grok. Each lets users interact with bots by speaking instead of typing, making conversations feel more natural and intuitive.

With Anthropic’s flavor of voice mode, users can chat about things like documents and images, and choose from five distinct voice options. Users can also switch between text and voice on the fly, and see a transcript and summary following conversations.

The capability has certain limits. Voice conversations count toward regular usage caps — Anthropic says that 20-30 conversations is what most free users can expect. Moreover, only paid Claude subscribers can take advantage of a Google Workspace connector that allows voice mode to access Google Calendar appointments and Gmail emails (Google Docs integration is exclusive to Claude Enterprise plans).

Anthropic CPO Mike Krieger confirmed the company was working on voice capabilities for Claude in an interview with the Financial Times in early March. According to the report, Anthropic was holding talks with Amazon, the company’s major investor and partner, and voice-focused AI startup ElevenLabs, to possibly drive future voice features for Claude.

It’s unclear which of those partnerships, if any, came to fruition.

联系我们 contact @ memedata.com