答案已发布4天前Last edited 3天前29 来源

谷歌 Gemini 3.5 Live Translate 发布：70+语言实时互译，告别尴尬停顿

谷歌 Gemini 3.5 Live Translate 是一款全新的 AI 音频模型，可以在超过 70 种语言之间进行近乎实时的语音到语音翻译，目前已在 Google 翻译 App、Gemini Live API 和 Google AI Studio 中上线。与传统需要等说话人讲完再翻译的“回合制”系统不同，该模型能在说话人一开口就开始翻译，同时保留其原本的语调、语速和音高，并持续输出音频翻译。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

144K0

Abstract visual representing Google Gemini 3.5 Live Translate's real-time speech-to-speech translation across 70+ languages. — What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, anGoogle's Gemini 3.5 Live Translate model brings near real-time speech translation to Google Translate and the Gemini API.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, an. Article summary: ## What It Is. Topic tags: general, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Google Launches Gemini 3.5 Live Translate in 70 Languages. Google has launched Gemini 3.5 Live Translate, a new AI powered speech translation model that enables near real time co" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Reference image 2: visual subject "Google Patches Chrome Zero Day Vulnerability Under Attack" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Style: premium digital editorial illustration, source-backed research mood, clean compositio
openai.com

2026 年 6 月 9 日，谷歌正式发布了其迄今最强悍的音频翻译模型——Gemini 3.5 Live Translate。这款专为实时语音翻译打造的模型（开发者调用的模型代码是 gemini-3.5-live-translate-preview），核心任务只有一个：在近实时的情况下，将一段说话声直接变成另一段不同语言的说话声。它支持超过 70 种语言、超过 2000 个语言对的互译，对大多数人所习惯的“你一句、我一句，等译完再回复”的传统翻译模式来说，这是一次彻底的颠覆。

你不用再等一句话说完才开始翻译，Gemini 3.5 Live Translate 会立刻“听懂”并“说”出来，尽量让对话始终像母语聊天一样自然。这次公开亮相，其实是谷歌从 2025 年 12 月首次对外演示以来的一次最终完善，期间还经历了 2026 年春天推出 Gemini 3.1 flash live 模型的迭代打磨。

实时语音翻译到底是怎么工作的？

Gemini 3.5 Live Translate 的核心创新在于它的连续的、双向的流式翻译架构。这和传统的“回合制”翻译系统很不一样，它靠几个关键能力相互配合来实现。

边听边译，几乎同步

这个模型不会等人把一句话说完整再动手，而是一边接收音频，一边就产生出翻译好的语音。谷歌自己的说法是，它能“只比每个说话人慢个一两秒”，这样一来，那些让对话变得尴尬的冷场就基本被消灭了。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问