Answer published 3 days ago · Last edited 3 days ago · 29 sources

Google Announces Gemini 3.5 Live Translate: How the Real-Time Speech Translation Model Works and Where You Can Use It

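Google has announced Gemini 3.5 Live Translate, a new AI audio model that performs speech-to-speech translation in near real time. It is rolling out gradually across the Google Translate app, the Gemini Live API, and Google AI Studio. Unlike traditional sentence-by-sentence translation tools, the model begins producing a translation as soon as you start speaking, preserves the original speaker's intonation, pace, and pitch, and delivers the translated audio as a continuous stream.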


Image: Abstract visual representing Google Gemini 3.5 Live Translate's real-time speech-to-speech translation across 70+ languages. Google's Gemini 3.5 Live Translate model brings near real-time speech translation to Google Translate and the Gemini API.

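On June 9, 2026, Google publicly released its most advanced audio translation model to date: Gemini 3.5 Live Translate. This specialized model, whose developer identifier is gemini-3.5-live-translate-preview, is designed for one core task: near-real-time, streaming speech-to-speech translation. It supports more than 70 languages and more than 2,000 language combinations. For users long accustomed to the traditional pattern of pressing a button, pausing, and then playing back the result, it is a fundamental change in experience.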
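As a quick sanity check on how "more than 70 languages" relates to "more than 2,000 language combinations", the arithmetic can be sketched as follows. This assumes exactly 70 languages (the article only gives a lower bound) and is not a statement of how Google counts combinations. It only shows that counting each language pair once gives a figure consistent with the stated number.

```python
import math

LANGUAGES = 70  # assumption: the article only says "more than 70"

# Each pair counted once (A<->B), regardless of translation direction.
unordered_pairs = math.comb(LANGUAGES, 2)

# Each direction counted separately (A->B and B->A).
directed_pairs = LANGUAGES * (LANGUAGES - 1)

print(unordered_pairs, directed_pairs)
```

With 70 languages, the once-per-pair count already exceeds 2,000, so the two figures in the announcement are mutually consistent under that reading.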

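Gemini 3.5 Live Translate no longer needs to wait for you to finish a whole sentence before it starts translating. Instead, it begins processing and generating translated audio within a few seconds, with the goal of keeping the conversation fluid and natural. The public release also marks the culmination of a phased rollout that ran from an early demonstration in December 2025 through continued refinement via the Gemini 3.1 flash live model in spring 2026.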

Breaking Down the Core Technology of Real-Time Speech Translation: How Does It Translate While Listening?

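The central innovation of Gemini 3.5 Live Translate is its continuous, bidirectional streaming architecture. This differs sharply from traditional turn-based translation systems and relies on several key techniques working together.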

Simultaneous Translation and Listening

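The model does not wait for the speaker to finish. It receives streaming audio and incrementally generates translated speech output at the same time. Google describes this as staying "just a few seconds behind the speaker," which eliminates the awkward pauses that can break the natural rhythm of a conversation.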
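The listen-while-translating pattern can be illustrated with a minimal sketch, assuming a stand-in translator rather than the real model. Nothing here calls the Gemini Live API: `microphone`, `fake_translator`, `speaker`, and `run_session` are hypothetical names, and the "translation" is just a tagged copy of each input chunk. The point is only the control flow, in which input and output run concurrently instead of taking turns.

```python
import asyncio


async def microphone(chunks, outgoing):
    """Feed source-language chunks into the stream as they are 'spoken'."""
    for chunk in chunks:
        await outgoing.put(chunk)
        await asyncio.sleep(0)  # yield so downstream tasks can run between chunks
    await outgoing.put(None)  # end-of-speech sentinel


async def fake_translator(incoming, translated, events):
    """Hypothetical stand-in for the model: emits output per chunk, not per sentence."""
    while (chunk := await incoming.get()) is not None:
        events.append(("heard", chunk))
        await translated.put(f"<translated:{chunk}>")
    await translated.put(None)


async def speaker(translated, events):
    """Play back translated pieces as soon as they arrive."""
    while (piece := await translated.get()) is not None:
        events.append(("spoke", piece))


async def run_session(chunks):
    """Run all three stages concurrently and return the interleaved event log."""
    outgoing, translated = asyncio.Queue(), asyncio.Queue()
    events = []
    await asyncio.gather(
        microphone(chunks, outgoing),
        fake_translator(outgoing, translated, events),
        speaker(translated, events),
    )
    return events


if __name__ == "__main__":
    for kind, payload in asyncio.run(run_session(["Hello", "how", "are", "you"])):
        print(kind, payload)
```

In the resulting event log, the first "spoke" entry appears before the last "heard" entry, which is the property the article describes: output begins while input is still arriving. A turn-based system would instead log every "heard" event before any "spoke" event.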
