答案已發布3 天前Last edited 3 天前29 來源

Google Gemini 3.5 Live Translate 實測：似隨身繙譯官唔使等佢講完先開波

Google 推出 Gemini 3.5 Live Translate，係一個支援超過 70 種語言、近乎實時嘅語音翻譯 AI 模型，已經喺 Google 翻譯 App、Gemini Live API 同 Google Meet 逐步推出。同舊時要等成句講完先開始翻譯唔同，呢個新模型你一開口佢就即刻開工，仲可以保留你原本講說話嗰陣嘅聲調、語速同語氣，令對話更自然流暢。

使用 Studio Global AI 搜尋並查核事實瀏覽更多熱門頁面

144K0

Abstract visual representing Google Gemini 3.5 Live Translate's real-time speech-to-speech translation across 70+ languages. — What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, anGoogle's Gemini 3.5 Live Translate model brings near real-time speech translation to Google Translate and the Gemini API.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, an. Article summary: ## What It Is. Topic tags: general, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Google Launches Gemini 3.5 Live Translate in 70 Languages. Google has launched Gemini 3.5 Live Translate, a new AI powered speech translation model that enables near real time co" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Reference image 2: visual subject "Google Patches Chrome Zero Day Vulnerability Under Attack" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Style: premium digital editorial illustration, source-backed research mood, clean compositio
openai.com

2026 年 6 月 9 日，Google 向公眾發布咗佢哋最先進嘅音頻翻譯模型：Gemini 3.5 Live Translate。呢個代號為 gemini-3.5-live-translate-preview 嘅模型，專門為一個核心任務而設——就係近乎實時、串流式嘅語音對語音翻譯。佢支援超過 70 種語言，以及超過 2,000 個語言組合。

同大家用開嗰啲「一問一答」式翻譯系統唔同，Gemini 3.5 Live Translate 唔需要等講嘢嘅人成句講晒先開始翻譯，而係喺幾秒之內就處理好音頻，開始生成翻譯，保持對話好似流水咁順暢自然。呢一次公開發布，正式標誌住呢個由 2025 年 12 月開始展示，並且喺 2026 年春天經過 Gemini 3.1 flash live 模型不斷改良嘅階段性成果。

串流式語音對語音翻譯係點樣運作？

Gemini 3.5 Live Translate 最核心嘅創新，就係佢嗰套 連續、雙向嘅串流架構。呢個設計同傳統逐句翻譯嘅系統好唔同，要靠幾個關鍵技術一齊配合先做到。

同聲傳譯，雙管齊下

呢個模型唔會等講嘢嘅人收聲先開工。佢會一邊接收音頻輸入，一邊同步生成翻譯輸出。用 Google 自己嘅講法，佢只係「慢講嘢嘅人幾秒鐘」，咁做就解決咗以前翻譯時會出現嗰啲令人尷尬嘅停頓。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查核事實

人們還問