답변게시됨4일 전Last edited 4일 전29 소스

구글 제미나이 3.5 라이브 번역: 말하는 즉시 통역하는 AI, 어디서 어떻게 쓸 수 있을까

구글의 ‘제미나이 3.5 라이브 번역’은 70개 이상의 언어를 거의 실시간으로 음성 대 음성 통역하는 새로운 AI 오디오 모델로, 번역 앱, 제미나이 라이브 API, 구글 AI 스튜디오에서 전 세계 동시 출시되었습니다. 기존의 문장 단위로 끊어 번역하는 방식과 달리, 이 모델은 상대방이 말을 시작하는 즉시 번역을 시작하며 원래 화자의 억양, 속도, 높낮이를 보존한 채 매끄럽게 통역된 음성을 스트리밍합니다.

Studio Global AI로 검색 및 팩트체크 인기 페이지 더 보기

144K0

Abstract visual representing Google Gemini 3.5 Live Translate's real-time speech-to-speech translation across 70+ languages. — What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, anGoogle's Gemini 3.5 Live Translate model brings near real-time speech translation to Google Translate and the Gemini API.
AI 프롬프트
Create a landscape editorial hero image for this Studio Global article: What is Google's Gemini 3.5 Live Translate model, how does its streaming speech-to-speech translation work across more than 70 languages, an. Article summary: ## What It Is. Topic tags: general, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Google Launches Gemini 3.5 Live Translate in 70 Languages. Google has launched Gemini 3.5 Live Translate, a new AI powered speech translation model that enables near real time co" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Reference image 2: visual subject "Google Patches Chrome Zero Day Vulnerability Under Attack" source context "Google Launches Gemini 3.5 Live Translate in 70 ..." Style: premium digital editorial illustration, source-backed research mood, clean compositio
openai.com

2026년 6월 9일, 구글이 자사의 가장 진보된 실시간 오디오 번역 모델인 **‘제미나이 3.5 라이브 번역(Gemini 3.5 Live Translate)’**을 대중에게 공개했습니다. gemini-3.5-live-translate-preview라는 개발자 코드로 명명된 이 특화 모델은 스트리밍 방식의 음성-대-음성 번역이라는 단 하나의 핵심 과업을 위해 설계되었습니다. 70개 이상의 언어와 2000개 이상의 언어 쌍을 지원하며, 대부분의 사용자가 경험하던 끊어 읽기 방식의 번역 시스템에서 완전히 탈피한 기술을 선보입니다 .

이 모델은 화자가 한 문장을 모두 마칠 때까지 기다렸다가 번역을 시작하는 것이 아니라, 음성 입력이 시작된 지 몇 초 만에 번역 오디오를 실시간으로 처리하고 생성하기 시작합니다. 이는 마치 유려한 자연스러운 대화를 가능하게 하는 ‘동시 통역사’와 같습니다. 이번 공개는 2025년 12월 첫 시연을 거쳐, 2026년 봄 제미나이 3.1 플래시 라이브 모델을 통해 다듬어 온 단계적 출시의 정점에 해당합니다 .

말하는 즉시 통역하는 실시간 스트리밍, 그 원리

제미나이 3.5 라이브 번역의 핵심 혁신은 지속적인 쌍방향 스트리밍 구조입니다. 이는 전통적인 순차 번역 시스템과의 결정적 차별점이며, 몇 가지 핵심 기술이 유기적으로 결합되어 작동합니다.

듣는 동시에 시작되는 통역

모델은 화자가 말을 끝낼 때까지 기다리지 않습니다. 오디오를 입력받는 동시에 통역된 결과물을 점진적으로 출력합니다. 구글은 이를 두고 “화자보다 불과 몇 초 뒤처진 상태”를 유지한다고 설명하는데, 이는 자연스러운 대화의 흐름을 깨는 어색한 멈춤을 제거합니다 .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AI로 검색 및 팩트체크

사람들은 또한 묻습니다.