答案已发布6天前Last edited 5天前26 来源

谷歌 Gemini Live 新增实时图像生成与编辑功能：工作原理、技术内核及 I/O 2026 全景解读

谷歌为 Gemini Live 引入了实时的图像生成与编辑功能，用户在语音对话中开启摄像头共享后，即可用自然语言直接创作或修改图像 [8]。在安卓端，用户可通过 Gemini Live 的摄像头共享流程展示拍摄对象，并发出图像生成或编辑指令；iOS 端具备同样的核心能力，底层模型均为 Gemini 2.5 Flash Image（内部代号 nano banana）[2][8]。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

302K0

Smartphone displaying Google Gemini Live interface with camera view and AI image generation overlay — What real-time image generation and editing capability has Google added to Gemini Live, how does it work on Android and iOS, what technologyGemini Live now lets users point their camera and ask the AI to generate or edit images in real time. Image: AI-generated illustration for Studio Global Trending.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What real-time image generation and editing capability has Google added to Gemini Live, how does it work on Android and iOS, what technology. Article summary: ## What Google Added to Gemini Live. Topic tags: general, documentation, general web, user generated, education. Reference image context from search candidates: Reference image 1: visual subject "Google has begun the deployment of Gemini's innovative real-time AI video functionalities, enabling the platform to interpret visual input from a user's device" source context "Google's Gemini update that can tell you live what it sees through your camera is now rolling out - PhoneArena" Reference image 2: visual subject "Smartphones must have user-replaceable batteries by 2027. But not your iPhone. Here's why" source context "Google's Gemini update that can tell you l
openai.com

谷歌为 Gemini Live 注入了什么新能力

谷歌已在 Gemini Live 中推出了实时图像生成与编辑功能，这进一步丰富了该对话式 AI 模式的体验。

在 Live 语音对话过程中，用户只需激活摄像头共享模式，将手机对准想要拍摄的对象，然后用自然语言下达指令，Gemini 就能现场生成或修改图像。最终生成或编辑好的图片会直接呈现在 Live 对话的界面中，整个过程无需跳出当前对话。

一句话总结：你可以像和朋友视频通话一样，让 AI 实时“看见”你眼前的画面，再按你的想法当场“画”出或“改”出一张新图。

安卓与 iOS 上的具体操作方式

Android 端：在 Gemini 应用中启动 Live 对话后，通过摄像头共享功能，让 Gemini 看到你的拍摄对象，然后直接用自然语言告诉它你想要生成或修改成什么样的图像。Google 官方帮助文档也明确指出，用户可以在 Gemini Live 中根据摄像头捕捉到的内容直接生成图像。安卓端的底层图像生成能力由 Gemini 2.5 Flash Image（内部代号 “nano-banana”）提供。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问