答案已發布5 天前Last edited 前天36 個來源

輝達 Nemotron 3 Ultra：一款專為自主代理人設計的 550B 參數開放模型

輝達 Nemotron 3 Ultra 是一款總參數高達 5500 億的開放權重混合專家模型，採用獨特的 Mamba 2 與注意力機制混合架構，專為長時間運行的自主 AI 代理人打造，具備高達 100 萬 token 的上下文長度。該模型在人工分析智慧指數中獲得 48 分，是得分最高的美國開放權重模型；在 NVFP4 量化格式下，推理速度最高可達前代產品的 5 倍，並能在維持同等準確度的同時，大幅降低複雜代理任務的總成本達 30%。

使用 Studio Global AI 搜尋並查證事實瀏覽更多熱門頁面

360K0

Nvidia Nemotron 3 Ultra 550B AI model concept art showing neural network visualization — What are the key details about Nvidia's release of Nemotron 3 Ultra, including its model size, architecture, performance benchmarks, availabNvidia's Nemotron 3 Ultra represents a deliberate architectural shift toward hybrid state-space models optimized for long-running agentic workloads.
AI 提示詞
Create a landscape editorial hero image for this Studio Global article: What are the key details about Nvidia's release of Nemotron 3 Ultra, including its model size, architecture, performance benchmarks, availab. Article summary: Nvidia announced **Nemotron 3 Ultra** at Computex 2026 (June 1) as its largest open-weights model, built specifically for long-running AI agents [3][5]. Here is a comprehensive breakdown:. Topic tags: general, general web, user generated, academic, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Nvidia unveils Nemotron 3 Ultra: America’s smartest open-weights AI model, 30% cheaper to run. Nemotron 3 Ultra, the new flagship AI model features 500-550 billion parameters. Nv" source context "Nvidia unveils Nemotron 3 Ultra: America's smartest open-weights ..." Reference image 2: visual subject "Nemotron 3 Ultra la
openai.com

當黃仁勳在 2026 年 6 月 1 日的台北國際電腦展（Computex）上登台時，他宣布的不只是另一款大型語言模型。輝達（NVIDIA）的 Nemotron 3 Ultra 是一場刻意的架構賭注，押注在企業 AI 的未來走向：能規劃、推理、使用工具，並在數小時甚至數天內持續處理複雜工作流程的自主代理人。這款模型擁有 5500 億個總參數，並透過積極的混合專家（Mixture-of-Experts, MoE）稀疏化技術，將每個 token 的活躍參數壓在 550 億個；它既是對原始智慧的宣示，也是對推理經濟學的表態。

這是輝達迄今為止最大的開放權重模型，也是 Nemotron 3 家族三層架構中的旗艦。這個家族始於 2025 年 12 月推出的 90 億參數的 Nano，並在 2026 年 3 月接續推出 490 億參數的 Super 。Ultra 的設計明確不是為了消費級聊天機器人，而是從頭開始為自主代理工作流程中所需的編排與重度推理呼叫而建構。

Nemotron 3 Ultra 的架構有何不同？

Nemotron 3 Ultra 的架構選擇，正是輝達與標準大型語言模型設計最顯著的分歧點。相較於傳統的密集Transformer，這款模型採用了混合潛在混合專家（LatentMoE）架構，將 Mamba-2 狀態空間模型層與混合專家層及少量的標準注意力層交錯結合。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查證事實

大家也會問