答案已发布5天前Last edited 前天36 来源

深度解析英伟达Nemotron 3 Ultra：5500亿参数的开源AI智能体“大脑”

英伟达Nemotron 3 Ultra是其史上最大开源模型，总参数5500亿，采用混合Mamba 2与注意力机制的混合专家架构，专为1M超长上下文的长期自主AI智能体工作流而生。模型在Artificial Analysis智能指数中得分48，是美国最高分的开源模型，同时在吞吐量上达到同级模型的约6倍，实现速度与智能的平衡。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

360K0

Nvidia Nemotron 3 Ultra 550B AI model concept art showing neural network visualization — What are the key details about Nvidia's release of Nemotron 3 Ultra, including its model size, architecture, performance benchmarks, availabNvidia's Nemotron 3 Ultra represents a deliberate architectural shift toward hybrid state-space models optimized for long-running agentic workloads.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What are the key details about Nvidia's release of Nemotron 3 Ultra, including its model size, architecture, performance benchmarks, availab. Article summary: Nvidia announced **Nemotron 3 Ultra** at Computex 2026 (June 1) as its largest open-weights model, built specifically for long-running AI agents [3][5]. Here is a comprehensive breakdown:. Topic tags: general, general web, user generated, academic, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Nvidia unveils Nemotron 3 Ultra: America’s smartest open-weights AI model, 30% cheaper to run. Nemotron 3 Ultra, the new flagship AI model features 500-550 billion parameters. Nv" source context "Nvidia unveils Nemotron 3 Ultra: America's smartest open-weights ..." Reference image 2: visual subject "Nemotron 3 Ultra la
openai.com

当黄仁勋在2026年6月1日的Computex（台北国际电脑展）上发表主题演讲时，他带来的并非又一个简单的大语言模型。英伟达的Nemotron 3 Ultra，是对企业级AI未来发展方向——即那些能规划、推理、使用工具，并能持续数小时甚至数天完成复杂工作流的自主智能体——的一次深思熟虑的架构赌博。该模型总参数规模达5500亿，通过激进的混合专家（MoE）稀疏化技术，每个Token仅激活550亿参数。这既是关于原始智能的宣言，更是关于推理经济学的精妙算账。

作为英伟达迄今为止最大的开源权重模型，Nemotron 3 Ultra是Nemotron 3家族的旗舰。该家族始于2025年12月的90亿参数Nano，发展至2026年3月的490亿参数Super 。Ultra的加入完善了这一产品线，其设计目标明确，不针对消费级聊天机器人，而是从零开始，为自主智能体工作流中所需的编排和繁重推理任务而生。

Nemotron 3 Ultra架构的独到之处

Nemotron 3 Ultra的架构选择，是英伟达与传统大语言模型设计思路分歧最大的地方。它没有采用传统的密集Transformer，而是使用了混合式隐式混合专家（LatentMoE）架构，将Mamba-2状态空间模型层与混合专家层及少量标准注意力层交织在一起。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问