报告已发布2个月前Last edited 上个月24 来源

Claude Opus 4.8 正式发布：全方位解读 Anthropic 的最新旗舰模型

性能对决：Claude Opus 4.8 在 SWE Bench Pro 智能体编程测试中获得 69.2% 的成绩，领先于 GPT 5.5（58.6%）和 Gemini 3.1 Pro（54.2%），但在终端基准测试中 GPT 5.5 仍占优势。定价策略：标准 API 价格维持前代水平，为每百万输入 tokens 5 美元、输出 25 美元。新增快速模式定价为输入 10 美元/输出 50 美元，速度约提升 2.5 倍，且比前代快速模式便宜约三倍。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

Conceptual illustration of Claude Opus 4.8's launch, symbolizing AI performance benchmarks and agentic coding capabilities. — What were the key details of Anthropic's Claude Opus 4.8 launch on May 28, 2026, including its benchmark performance against OpenAI's GPT 5Anthropic launched Claude Opus 4.8 with significant improvements in agentic coding and model reliability. Image: AI-generated.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What were the key details of Anthropic's Claude Opus 4.8 launch on May 28, 2026, including its benchmark performance against OpenAI's GPT 5.. Article summary: **Launch positioning:** Claude Opus 4.8 was described as outperforming Opus 4.7 across most major benchmarks and beating GPT-5.5 and Gemini 3.1 Pro in several categories.. Topic tags: deepresearch, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "Anthropic released Claude Opus 4.8 today, and it outperforms its predecessor across most major benchmarks while beating OpenAI’s GPT-5.5 and Google’s Gemini 3.1 Pro in several key" source context "Anthropic Just Dropped Claude Opus 4.8. - The VC Corner" Reference image 2: visual subject "Anthropic released Claude Opus 4.8 today, and it outper
openai.com

Anthropic 于 2026 年 5 月 28 日正式公开发布了其最新的旗舰模型 Claude Opus 4.8，这也是该公司目前能力最强的通用模型。这次更新直指编码、长时间运行的智能体任务以及企业级可靠性。。

基准测试：强者对决

在新一轮的 AI 模型比拼中，最受关注的是 SWE-Bench Pro 智能体编程基准测试。根据 Anthropic 的评估数据，Claude Opus 4.8 取得了 69.2% 的领先成绩，而它的前代 Opus 4.7 为 64.3%，OpenAI 的 GPT-5.5 为 58.6%，Google 的 Gemini 3.1 Pro 为 54.2% 。

当然，没有哪个模型能在所有项目上独占鳌头。在更广泛的智能体编码套件中，GPT-5.5 在一些特定领域仍保持领先。例如，在 Terminal-Bench 2.1 终端编码评估中，GPT-5.5 以 78.2% 的得分领先于 Opus 4.8 的 74.6% 和 Gemini 3.1 Pro 的 70.3% 。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问