レポート公開済み2 か月前Last edited 先月24 ソース

AnthropicがClaude Opus 4.8をリリース ― ライバルを凌駕するベンチマークと開発者向け新機能

SWE Bench Pro（自律型コーディング）で69.2%を達成し、GPT 5.5（58.6%）やGemini 3.1 Pro（54.2%）を大きく引き離す。ただしターミナル操作系のベンチマークではGPT 5.5が依然リード。標準API料金は据え置き（入力100万トークンあたり5ドル、出力同25ドル）。新たに追加された高速モードは2.5倍の速度で、入力10ドル/出力50ドルと従来世代の高速モードより約3分の1の価格に。

Studio Global AIで検索して事実確認さらにトレンドページを見る

Conceptual illustration of Claude Opus 4.8's launch, symbolizing AI performance benchmarks and agentic coding capabilities. — What were the key details of Anthropic's Claude Opus 4.8 launch on May 28, 2026, including its benchmark performance against OpenAI's GPT 5Anthropic launched Claude Opus 4.8 with significant improvements in agentic coding and model reliability. Image: AI-generated.
AI プロンプト
Create a landscape editorial hero image for this Studio Global article: What were the key details of Anthropic's Claude Opus 4.8 launch on May 28, 2026, including its benchmark performance against OpenAI's GPT 5.. Article summary: **Launch positioning:** Claude Opus 4.8 was described as outperforming Opus 4.7 across most major benchmarks and beating GPT-5.5 and Gemini 3.1 Pro in several categories.. Topic tags: deepresearch, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "Anthropic released Claude Opus 4.8 today, and it outperforms its predecessor across most major benchmarks while beating OpenAI’s GPT-5.5 and Google’s Gemini 3.1 Pro in several key" source context "Anthropic Just Dropped Claude Opus 4.8. - The VC Corner" Reference image 2: visual subject "Anthropic released Claude Opus 4.8 today, and it outper
openai.com

Anthropicは2026年5月28日、最新のフラッグシップAIモデル「Claude Opus 4.8」を一般公開した。前世代のOpus 4.7の直接的な後継にあたり、特にコーディング、長時間のエージェントタスク（自律的処理）、エンタープライズ向けの信頼性を強化している。基本となるAPI利用料金は据え置かれた一方、大幅に値下げされた高速モードや、新たなワークフローツールも同時に提供が開始されている。

主要ベンチマークでライバルをリード

今回最も注目を集めているのが、自律型コーディングのベンチマーク「SWE-Bench Pro」での結果だ。Anthropicの評価データによると、Opus 4.8は**69.2%**を記録。これは前世代Opus 4.7の64.3%を上回るだけでなく、OpenAIのGPT-5.5（58.6%）やGoogleのGemini 3.1 Pro（54.2%）に対して、10ポイント以上の大差をつける結果となった。

ただし、全てのテストでトップに立ったわけではない。ターミナル上での自律的なコーディング能力を測る「Terminal-Bench 2.1」では、GPT-5.5が78.2%と首位を守り、Opus 4.8は74.6%でこれに次いだ。もっとも、このスコアはOpus 4.7の66.1%からは大幅な改善となる。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AIで検索して事実確認

人々も尋ねます