報告已發布3 個月前Last edited 2 個月前19 個來源

Claude Opus 4.7 實力查核：強在 coding 與 agents，但還不能直接稱全市場第一

Claude Opus 4.7 屬於廣泛可用前沿模型第一梯隊，強在 coding、長流程 agents 與視覺任務；它支援 1M context / 128k 輸出，SWE bench Verified 轉述分數為 87.6%，但公開證據仍不足以證明它是全市場第一。[1][9][14][15] 最大實務升級包括 adaptive thinking、xhigh effort、task budgets beta 與高解析度影像；最大代價是新 tokenizer 可能讓文字 token 使用增加最多約 35%。[1] 最安全的用法不是只看官方跑分，而是把 Opus 4.7 放進自己的 coding / agent 評測集，同時量成功...

使用 Studio Global AI 搜尋並查證事實瀏覽更多熱門頁面

Claude Opus 4.7 實力查核示意圖，呈現 AI 模型、程式碼與 benchmark 分析元素 — Claude Opus 4.7 實力查核：1M 上下文、87.6% SWE-bench，但還不能稱全市場第一AI 生成的編輯示意圖；非 Anthropic 官方 benchmark 圖表。
AI 提示詞
Create a landscape editorial hero image for this Studio Global article: Claude Opus 4.7 實力查核：1M 上下文、87.6% SWE-bench，但還不能稱全市場第一. Article summary: Claude Opus 4.7 很強，尤其適合 coding、長流程 agents、專業工作與視覺任務；它支援 1M context、128k 最大輸出，AWS 與 benchmark 解讀轉述的 SWE bench Verified 成績為 87.6%，但公開證據仍不足以證明它已獨立成為全市場第一。[1][9][14]. Topic tags: ai, anthropic, claude, llm benchmarks, ai agents. Reference image context from search candidates: Reference image 1: visual subject "幾個值得關注的數據點： Agentic coding（SWE-bench Verified）拿到87.6%，目前同場最高。Agentic computer use 78.0%、scaled tool use 77.3%，也都排在第一。" source context "Claude Opus 4.7 發布附上跟主流模型的 benchmark 對比。幾個值得關注的數據點： Agentic coding（SWE-bench Verified）拿到 87.6%，目前同場最高。Agentic computer" Reference image 2: visual subject "[Skip to main content](https://www.anthropic.com/claude/opus#main-content)[Skip to footer](https://www.anthropic.com/claude/opus#footer). ![Image 1: Claude
openai.com

Claude Opus 4.7 的重點，不是某個單一跑分，而是 Anthropic 把 Opus 線推向更長上下文、更可控的 agent 執行、更高解析度視覺，以及更強的軟體工程任務。Anthropic 文件、產品頁與 AWS 上線文都把它放在 coding、long-running agents、professional work 與多步任務的高階位置。

但「很強」不等於「已被證明全市場第一」。目前公開資料能支持的穩健判斷是：Claude Opus 4.7 在 coding 與 agentic tasks 上非常有競爭力；但關鍵分數多來自 Anthropic、AWS 轉述、合作夥伴內部評測或 benchmark 解讀，還不足以構成獨立、可重現的全市場總排名。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查證事實

大家也會問