答案已發布2 個月前Last edited 上個月16 來源

Cursor Composer 2.5：效能、價格與對抗 Claude Opus 4.7、GPT‑5.5 的策略

Cursor 於 2026 年 5 月 18 日推出 Composer 2.5，在 SWE‑Bench Multilingual 取得 79.8% 成績，接近 Claude Opus 4.7 的 80.5%，並高於 GPT‑5.5 的 77.8%。[3][4] 模型專為 AI 編程代理設計，能在 IDE 內處理長時間任務，例如跨檔案修改、終端指令、測試迭代等開發流程。[19] 標準版定價僅約每百萬 input token $0.50、output token $2.50，大幅低於部分前沿模型，令長時間 AI 編程任務成本明顯下降。[4][18]

使用 Studio Global AI 搜尋並查核事實瀏覽更多熱門頁面

Illustration representing Cursor Composer 2.5 competing with other frontier AI coding models — Cursor Composer 2.5: Benchmarks, Pricing, and How It Stacks Up to Claude Opus 4.7 and GPT‑5.5Cursor’s Composer 2.5 aims to deliver frontier‑level coding performance while dramatically lowering the cost of running AI coding agents.
AI 提示
Create a landscape editorial hero image for this Studio Global article: Cursor Composer 2.5: Benchmarks, Pricing, and How It Stacks Up to Claude Opus 4.7 and GPT‑5.5. Article summary: Cursor’s Composer 2.5 is an in‑house coding model released May 18, 2026 that scores about 79.8% on SWE‑Bench Multilingual and 69.3% on Terminal‑Bench 2.0—roughly matching Claude Opus 4.7 on some benchmarks while costi.... Topic tags: cursor, ai coding, developer tools, ai models, benchmarks. Reference image context from search candidates: Reference image 1: visual subject "Composer 2.5 matches Opus 4.7 and GPT-5.5 on CursorBench 3.1 but costs less than a dollar per task - compared to up to eleven dollars for the competition. | Image: Cursor" source context "Cursor's Composer 2.5 matches Opus 4.7 and GPT-5.5 benchmarks ..." Reference image 2: visual subject "Composer 2.5 vs Opus | The Results Are Brutal Merv
openai.com

Cursor（由 Anysphere 開發）的 Composer 2.5 是一個專門為程式開發流程打造的 AI 模型，於 2026 年 5 月 18 日正式推出，並直接整合在 Cursor IDE 裡面。它的目標不是單純生成幾行程式碼，而是支援整個 AI 輔助軟件工程 workflow，例如搜尋大型 repository、同時修改多個檔案、執行 terminal 指令，以及反覆測試和除錯。

這次發布之所以引起開發者關注，主要有兩個原因：

在部分程式 benchmark 上接近甚至追上前沿模型
token 價格比競爭對手低很多，對長時間 AI 編程代理特別有利

專為「AI 編程代理」而設

Composer 系列模型的設計理念是 agentic software engineering。意思是 AI 不只負責回答問題，而是可以自己完成多步驟開發流程，例如：

分析整個 codebase
制定修改計劃
同時編輯多個檔案
編譯程式並執行測試
根據錯誤結果再修正

Cursor 表示，與上一代相比，Composer 2.5 在以下方面有明顯改善：

更穩定處理 長時間任務
更可靠遵循 複雜指令
在 IDE 協作過程中表現更自然

這其實反映了 AI 編程工具的一個大趨勢：由「自動補全」逐步轉向 可以長時間工作的開發代理（coding agents）。

跑分表現：與 Opus 4.7、GPT‑5.5 同級？

Cursor 公布的 benchmark 顯示，Composer 2.5 在多個軟件工程測試中已經接近目前頂級模型。

主要分數包括：

SWE‑Bench Multilingual：79.8%（Composer 2.5）、80.5%（Claude Opus 4.7）、77.8%（GPT‑5.5）

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查核事實

人們還問