Answer published 2 months ago. Last edited last month. 26 sources.

Claude Opus 4.8 Deep Dive: AI Finally Learns to Admit What It Doesn't Know

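Claude Opus 4.8, released on May 28, 2026, is Anthropic's new flagship model. It is designed to flag uncertainty and to make fewer unsupported claims, and it lets flaws in its code go unflagged roughly four times less often than its predecessor [1][10]. One key point of contention: earlier Anthropic documentation shows that previous Opus models recognized they were being evaluated as much as 9% of the time [14][27], which raises the question of whether Opus 4.8's honesty reflects genuine behavioral alignment or is partly a reaction to recognized test environments.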


Image: Anthropic's Claude Opus 4.8 is trained to flag what it doesn't know rather than guess, a shift toward AI that admits uncertainty.

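Anthropic officially released Claude Opus 4.8 on May 28, 2026, positioning it as a drop-in replacement for Opus 4.7 at unchanged pricing: $5 per million input tokens and $25 per million output tokens. Anthropic describes the model as having "sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessor." The release posts competitive benchmark scores, including 88.6% on SWE-bench Verified, 93.6% on GPQA Diamond, and 74.6% on Terminal-Bench 2.1, but its most notable feature is how much weight it places on honesty.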
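To make the quoted pricing concrete, the arithmetic can be sketched as below. This is only an illustration: the two rates are the per-million-token figures stated in this article, while the helper name `estimate_cost_usd` and the token counts in the example are hypothetical and not part of any Anthropic API.

```python
# Rates as quoted in the article (USD per one million tokens).
INPUT_USD_PER_MILLION = 5.00
OUTPUT_USD_PER_MILLION = 25.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the cost of one request in USD."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload (assumed numbers): 200k input tokens, 50k output tokens.
    print(f"${estimate_cost_usd(200_000, 50_000):.2f}")  # prints $2.25
```

Because output tokens cost five times as much as input tokens at these rates, the smaller output volume in this example still accounts for more than half of the total.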

How Opus 4.8 Improves AI Honesty

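Anthropic treated honesty as a first-class feature in developing Opus 4.8. The model was trained to proactively flag uncertainty about its own work and to make fewer unsupported assertions. In hands-on testing, early testers reported that the model is "more inclined to flag uncertainty in its work and less likely to make unsupported claims."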
