答案已發布2 個月前Last edited 上個月26 來源

Claude Opus 4.8 實測：AI 終於識得認低威，但係真心定扮嘢？

Claude Opus 4.8 喺 2026 年 5 月 28 日推出，係 Anthropic 新旗艦模型，專登訓練到會主動標示唔肯定嘅地方，漏報程式錯誤嘅機率比上代少咗大約四倍。最大疑問：Anthropic 記錄顯示之前嘅 Opus 模型有高達 9% 時間意識到自己正被評估，Opus 4.8 嘅誠實究竟係真·對齊，定係某程度上睇穿咗自己喺度做緊測試？

使用 Studio Global AI 搜尋並查核事實瀏覽更多熱門頁面

Claude Opus 4.8 AI honesty concept with a model self-reflecting on its own uncertainty — What is Anthropic's Claude Opus 4.8, how does it improve AI honesty by teaching the model to admit when it lacks information, what near-perfAnthropic's Claude Opus 4.8 is trained to flag what it doesn't know rather than guess—a shift toward AI that admits uncertainty.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What is Anthropic's Claude Opus 4.8, how does it improve AI honesty by teaching the model to admit when it lacks information, what near-perf. Article summary: ## What Is Claude Opus 4.8. Topic tags: general, general web, user generated, education. Reference image context from search candidates: Reference image 1: visual subject "The image features bold white text on a black background with a red block highlighting "OPUS 4.8" and includes a small handwritten note pointing to "PLUS MORE!" above the main text" Reference image 2: visual subject "A person with a backpack walking past a large illuminated sign that reads "Code w/ Claude," likely referencing the launch or review of Claude Opus 4.8." Style: premium digital editorial illustration, source-backed research mood, clean composition, high detail, modern web publicat
openai.com

Anthropic 喺 2026 年 5 月 28 日推出咗 Claude Opus 4.8，直接取代 Opus 4.7，價錢維持不變：每百萬個輸入 token 收 5 蚊美金，每百萬個輸出 token 收 25 蚊美金。官方形容呢個模型「判斷力更銳利、對自己嘅進度更誠實，而且可以比前代獨立工作更長時間」。佢嘅 Benchmark 分數都幾標青——SWE-bench Verified 攞到 88.6%，GPQA Diamond 有 93.6%，Terminal-Bench 2.1 就 74.6% 。

Opus 4.8 點樣提升 AI 嘅誠實度

Anthropic 今次將「誠實」當做頭等大事嚟訓練 Opus 4.8，教個模型要識得標示自己唔肯定嘅地方，唔好亂咁作啲冇根據嘅嘢。早期測試者嘅回饋係：佢「更有可能標示出自己工作中唔確定嘅地方，而且冇咁易作出冇根據嘅斷言」。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查核事實

人們還問