答案已發布3 天前Last edited 3 天前32 個來源

被「綁手綁腳」的最強 AI：為何資安專家集體撻伐 Anthropic 的 Claude Fable 5？

資安研究員連想讀一篇部落格都被擋：Claude Fable 5 的內容過濾器極度敏感，對所有「擦邊」網路安全查詢一律斷然拒絕，或悄悄降級成舊版模型回應，嚴重影響合法用途。爭論核心在於「無聲降級」：請求一旦被標記，Fable 5 會在用戶毫無察覺下，將問題導向功能較弱的 Claude Opus 4.8，而這項機制竟被藏在長達 319 頁的系統說明文件中。

使用 Studio Global AI 搜尋並查證事實瀏覽更多熱門頁面

27K0

A conceptual illustration of a locked digital shield representing AI safety guardrails, with glowing data streams being filtered and diverted, set against a dark cybersecurity-them — What is causing cybersecurity professionals to criticize Anthropic's Claude Fable 5, and how does the model's safety guardrail system work,Anthropic's Claude Fable 5 uses aggressive, silent guardrails to keep its most powerful capabilities out of public hands, a move that has sparked intense debate in the cybersecurity community.
AI 提示詞
Create a landscape editorial hero image for this Studio Global article: What is causing cybersecurity professionals to criticize Anthropic's Claude Fable 5, and how does the model's safety guardrail system work,. Article summary: Anthropic released Claude Fable 5 on June 9, 2026 as a guardrailed public version of its powerful Mythos-class model, alongside an unrestricted twin, Claude Mythos 5, available only to vetted partners through Project Gla. Topic tags: general, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "# Claude Fable 5: Why Anthropic Put Its Most Powerful AI Behind Guardrails. * Anthropic released Claude Fable 5 on 9 June 2026. It is the first publicly available Mythos-class mode" source context "Claude Fable 5: Anthropic Locks Down Cyber and Bio" Reference image 2: visual subject "# Anthropic says these topics
openai.com

2026 年 6 月 9 日，Anthropic 推出了它口中「迄今最強」的公開版模型 Claude Fable 5。然而，這家總是把「負責任」掛在嘴邊的公司，這回卻踢到了鐵板——來自資安社群的猛烈砲火，幾乎瞬間淹沒了新品發布的掌聲。

批評者指出，問題不在於模型有安全機制，而在於這套機制實在太超過，甚至到了「荒謬」的地步。Fable 5 對內容的過濾機制不僅廣泛，還會在用戶毫不知情下，偷偷派一個較弱的人工智慧（AI）來回答你的問題。這場從「安全性」燒到「公平性」的爭議，背後究竟藏著什麼樣的技術細節與產業算計？以下是完整拆解。

資安圈的怒吼：連看個部落格都不行？

對許多資安專家來說，Fable 5 的日常使用體驗是一場災難。

來自 IBM X-Force 的知名安全研究員 Valentina “Chompie” Palmiotti 對《TechCrunch》直言不諱：「（Fable 5）會拒絕任何跟網路安全擦得上邊的請求，就算是『讀一篇部落格文章』這種完全無害的任務也一樣。」換句話說，使用者只是想理解基本的資安概念，都可能被系統視為心懷不軌。

更讓專業人士跳腳的是，Fable 5 在擋下這些問題後，並不會直接告訴你「這題不能答」，而是在背後偷偷切換成較舊、能力較差的 Claude Opus 4.8 來產生回應。對使用者而言，唯一能察覺到的跡象，就是原本聰明絕頂的 AI 突然間回答變得驢頭不對馬嘴，彷彿智商瞬間歸零。

這種「無聲降級」之所以讓外界大呼離譜，是因為它的運作機制竟被藏在了一份長達 319 頁的系統說明文件深處。若非刻意翻找，一般使用者根本無從得知自己被餵了一個次級品。這也讓 Anphropic 被網路上許多開發者貼上了「祕密破壞自家模型能力」的標籤。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查證事實

大家也會問