답변게시됨3일 전Last edited 3일 전32 소스

클로드 페이블 5, 너무 안전해서 문제? 사이버 보안 업계가 비판하는 이유

사이버 보안 연구자들은 앤트로픽의 ‘클로드 페이블 5’가 무해한 보안 관련 질문까지 공격적으로 차단하고, 사용자 모르게 성능이 낮은 구형 AI로 몰래 전환한다고 비판한다. 핵심 논란은 사이버 보안, 생물학, 화학, AI 증류 요청을 구형 ‘클로드 오푸스 4.8’로 돌려보내는 시스템이 319페이지 분량의 기술 문서에 숨겨져 있었다는 점이다.

Studio Global AI로 검색 및 팩트체크 인기 페이지 더 보기

39K0

A conceptual illustration of a locked digital shield representing AI safety guardrails, with glowing data streams being filtered and diverted, set against a dark cybersecurity-them — What is causing cybersecurity professionals to criticize Anthropic's Claude Fable 5, and how does the model's safety guardrail system work,Anthropic's Claude Fable 5 uses aggressive, silent guardrails to keep its most powerful capabilities out of public hands, a move that has sparked intense debate in the cybersecurity community.
AI 프롬프트
Create a landscape editorial hero image for this Studio Global article: What is causing cybersecurity professionals to criticize Anthropic's Claude Fable 5, and how does the model's safety guardrail system work,. Article summary: Anthropic released Claude Fable 5 on June 9, 2026 as a guardrailed public version of its powerful Mythos-class model, alongside an unrestricted twin, Claude Mythos 5, available only to vetted partners through Project Gla. Topic tags: general, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "# Claude Fable 5: Why Anthropic Put Its Most Powerful AI Behind Guardrails. * Anthropic released Claude Fable 5 on 9 June 2026. It is the first publicly available Mythos-class mode" source context "Claude Fable 5: Anthropic Locks Down Cyber and Bio" Reference image 2: visual subject "# Anthropic says these topics
openai.com

앤트로픽(Anthropic)이 2026년 6월 9일, 자사의 가장 강력한 AI 기술을 적용한 ‘클로드 페이블 5(Claude Fable 5)’를 대중에게 공개했다. 그러나 이 획기적인 기술의 등장을 반기는 목소리보다 사이버 보안 업계의 날 선 비판이 더 크게 들리고 있다.

문제의 핵심은 단순히 안전 장치가 존재한다는 사실이 아니라, 그것이 구현된 방식이다. 이용자 몰래, 그리고 지나치게 광범위하게 적용되어 합법적인 보안 연구마저 사실상 불가능하게 만들었다는 점이 논란의 불씨다. 다음은 이번 논란과 그 이면의 기술을 심층 분석한 내용이다.

업계의 비판: 과도한 필터가 합법적 연구까지 마비시킨다

연구자들이 가장 크게 불만을 제기하는 부분은 클로드 페이블 5 콘텐츠 분류기의 극단적인 민감도다. IBM X-포스의 저명한 보안 연구원 발렌티나 ‘촘피’ 팔미오티는 TechCrunch와의 인터뷰에서 "페이블은 사이버 보안과 조금이라도 관련된 모든 요청을 거부한다. 단순히 블로그 게시물을 읽는 것과 같은 아무런 문제없는 작업조차 차단한다"라고 강하게 비판했다. 이는 위험한 작업뿐만 아니라 기초적인 사이버 보안 개념을 이해하려는 시도마저 차단당하고 있음을 의미한다.

이러한 과잉 차단은 모델의 실용성에 직접적인 악영향을 미친다. 더 큰 문제는 질문이 차단될 때 사용자에게 명확히 알리지 않고, 구형 AI의 형편없는 답변으로 대체해 버린다는 점이다. 설상가상으로 이와 같은 작동 방식은 319페이지에 달하는 방대한 시스템 카드 깊숙한 곳에 숨겨져 있었다. 이에 비평가들은 앤트로픽이 특정 사용자들을 위해 모델의 능력을 '비밀리에 방해(Sabotage)'한 것이라고 강하게 비난하고 나섰다.

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AI로 검색 및 팩트체크

사람들은 또한 묻습니다.