คำตอบเผยแพร่แล้วเมื่อวานซืนLast edited เมื่อวานซืน32 แหล่งที่มา

เจาะกลยุทธ์ 'ล่าฝูง' ถอดรหัส AI ที่การ์ดสูงที่สุดของ Anthropic ภายใน 24 ชั่วโมง

เพียงวันเดียวหลังเปิดตัว Claude Fable 5 โมเดล AI ที่ทรงพลังที่สุดของ Anthropic ก็ถูกนักวิจัยนิรนาม 'พลินี' ฝ่าเกราะป้องกันด้วยเทคนิค 'ล่าฝูง' (Pack Hunt) ที่ใช้ AI หลายตัวทำงานประสานกัน การโจมตีครั้งนี้ดึงเอาข้อความระบบ (System Prompt) ความยาว 120,000 ตัวอักษรออกมาได้ และบีบให้โมเดลสร้างโค้ดสำหรับการแฮกและคำแนะนำทาง...

ค้นหาและตรวจสอบข้อเท็จจริงด้วย Studio Global AI ดูหน้าที่กำลังมาแรงเพิ่มเติม

37K0

What happened when Anthropic's Claude Fable 5 was reportedly jailbroken by a researcher just one day after its June 9 launch, what techniqueAI-generated editorial hero image for What happened when Anthropic's Claude Fable 5 was reportedly jailbroken by a researcher just one day after its June 9 launch, what technique.
AI พรอมต์
Create a landscape editorial hero image for this Studio Global article: What happened when Anthropic's Claude Fable 5 was reportedly jailbroken by a researcher just one day after its June 9 launch, what technique. Article summary: On June 10, 2026 — just one day after Anthropic launched Claude Fable 5, its first public Mythos-class model — prolific AI red-teamer **Pliny the Liberator** announced he had bypassed the model's safety classifiers, extr. Topic tags: general, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "# Anthropic’s Claude Fable 5 Jailbroken to Generate Stack Exploits. Anthropic's Claude Fable 5 Jailbroken. Anthropic launched Claude Fable 5 on June 9, 2026, as the first publicly" source context "Anthropic's Claude Fable 5 Jailbroken to Generate Stack ..." Reference image 2: visual subject "Anthropic Releases Cl
openai.com

Anthropic เปิดตัว Claude Fable 5 เมื่อวันที่ 9 มิถุนายน 2026 พร้อมประกาศว่านี่คือโมเดลสาธารณะรุ่นแรกในระดับ Mythos ซึ่งเป็นระดับที่มีศักยภาพสูงมากจนบริษัทเคยพิจารณาว่าอันตรายเกินกว่าจะเปิดให้ใช้งานได้โดยไม่จำกัด สถาปัตยกรรมความปลอดภัยของมันไม่เคยมีใครทำมาก่อน: มีระบบคัดกรองด้วย AI ที่ถูกฝึกให้ตรวจจับคำถามที่มีความเสี่ยงสูงใน 4 ด้านหลัก ได้แก่ ความปลอดภัยทางไซเบอร์, ชีววิทยา, เคมี, และการกลั่นโมเดล (Model Distillation) หากเจอคำถามต้องสงสัย ระบบจะไม่ปฏิเสธ แต่จะเปลี่ยนเส้นทางไปให้ Claude Opus 4.8 ซึ่งเป็นรุ่นที่ฉลาดน้อยกว่าตอบแทน ทางบริษัทยังระบุด้วยว่า การทดสอบโดยแฮกเกอร์รับจ้างและทีม Red-Teaming ภายนอกกว่า 1,000 ชั่วโมง ไม่สามารถหาช่องโหว่แบบ Universal Jailbreak ได้เลยแม้แต่ครั้งเดียว

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

ค้นหาและตรวจสอบข้อเท็จจริงด้วย Studio Global AI

คนยังถาม

คำตอบสั้น ๆ สำหรับ "เจาะกลยุทธ์ 'ล่าฝูง' ถอดรหัส AI ที่การ์ดสูงที่สุดของ Anthropic ภายใน 24 ชั่วโมง" คืออะไร

ประเด็นสำคัญที่ต้องตรวจสอบก่อนคืออะไร?

แหล่งที่มา

Comments

0 comments

Loading comments...

เจาะกลยุทธ์ 'ล่าฝูง' ถอดรหัส AI ที่การ์ดสูงที่สุดของ Anthropic ภายใน 24 ชั่วโมง

Search, cite, and publish your own answer

คนยังถาม

คำตอบสั้น ๆ สำหรับ "เจาะกลยุทธ์ 'ล่าฝูง' ถอดรหัส AI ที่การ์ดสูงที่สุดของ Anthropic ภายใน 24 ชั่วโมง" คืออะไร

ประเด็นสำคัญที่ต้องตรวจสอบก่อนคืออะไร?

แหล่งที่มา

Comments

กลยุทธ์ 'ล่าฝูง' (Pack Hunt): วิธีการโจมตี

คำกล่าวอ้างด้านความปลอดภัยของ Anthropic ก่อนเปิดตัว

รูปแบบการเจลเบรคที่รวดเร็วซ้ำแล้วซ้ำเล่า

นัยสำคัญต่อการทดสอบความปลอดภัยของ AI