Answer. Published 2 months ago. Last edited last month. 26 sources.

Claude Opus 4.8: How Anthropic Is Teaching AI to Admit What It Doesn't Know

Anthropic's new flagship model, Claude Opus 4.8, launched on May 28, 2026, and is trained to flag uncertainties and make fewer unsupported claims. Compared with its previous version, overlooking flaws in code... A big question: Anthropic has documented that in 9% of cases older Opus models, on being tested...


Anthropic's Claude Opus 4.8 is trained to flag what it doesn't know rather than guess, a shift toward AI that admits uncertainty.

Anthropic released Claude Opus 4.8 on May 28, 2026, describing it as a drop-in replacement for Opus 4.7 and keeping the price unchanged: $5 per million input tokens and $25 per million output tokens. The company described it as having "sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors." The model arrived with competitive benchmark scores: 88.6% on SWE-bench Verified, 93.6% on GPQA Diamond, and 74.6% on Terminal-Bench 2.1.
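
To make the quoted pricing concrete, here is a minimal sketch of the arithmetic for estimating the cost of a single request. The two rates come from the figures above; the function name `estimate_cost` and the token counts are hypothetical and chosen only for illustration, and actual billing may include factors this article does not describe.

```python
# Rates as quoted in the article, in USD per one million tokens.
INPUT_PRICE_PER_MILLION = 5.00
OUTPUT_PRICE_PER_MILLION = 25.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts.

    Hypothetical helper: it applies only the two per-token rates quoted
    in the article and ignores anything else that might affect a bill.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 200,000 input tokens and 50,000 output tokens.
    print(f"${estimate_cost(200_000, 50_000):.2f}")
```

Because output tokens are priced at five times the input rate, the 50,000 output tokens in this illustrative request cost more ($1.25) than the 200,000 input tokens ($1.00).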

How Opus 4.8 Improves AI Honesty

Anthropic has positioned honesty as a headline feature of Opus 4.8. The model is trained to flag uncertainties about its own work and to avoid making unsupported claims. Early testers reported that it is "more likely to flag uncertainties about its work and less likely to make unsupported claims."
