उत्तरप्रकाशितपिछला सप्ताहLast edited 5 दिन पहले16 स्रोत

माइक्रोसॉफ्ट का ASSERT फ्रेमवर्क: कैसे AI एजेंट्स की गलतियाँ प्रोडक्शन से पहले पकड़ी जाएंगी

ASSERT (अडैप्टिव स्पेक ड्रिवन स्कोरिंग फॉर इवैल्यूएशन एंड रिग्रेशन टेस्टिंग) एक ओपन सोर्स फ्रेमवर्क है जो सादी भाषा के व्यवहार नियमों को क्रियान्वयन योग्य और स्कोर योग्य टेस्ट सूट में बदलता है, तथा नीति उल्लंघनों और स... यह प्रतिकूल (एडवर्सेरियल) परिदृश्य तैयार करता है, हर टूल कॉल को लॉग करता है, और स्कोर के साथ पास...

Studio Global AI के साथ खोजें और तथ्यों की जांच करें और ट्रेंडिंग पेज देखें

682K0

Abstract visualization representing Microsoft ASSERT framework converting natural-language AI behavior policies into structured, scored test suites for agent evaluation — What is Microsoft's ASSERT framework, announced at Build 2026, and how does it convert natural-language AI behavior policies into structuredMicrosoft's ASSERT framework automates the translation of plain-English behavior rules into executable, scored evaluation suites.
AI संकेत
Create a landscape editorial hero image for this Studio Global article: What is Microsoft's ASSERT framework, announced at Build 2026, and how does it convert natural-language AI behavior policies into structured. Article summary: Here is a concise answer based on the official Microsoft sources and trusted reporting.. Topic tags: general, general web. Reference image context from search candidates: Reference image 1: visual subject "# Build agents you can trust across any framework with open evals and a control standard. The gap is concrete: written policies do not translate into working runtime controls, eval" source context "Build agents you can trust across any framework with open evals ..." Reference image 2: visual subject "# Microsoft is making AI behavior testing easier for developers. Microsoft has released ASSERT, an open-source framework that turns plain-language AI behavior re
openai.com

माइक्रोसॉफ्ट ने 2 जून, 2026 को अपने बिल्ड 2026 डेवलपर कॉन्फ्रेंस में ASSERT (अडैप्टिव स्पेक-ड्रिवन स्कोरिंग फॉर इवैल्यूएशन एंड रिग्रेशन टेस्टिंग) की घोषणा की और इसे रिस्पॉन्सिबल AI पहल के तहत GitHub पर एक ओपन-सोर्स प्रोजेक्ट के रूप में जारी किया । यह फ्रेमवर्क एजेंटिक AI डेवलपमेंट की एक बड़ी समस्या का समाधान करता है: आप कैसे सुनिश्चित करें कि एक स्वायत्त एजेंट, असली यूज़र्स या सिस्टम से संपर्क करने से पहले, आपके उत्पाद के विशिष्ट नियमों और सुरक्षा सीमाओं का पालन करेगा? पारंपरिक AI बेंचमार्क — जो सहायकता, विषाक्तता, या सामान्य सटीकता मापते हैं — अक्सर एप्लिकेशन-विशिष्ट व्यवहार में होने वाली गंभीर विफलताओं को मिस कर देते हैं, जैसे कोई एजेंट बिना अनुमति रिफंड जारी कर दे या गोपनीय डेटा गलत व्यक्ति को भेज दे । ASSERT इस कमी को पूरा करता है और प्राकृतिक भाषा के व्यवहार विनिर्देशों को मूल्यांकन का प्रथम-श्रेणी इनपुट बनाता है, न कि सिर्फ पृष्ठभूमि संदर्भ।

कैसे ASSERT शब्दों को टेस्ट सूट में बदलता है

ASSERT पांच चरणों वाली एक प्रक्रिया का अनुसरण करता है जो एक डेवलपर के लिखित इरादे को एक स्कोर योग्य और जांच योग्य मूल्यांकन में बदल देती है:

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AI के साथ खोजें और तथ्यों की जांच करें

लोग पूछते भी हैं