What should I do next in practice?

Tidak seperti penanda aras umum, ASSERT menyasarkan peraturan perniagaan yang spesifik.

AnswersPublishedlast weekLast edited 5 days ago16 sources

Microsoft ASSERT: Uji Ejen AI Hanya Guna Ayat Bahasa Inggeris, Kegagalan Dikesan Awal!

ASSERT (Adaptive Spec driven Scoring for Evaluation and Regression Testing) ialah rangka kerja sumber terbuka yang menukar peraturan tingkah laku bahasa Inggeris biasa kepada suite ujian boleh skor, mengesan pelanggar... Ia menjana senario ujian ‘adversarial’, merekod setiap tindakan dan panggilan alatan ejen, serta...

Search & fact-check with Studio Global AI Browse more Trending pages

682K0

Abstract visualization representing Microsoft ASSERT framework converting natural-language AI behavior policies into structured, scored test suites for agent evaluation — What is Microsoft's ASSERT framework, announced at Build 2026, and how does it convert natural-language AI behavior policies into structuredMicrosoft's ASSERT framework automates the translation of plain-English behavior rules into executable, scored evaluation suites.
AI Prompt
Create a landscape editorial hero image for this Studio Global article: What is Microsoft's ASSERT framework, announced at Build 2026, and how does it convert natural-language AI behavior policies into structured. Article summary: Here is a concise answer based on the official Microsoft sources and trusted reporting.. Topic tags: general, general web. Reference image context from search candidates: Reference image 1: visual subject "# Build agents you can trust across any framework with open evals and a control standard. The gap is concrete: written policies do not translate into working runtime controls, eval" source context "Build agents you can trust across any framework with open evals ..." Reference image 2: visual subject "# Microsoft is making AI behavior testing easier for developers. Microsoft has released ASSERT, an open-source framework that turns plain-language AI behavior re
openai.com

Microsoft mengumumkan ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing) pada persidangan pembangun Build 2026 pada 2 Jun, dan mengeluarkannya sebagai projek sumber terbuka di GitHub di bawah inisiatif Responsible AI . Rangka kerja ini menangani masalah kritikal dalam pembangunan ejen AI (AI Agent): bagaimana memastikan AI autonomi mematuhi peraturan spesifik produk dan sempadan keselamatan anda sebelum ia berinteraksi dengan pengguna sebenar .

Sering kali, ujian AI tradisional hanya mengukur ketepatan umum, ‘ketoksikan’ (toxicity), atau sama ada jawapan itu ‘membantu’. Ujian sebegini gagal mengesan kegagalan aplikasi-spesifik yang serius, seperti ejen yang secara automatik meluluskan permohonan ‘refund’ tanpa kebenaran atau mendedahkan data sulit syarikat kepada penerima yang salah .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Microsoft ASSERT: Uji Ejen AI Hanya Guna Ayat Bahasa Inggeris, Kegagalan Dikesan Awal!

Search, cite, and publish your own answer

People also ask

What is the short answer to "Microsoft ASSERT: Uji Ejen AI Hanya Guna Ayat Bahasa Inggeris, Kegagalan Dikesan Awal!"?

What are the key points to validate first?

What should I do next in practice?

Sources

Comments

Bagaimana Ayat Bahasa Inggeris Ditukar Menjadi Ujian Gempur

Mengatasi Kelemahan Penanda Aras Generik

Sebahagian daripada ‘Trust Stack’ yang Lebih Besar