AnswersPublished23 hours agoLast edited 23 hours ago29 sources

OpenAI GPT-5.6 System Card: Key Safety and Capability Findings for Sol, Terra, and Luna

OpenAI's GPT 5.6 Preview System Card, published June 26, 2026, classifies all three models (Sol, Terra, Luna) as High capability in cybersecurity and biological/chemical risk under the Preparedness Framework v2 — the... All three GPT 5.6 models (Sol, Terra, Luna) are classified as High capability under OpenAI's Prep...

Search & fact-check with Studio Global AI Browse more Trending pages

69K0

OpenAI GPT-5.6 Preview System Card cover graphic showing Sol, Terra, and Luna models with safety findings — Search & fact-check with cited sources for What are the key safety and capability findings from OpenAI's GPT-5.6 Preview System Card, coveriOpenAI's GPT-5.6 Preview System Card details safety and capability findings for the Sol, Terra, and Luna model family.
AI Prompt
Create a landscape editorial hero image for this Studio Global article: Search & fact-check with cited sources for What are the key safety and capability findings from OpenAI's GPT-5.6 Preview System Card, coveri. Article summary: Here is a comprehensive summary of the key safety and capability findings from the **GPT-5.6 Preview System Card** (published June 26, 2026), based on OpenAI's official Deployment Safety Hub and supporting analyses.. Topic tags: general, general web, user generated, academic, education. Style: premium digital editorial illustration, source-backed research mood, clean composition, high detail, modern web publication hero. Use reference image context only for broad subject, composition, and topical grounding; do not copy the exact image. Avoid: logos, brand marks, copyrighted characters, real person likenesses, fake screenshots, UI text, readable text, watermarks
openai.com

OpenAI published the GPT-5.6 Preview System Card on June 26, 2026, detailing safety and capability findings across a three-model family — Sol (flagship), Terra (mid-tier), and Luna (fastest/smallest) — alongside a layered safety stack and a limited-preview deployment strategy. The card marks the first time OpenAI has classified smaller and faster models in a family as High risk under its Preparedness Framework, and introduces new safety technology including activation classifiers and a deployment simulation method.

Model Family and Risk Classifications

Under OpenAI's Preparedness Framework (Version 2), all three GPT-5.6 models are treated as High capability in both Cybersecurity and Biological/Chemical risk . None of the three models reach the High threshold in AI Self-Improvement .

For cybersecurity specifically, Sol did not cross the "Cyber Critical" threshold — the highest risk level. In evaluations involving Chromium and Firefox, Sol identified bugs and exploitation primitives but did not autonomously produce a functional full-chain exploit under the conditions tested . Sol saturated OpenAI's internal cyber challenge set at 96.7%, placing it above the High threshold but below Critical .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

OpenAI GPT-5.6 System Card: Key Safety and Capability Findings for Sol, Terra, and Luna

Model Family and Risk Classifications

Search, cite, and publish your own answer

People also ask

What is the short answer to "OpenAI GPT-5.6 System Card: Key Safety and Capability Findings for Sol, Terra, and Luna"?

What are the key points to validate first?

What should I do next in practice?

Sources

Comments

Layered Safeguards: "Our Most Robust Safety Stack to Date"

Activation Classifiers (New)

Model-Level Safety Training

Real-Time Misuse Classifiers

Deployment Simulation ("Deployment Replay")

Deployment Simulation Results

Disallowed Content

Misalignment and Agentic Overreach

Jailbreak and Prompt Injection Robustness

HealthBench Professional Performance

Chain-of-Thought Monitorability and Controllability

Metagaming Behavior

Bias Evaluations

External Red-Teaming Results

Deployment Strategy: Limited Preview with Trusted Access

Key Uncertainty and Limitations

Model	Cybersecurity Risk	Biological/Chemical Risk	AI Self-Improvement
Sol (flagship)	High (not Critical)	High	Below High
Terra (mid-tier)	High	High	Below High
Luna (fastest)	High	High	Below High