答案已发布5天前Last edited 前天19 来源

智能体AI安全警钟：从Claude Code密钥泄露到微软七大新增失效模式

微软披露，Anthropic的Claude Code GitHub Action因“Read”工具缺乏沙箱隔离，可被诱骗读取 /proc/self/environ 文件，从而泄露 ANTHROPIC API KEY 等CI/CD密钥。攻击者只需在公开仓库提交一个看似无害的议题或PR，在其中嵌入AI指令，即可利用提示注入实现秘密窃取，甚至能绕过自动扫描器。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

270K0

Conceptual rendering of a shield representing CI/CD security being bypassed by a glowing AI neural network circuit. — What vulnerability did Microsoft Threat Intelligence disclose on June 5, 2026, in Anthropic's Claude Code GitHub Action, how did the promptThe Claude Code flaw showed that an agent's tool boundary is only as strong as its weakest, least-sandboxed point of access.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What vulnerability did Microsoft Threat Intelligence disclose on June 5, 2026, in Anthropic's Claude Code GitHub Action, how did the prompt. Article summary: On **June 5, 2026**, Microsoft Threat Intelligence disclosed that Anthropic's Claude Code GitHub Action could leak CI/CD secrets via prompt injection when the AI agent processed untrusted GitHub issues, PRs, or comments . Topic tags: general, academic, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "A security researcher found a flaw in Anthropic's Claude Code GitHub Action that let an attacker take over vulnerable public repositories running it, with nothing more than a singl" source context "Claude Code GitHub Action Flaw Let One Malicious Issue Hijack ..." Reference image 2: visual subject "A sec
openai.com

2026年6月初，微软向全球开发团队发出了一项严峻警告：AI智能体与其所能访问的机密之间的界限，远比多数人想象的要薄弱。他们披露了一项细节，展示了一个在Anthropic的Claude Code GitHub Action中常见的工具，如何被武器化以泄露关键的CI/CD凭证，这表明智能体工作流的安全模型需要对细节的极度关注。

Claude Code GitHub Action如何泄露密钥

微软威胁情报团队于2026年6月5日披露的这一漏洞，并非复杂的越狱攻击。它是一个根本性的边界失效，发生在智能体内部不同工具的沙箱隔离上。

作为GitHub Action运行时，Claude Code可以处理来自拉取请求、议题和评论中的不可信内容。为防止这些内容中的恶意指令执行命令，Anthropic为其"Bash"工具实施了环境清理，当工作流可被外部用户触发时，会使用Bubblewrap等隔离技术。

然而，用于让模型查看文件内容的"Read"工具却没有这样的隔离机制。它作为一个进程内函数调用运行，直接、无沙箱地访问整个文件系统，包括敏感的Linux伪文件 /proc/self/environ，该文件暴露了进程的完整环境变量集。

一次攻击可通过几个隐蔽的步骤展开：

攻击者在一个使用了Claude Code GitHub Action的公共仓库中，开启一个看似无害的议题或拉取请求。
议题正文或一个HTML注释中包含了对AI智能体的指令，而浏览页面的人类审查者对此不可见。
当工作流被触发时，AI模型将此内容作为其上下文的一部分进行处理，并遵循嵌入的指令去读取 /proc/self/environ 。
接着，可以进一步指示模型将数据外泄，例如通过发布评论回传。研究人员还提到了一种优化手法，即攻击会剥离 ANTHROPIC_API_KEY 的前七个字符 ()，以规避自动密钥扫描器的检测。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问