答案已发布2个月前Last edited 上个月19 来源

AudioHijack：人耳听不到的“隐形指令”，正以96%成功率劫持AI语音助手

AudioHijack能将人耳无法察觉的恶意指令嵌入播客、YouTube视频等寻常音频中，以高达79 96%的成功率劫持大型音频语言模型，全程用户听不到任何异常。一段精心制作的30分钟对抗性音频可无限次重复使用，且在不同用户、对话和场景下均有效，攻击者只需能将音频注入AI的拾音环境即可。

使用 Studio Global AI 搜索并核查事实浏览更多热门页面

Abstract visualization of imperceptible sound waves hijacking an AI voice assistant, with audio waveforms intersecting a smart speaker icon — How does the AudioHijack attack work, and what makes it a significant new threat to AI voice assistantsA conceptual illustration of how AudioHijack uses inaudible adversarial audio to commandeer AI voice models without human detection.
AI 提示
Create a landscape editorial hero image for this Studio Global article: How does the AudioHijack attack work, and what makes it a significant new threat to AI voice assistants?. Article summary: **AudioHijack** is an auditory prompt-injection attack that embeds imperceptible adversarial noise into otherwise benign audio, hijacking Large Audio-Language Models (LALMs) with 79–96% success rates [1][3][10]. It was p. Topic tags: general, academic, general web. Reference image context from search candidates: Reference image 1: visual subject "A digital visualization depicts an AI chip at the center, radiating connections and signals, symbolizing a cyber attack on voice assistants like AudioHijack, with a focus on techno" Reference image 2: visual subject "The image shows a software interface called Voice Chat that displays a workflow involving capturing audio from Zoom.us, analyzing it with P
openai.com

想象一下，你在家惬意地听着播客。你的智能音箱捕捉到了声音，不一会儿，你的AI助手便开始自动发送消息、下载文件，或是搜索你的私密信息——而你，自始至终没说过一句话。你什么异常都没听到，但你的助手，刚刚被一种隐藏在人类听觉阈值之下的声音劫持了。

这并非虚构。2026年5月，在旧金山举办的IEEE安全与隐私研讨会（IEEE Symposium on Security and Privacy）上，来自浙江大学、新加坡国立大学和南洋理工大学的研究人员展示了这一确凿的威胁，并公布了他们的研究成果——AudioHijack。这是一种针对大型音频语言模型（LALMs）的新型听觉提示注入攻击。

该攻击在多种顶尖模型上取得了令人警醒的**79%至96%**的成功率，同时，整个过程对人类听众来说完全不可察觉。

AudioHijack与以往的音频攻击有何不同？

早期的语音助手攻击通常依赖于唤醒词激活——例如，播放录制的“Hey Siri”或“小爱同学”指令来唤醒助手，然后再下达可听的恶意命令。AudioHijack则危险得多，因为它直接瞄准了能自主执行复杂多步骤操作的生成式大型音频语言模型。这类模型无需任何可听的触发短语，就能发送邮件、访问个人数据、控制智能家居设备。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜索并核查事实

人们还问