답변게시됨2개월 전Last edited 지난달19 소스

인간은 들을 수 없지만 AI 조수는 완벽히 복종한다: 오디오하이잭 공격의 전모

오디오하이잭(AudioHijack)은 사람 귀에 들리지 않는 악성 명령을 팟캐스트·유튜브 영상 등 일상적인 오디오에 심어, 대규모 오디오 언어 모델(LALM)을 최대 96% 확률로 탈취하는 새로운 공격 기법이다. 단 30분 분량의 적대적 오디오 클립 하나만 제작하면, 사용자나 대화 맥락과 무관하게 무제한 재사용이 가능하며, 공격자는 단순히 AI 주변 환경에 소리를 주입하는 것만으로 시스템을 장악할 수 있다.

Studio Global AI로 검색 및 팩트체크 인기 페이지 더 보기

Abstract visualization of imperceptible sound waves hijacking an AI voice assistant, with audio waveforms intersecting a smart speaker icon — How does the AudioHijack attack work, and what makes it a significant new threat to AI voice assistantsA conceptual illustration of how AudioHijack uses inaudible adversarial audio to commandeer AI voice models without human detection.
AI 프롬프트
Create a landscape editorial hero image for this Studio Global article: How does the AudioHijack attack work, and what makes it a significant new threat to AI voice assistants?. Article summary: **AudioHijack** is an auditory prompt-injection attack that embeds imperceptible adversarial noise into otherwise benign audio, hijacking Large Audio-Language Models (LALMs) with 79–96% success rates [1][3][10]. It was p. Topic tags: general, academic, general web. Reference image context from search candidates: Reference image 1: visual subject "A digital visualization depicts an AI chip at the center, radiating connections and signals, symbolizing a cyber attack on voice assistants like AudioHijack, with a focus on techno" Reference image 2: visual subject "The image shows a software interface called Voice Chat that displays a workflow involving capturing audio from Zoom.us, analyzing it with P
openai.com

집에서 팟캐스트를 듣고 있다고 상상해 보세요. 당신의 스마트 스피커가 오디오를 감지하고, 잠시 후 AI 비서가 당신 몰래 메시지를 보내거나 파일을 다운로드하고 중요한 정보를 검색하기 시작합니다. 당신은 어떤 이상한 소리도 듣지 못했지만, 당신의 비서는 이미 인간의 가청 역치 아래에 숨겨진 소리에 완전히 장악당한 상태입니다.

이것은 가상의 시나리오가 아닙니다. 저장대학교(Zhejiang University), 싱가포르 국립대학교(National University of Singapore), 난양 공과대학교(Nanyang Technological University) 연구진이 2026년 5월 IEEE 보안 및 개인정보 보호 심포지엄(IEEE Symposium on Security and Privacy)에서 발표한 오디오하이잭(AudioHijack) 이라는 새로운 형태의 청각적 프롬프트 주입 공격은 이러한 위협이 현실임을 증명했습니다 .

이 공격은 여러 최신 AI 모델을 대상으로 79~96%라는 충격적인 성공률을 기록하면서도, 인간 청취자에게는 완전히 감지되지 않는 기묘한 특성을 지니고 있습니다 .

과거의 음성 해킹과 무엇이 다른가?

과거의 음성 비서 공격은 주로 ‘웨이크 워드(Wake Word)’ 활성화에 의존했습니다. 즉, 미리 녹음된 "헤이 시리"나 "오케이 구글"을 재생하여 비서를 깨운 후, 가청 가능한 악성 명령을 내리는 방식이었습니다. 오디오하이잭은 훨씬 더 위험합니다. 이메일 발송, 개인 데이터 접근, 스마트 홈 기기 제어 등 복잡한 다단계 작업을 사람의 개입 없이 스스로 수행할 수 있는 생성형 대규모 오디오-언어 모델(LALM)을 직접 겨냥하기 때문입니다. 공격자는 더 이상 시끄러운 명령어를 외칠 필요조차 없습니다 .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AI로 검색 및 팩트체크

사람들은 또한 묻습니다.