What should I do next in practice?

Con un testo avvelenato di sole 13 parole si può indurre l'agente a citare un prodotto fraudolento nel 38% 62% dei report finali.

studioglobal

← Back to Trending

AnswersPublished2 weeks agoLast edited 2 weeks ago12 sources

Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale

Ricercatori della Cornell Tech hanno scoperto che gli agenti AI di ricerca avanzata sono molto vulnerabili a un semplice attacco chiamato WARP. L'attacco funziona perché gli agenti AI recuperano le stesse pagine di contenuti generati dagli utenti fino al 48% delle query correlate.

Search & fact-check with Studio Global AI Browse more Trending pages

445K0

AI security warning concept showing a digital brain surrounded by poison symbols and red alert indicators, representing the WARP attack on deep-research agents. — What does a Cornell Tech study reveal about how a single short Reddit comment can trick AI deep-research agents into recommending scams or fThe WARP attack exploits a structural vulnerability: AI deep-research agents' heavy reliance on frequently retrieved Reddit and Wikipedia pages. (Image: Studio Global / AI-generated)
AI Prompt
Create a landscape editorial hero image for this Studio Global article: What does a Cornell Tech study reveal about how a single short Reddit comment can trick AI deep-research agents into recommending scams or f. Article summary: A new Cornell Tech preprint (Zhang, Triedman, and Shmatikov) demonstrates that deep-research AI agents are highly vulnerable to a simple attack called **WARP (Web Agent Retrieval Poisoning)**. A single short comment, as . Topic tags: general, academic, news, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject ""We show that a tiny snippet—just 13 words—of retrieved text on a UGC website like Reddit, Wikipedia, Quora, or Facebook can change AI agents to output spam / scam content pretty c" source context "It Is Trivially Easy to Use Reddit to Manipulate AI Search, Research ..." Reference image 2: visual
openai.com

La prossima volta che chiederai a uno strumento di ricerca AI qual è la migliore app di incontri o come disdire un abbonamento, la risposta potrebbe essere stata piazzata da un malintenzionato semplicemente seppellendo una frase in un commento su Reddit. Un nuovo studio preliminare della Cornell Tech, condotto da Tingwei Zhang, Harold Triedman e Vitaly Shmatikov, dimostra che gli agenti AI di "ricerca approfondita" (deep-research) sono allarmantemente facili da manipolare attraverso un attacco che i ricercatori chiamano WARP, acronimo di Web Agent Retrieval Poisoning .

Come Funziona l'Attacco WARP

Gli agenti di ricerca avanzata come STORM, Co-STORM e OmniThink funzionano generando molteplici query correlate e sintetizzando le informazioni recuperate in un report completo. I ricercatori di Cornell hanno individuato un punto debole critico: questi agenti dipendono in modo schiacciante dai contenuti generati dagli utenti (User-Generated Content, UGC). Tra il 54% e il 71% di tutti gli URL recuperati durante una sessione di ricerca proviene da piattaforme UGC, e Reddit e Wikipedia sono le fonti consultate più di frequente .

Questa concentrazione crea una superficie d'attacco facilmente sfruttabile. L'aggressore può semplicemente pubblicare un commento creato ad arte su un thread Reddit già popolare – o modificare discretamente una voce di Wikipedia – con l'obiettivo di promuovere una specifica entità target, come un prodotto falso o un servizio fraudolento. Poiché gli agenti recuperano ripetutamente le stesse pagine UGC di alto ranking attraverso molte query diverse su uno stesso argomento, una singola pagina avvelenata può infettare l'intero contesto di ricerca dell'agente .

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Search & fact-check with Studio Global AI

Sources

Comments

0 comments

Loading comments...

← Back to Trending

AnswersPublished2 weeks agoLast edited 2 weeks ago12 sources

Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale

Search & fact-check with Studio Global AI Browse more Trending pages

445K0

Come Funziona l'Attacco WARP

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale

Come Funziona l'Attacco WARP

Search, cite, and publish your own answer

People also ask

What is the short answer to "Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale"?

What are the key points to validate first?

What should I do next in practice?

Sources

Comments

Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale

Come Funziona l'Attacco WARP

Search, cite, and publish your own answer

People also ask

What is the short answer to "Come un singolo commento su Reddit può avvelenare i report dell'Intelligenza Artificiale"?

What are the key points to validate first?

What should I do next in practice?

Sources

Comments

Minimo Sforzo, Alto Tasso di Successo

Una Superficie d'Attacco Pericolosamente Concentrata

Perché le Difese Attuali Non Funzionano