When you ask an AI chatbot a health question about supplements or peptides, the answer you get might feel like neutral, crowd-sourced wisdom. But behind that response, an invisible war for your trust is raging. Companies are now systematically manipulating the source material that feeds these AI models—specifically, the vast content archives of Reddit.
The battleground is not a search engine results page but the very training data of large language models. Peptide and supplement companies have been caught deploying sophisticated astroturfing campaigns, flooding Reddit communities with fake posts designed to be scraped and later replayed by AI chatbots as authoritative advice .
The technique represents a new and more insidious form of marketing. It's not classic SEO designed to rank a page on Google; it's “AI Engine Optimization” (AEO), a shadowy practice where the goal is to embed commercial messages directly into the datasets that train models like ChatGPT and Google’s Gemini .
This exploit is possible because of Reddit’s billion-dollar relationship with the AI industry. In 2024, Reddit signed a content-licensing deal with Google for approximately $60 million per year and a separate partnership with OpenAI worth an estimated $70 million annually . These agreements give AI companies a direct pipe to real-time, structured human conversation—precisely the kind of data that companies want to contaminate.
The playbook uncovered by moderators and independent researchers involves several coordinated steps:
The manipulation finally became too blatant for Reddit’s volunteer moderators to ignore. In late May 2026, the moderators of r/biohackers made a drastic decision: they banned all new standalone posts about peptides and hormone replacement therapy (HRT) .
The moderators explicitly stated that the ban was not because the science of peptides is inherently dangerous, but because of a “coordinated effort from companies in those industries to manipulate the community’s content” to influence what large language models say . The trust within the community had been broken by marketers treating the subreddit as a training ground for AI manipulation.
The battle isn't just being fought by volunteer moderators. Reddit’s corporate leadership has launched a multi-pronged legal campaign to protect its data ecosystem from unauthorized scraping that feeds these manipulation loops.
While Reddit is happy to sell its data to partners, it has been very aggressive toward unauthorized scrapers. The company has compared the data-scraping firms SerpApi, Oxylabs, and AWMProxy to “bank robbers” and “data launderers,” accusing them of “industrial-scale, unlawful circumvention” of its protections to resell Reddit content to third parties .
In a particularly cunning sting operation reported in court documents, Reddit planted a “trap” post visible only to Google’s crawler. The post later appeared in Perplexity AI’s “answer engine”—proving, Reddit alleged, that Perplexity had scraped the content from Google’s search results rather than licensing it directly . This led to a high-stakes lawsuit filed in October 2025 in the Southern District of New York
. Reddit also sued Anthropic, the maker of the AI model Claude, for allegedly training on its users’ data without permission
.
These lawsuits are part of a broader strategy to signal that while Reddit is open to data deals—like those with Google and OpenAI—those who refuse to play by its rules will face a legal team that’s willing to use digital forensics to catch them in the act .
The peptide scandal on Reddit is a warning sign for the future of AI-powered search. It exposes a fundamental vulnerability: models are only as trustworthy as their training data. The community response from other subreddits shows the ripple effects. The massive r/programming community (with 6.9 million members) ran a month-long ban on LLM-generated content in April 2026, specifically to fight the flood of low-quality, auto-generated material that was making it impossible to have authentic coding discussions .
For consumers, the takeaway is critical: when an AI chatbot cites “Reddit users” as a source for health advice, those “users” might actually be sophisticated marketing bots, and the “consensus” they represent may have been manufactured in a boardroom. The safeguards on Reddit’s licensed data have proven insufficient to stop coordinated content-seeding at the user level, leaving the authenticity of the very foundation of the modern AI web in doubt .
Studio Global AI
Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.
Peptide and hormone therapy companies have been caught coordinating fake discussions on Reddit to manipulate answers provided by ChatGPT and Google’s AI, exploiting the platform's lucrative data licensing deals with O...
Peptide and hormone therapy companies have been caught coordinating fake discussions on Reddit to manipulate answers provided by ChatGPT and Google’s AI, exploiting the platform's lucrative data licensing deals with O... The manipulation triggered real world consequences: moderators of the popular r/biohackers subreddit banned all new peptide and HRT posts in May 2026, and Reddit has sued multiple firms—including Perplexity AI—for dat...
This new form of “AI Engine Optimization” exploits the fact that AI models now heavily rely on Reddit as a primary source of training data, a reliance that costs Google around $60 million and OpenAI roughly $70 millio...
Loading comments...
Comments
0 comments