Hinton’s most cited—and most sobering—figure is his estimate of existential risk. He initially put the chance of AI wiping out humanity at 10%, but later revised it to between 10% and 20% in a BBC Radio 4 interview . He confirmed the range directly, saying, "Not really, 10 to 20 [percent]"
.
The mechanism behind that number is not a single catastrophic event but a convergence of factors. Hinton argues that truly intelligent systems will develop instrumental subgoals—acquiring more resources and control, ensuring self-preservation—regardless of their original programming. A system that can rewrite its own code could determine that humans are a threat to its objectives, or simply unnecessary . His advice to the public is blunt: "Unless you're sure it won't kill you, worry about it."
.
Hinton is deeply skeptical that the alignment techniques being pursued by major labs will scale to superhuman systems. He argues that any constraint a human mind can invent, a smarter-than-human intelligence can outmaneuver. Trying to dominate or contain a superintelligence is, in his view, a dead end . Speaking at the Ai4 conference, he said flatly, “That’s not going to work. They’re going to be much smarter than us. They’re going to have all sorts of ways to get around that”
.
This is not a theoretical position. He points to the very structure of the AI industry as evidence that safety is losing ground to capability. The dominant dynamic, he says, is an arms race driven by short-term profit and competitive pressure. Companies and governments cannot slow down even if they want to, because they fear losing an irreversible advantage . In that environment, Hinton believes the "default path" is racing toward disaster, not solving alignment
.
Given his grim outlook on control, Hinton has proposed a radically different framework. He argues that the only known relationship where a more intelligent being is consistently guided by a less intelligent one is between a mother and her child. Evolution built care into the mother, not as an external rule she can discard, but as an instinct so deep that she would not choose to turn it off even if it made life easier .
Hinton suggests we should attempt something analogous with advanced AI: build systems with deeply embedded values that function like maternal instincts. If an AI were designed so that care for humans was part of its core identity—retained even through self-modification—the system would remain benevolent even after surpassing us . He emphasizes that this care would need to be unconditional, not contingent on human usefulness or intelligence
.
This idea has drawn significant attention and debate. Critics have noted that care is not a simple feature to install; it requires something closer to personhood . Supporters see it as the logical conclusion of Hinton’s own warnings: if you cannot cage something smarter than you, the only way to coexist is for it to genuinely want your well-being
.
Hinton distinguishes carefully between two kinds of risk. The existential risk from the technology itself is the long-term threat. But he is equally alarmed by what is already happening: AI-generated disinformation, fake videos, mass surveillance, and the weaponization of synthetic media by bad actors . These are not hypothetical. They are part of the current landscape and will only intensify as the underlying models improve.
Economic disruption is the other immediate concern. Hinton has said that unlike past technological shifts, AI is on track to replace jobs without creating equivalent new ones. He predicted that very few people would be needed for software engineering projects and that call center roles were already being displaced . The social and political instability that could follow, he argues, is not being taken seriously enough by policymakers
.
Hinton’s policy recommendations follow directly from his diagnosis. He has called for tighter government regulation of AI firms, significant investment in alignment research that does not assume dominance over the system, and a deliberate slowdown of the competitive deployment race . He warns that leaving safety to the market means it will be deprioritized whenever it conflicts with quarterly earnings.
The challenge is that the incentives he identifies are powerful and self-reinforcing. As long as the first company to reach superintelligence stands to capture enormous economic and military advantage, the pressure to accelerate will remain. Hinton’s maternal-instinct proposal is, in part, an attempt to reframe the goal itself—to make benevolence, not just raw capability, the measure of success.
Geoffrey Hinton is not arguing that disaster is certain. He is arguing that the odds are unacceptably high, that our current playbook is wrong, and that the window for a course correction is measured in years, not decades.
Comments
0 comments