But the reality is that Anthropic has already been moving aggressively on pricing. On June 9, just two days before the OpenAI story broke, Anthropic launched Claude Fable 5 at $10 per million input tokens and $50 per million output tokens — half the price of its previous Mythos Preview model. The new model scored 80.3% on SWE-Bench Pro, a 22-point lead over GPT-5.5's 58.6% . Anthropic also introduced an 8x cheaper "Compact" mode and, on May 14, overhauled its subscription structure so that heavy users of Claude's Agent SDK would be moved out of flat-rate subscription pools into metered API billing starting June 15
.
An honest reading of the competitive landscape suggests OpenAI is not acting from a position of strength. The price cuts are a response to a market position it lost, not a magnanimous gesture .
OpenAI CEO Sam Altman publicly acknowledged at a recent event that AI usage costs have become "a huge issue" for enterprise customers . Alexander Embiricos, OpenAI's head of enterprise, told TechCrunch that customer conversations have fundamentally shifted: "Six months ago, I would have a conversation with a customer and it would be all about 'What can it do? Is it good enough?' Our conversations are never about that now. Now the conversations are about, 'hey, we're spending so much. What visibility do you have? What auditability do you have?'"
.
The scale of enterprise spending is staggering. Altman revealed that one OpenAI power user burns through 100 billion tokens per month, equating to roughly $100,000–$300,000 per month at blended enterprise rates . Altman has said the company would like to bill AI "like electricity" — a pay-for-what-you-use model that sounds appealing in theory but becomes terrifying for CFOs when the meter never stops running
.
Both companies view developer tools as the critical beachhead for enterprise AI adoption. OpenAI slashed its "Pro" subscription to $100 per month in April 2026 specifically to make its Codex programming tools cheaper for professional developers . Anthropic's Claude Code and Agent SDK are positioned as direct alternatives.
Anthropic's June credit overhaul effectively removed a 15–30× subsidy that had made heavy SDK usage artificially cheap under flat-rate plans, which will significantly raise costs for the heaviest Claude Code users . The timing of OpenAI's reported price cuts, just days after Anthropic's model launch and ahead of the June 15 billing change, suggests a deliberate attempt to exploit that moment of customer sticker shock.
The pricing crisis cannot be understood without the parallel collapse of the "tokenmaxxing" productivity narrative. Tokenmaxxing — the practice of treating AI token consumption as a proxy for engineering productivity — became an internal culture across Silicon Valley throughout 2025 and early 2026. The New York Times reported in March that an OpenAI engineer processed 210 billion tokens in a single week, and at Amazon, some employees spun up AI agents to complete "wholly meaningless or unnecessary tasks" simply to keep their token usage statistics high .
But the data has turned decisively against this practice. Engineering analytics firm Faros AI, analyzing data from 22,000 developers across 4,000 teams, found that while AI adoption accelerated task throughput (task completion up 34%, epics up 66%), it also drove bugs per developer up 54%, median code review time up 5×, and code churn up an astonishing 861% in high AI adoption environments .
Initial code acceptance rates of 80–90% — which managers celebrated — turned out to be a mirage. When researchers tracked code revisions over the following weeks, the real-world acceptance rate plummeted to 10–30%, revealing substantial hidden technical debt . Jellyfish found that the top 10% of Claude Code users consumed roughly 10 times as many tokens as the median developer but produced only about twice the output
. The cost per merged pull request rose from $0.28 under light AI usage to as high as $89 under heavy usage, according to data from software firm Jellyfish
.
Even beyond engineering, the broader productivity case is crumbling. BCG's 2026 Global AI at Work report, surveying nearly 12,000 frontline employees, found that 42% of regular AI users reported saving eight hours per week, the equivalent of a full workday. But 66% said they received limited to no guidance on what to do with that saved time, and half said they were not measurably more productive . Uber COO Andrew Macdonald admitted the company has been struggling to connect the boost in individual worker productivity with any company-wide impact
.
ServiceNow Chief Customer Officer Chris Bedi captured the growing skepticism bluntly: "It's almost like measuring a restaurant's success by the amount of food it buys, not how many happy customers leave. There's a bill to pay for those tokens" .
The enterprise conversation has shifted from "go fast" to "we need guardrails" . That shift directly threatens the core revenue model of AI providers who benefit from unbounded usage.
Both OpenAI and Anthropic are reportedly preparing for initial public offerings . That timeline makes the economics of a price war especially dangerous. Aggressive token price cuts directly compress margins at exactly the moment both companies need to demonstrate sustainable unit economics to public market investors. Cutting prices without corresponding reductions in the enormous computing costs required for training and inference could make profitability even more elusive
.
But the deeper investor concern is about switching costs — or the lack of them. A March and April 2026 survey of 200 executives by Wakefield Research found that 79% were slightly or very concerned about lock-in with their current AI vendors . When one AI model's output is roughly as good as another's for a given task, and API integration is relatively straightforward, enterprise customers can pivot to the cheaper option with minimal friction.
The all-you-can-eat AI era is over . What is emerging in its place looks less like a winner-take-all platform war and more like a commodity pricing battle where the provider with the leanest cost structure survives. OpenAI's reported plan to cut prices is, at its core, an acknowledgment that the product itself is not differentiated enough to command a premium when customers are scrutinizing every token.
Comments
0 comments