OpenAI is losing. And it's not close.

Gloss Key Takeaways

Anthropic has reportedly surpassed OpenAI in annualized revenue and is winning a large majority of head-to-head enterprise deals, signaling a shift where the money is made.
OpenAI’s consumer lead is eroding as ChatGPT’s traffic and app share fall while Gemini scales to massive usage and Claude gains share quickly.
On product performance and value, competitors increasingly match or beat OpenAI on key benchmarks and/or price, making “parity” a strategic loss when you’re no longer best and still expensive.
GPT-5’s launch triggered significant user backlash (speed, reliability, hallucinations), suggesting quality/regression issues possibly tied to transitions, safety tuning, and routing to smaller models.
The article argues OpenAI’s internal instability—mission changes, safety cuts, legal conflict, and heavy burn—may be a deeper risk than any single model release.

Three things you'll walk away with after reading this:

Anthropic passed OpenAI in revenue. $30 billion annualized run rate, up from $1 billion fourteen months ago. Anthropic wins 70% of enterprise deals in head-to-head competition. That's not a rounding error.
The products tell the story. Claude Code went from zero to the most loved coding tool in eight months. GPT-5 launched to user backlash so severe OpenAI declared an internal "Code Red." Gemini 3.1 Pro tops 13 of 16 major benchmarks. OpenAI's models aren't bad. They're third.
The organizational rot runs deeper than the products. OpenAI deleted "safely" from its mission statement, gutted its safety team, faces a $135 billion lawsuit from its co-founder, and is burning $14 billion in 2026 alone. The pattern is hard to ignore.

A phrase I keep hearing from people who evaluate both platforms for enterprise procurement: "OpenAI is a consumer company that makes enterprise products. Anthropic is an enterprise company that happens to have a consumer product." That distinction sounds like positioning talk until you watch it play out in seven-figure purchasing decisions. Then it starts looking like a structural explanation for why the numbers are moving in one direction and accelerating.

The revenue picture

Anthropic crossed $30 billion in annualized revenue in April, overtaking OpenAI for the first time. Fourteen months earlier, Anthropic was at $1 billion. Eight of the Fortune 10 use Claude. More than 1,000 companies spend over $1 million per year on Claude, a figure that doubled in under two months. Anthropic holds 32% of the enterprise API market. OpenAI holds 25%. In direct competition for enterprise contracts, Anthropic wins 70% of the time. The consumer metrics paint a different but equally unfavorable picture for OpenAI. ChatGPT's web traffic share dropped from 87% to 68% over twelve months. Its app market share fell from 69% to approximately 45%. Claude's US market share doubled in March alone, jumping from 2.5% to 5.6% in a single month. Google Gemini crossed 750 million monthly active users and 2 billion monthly visits.

Where the models actually stand

Gemini 3.1 Pro leads 13 of 16 major benchmarks. It scores 94.3% on GPQA Diamond compared to GPT-5.4's 87-89%, hits 77.1% on ARC-AGI-2 versus GPT-5.4's 73.3%, and does all of it at $2 per million input tokens, half what Claude Opus charges and roughly matching GPT-5.4's price. Claude Opus 4.6 tops SWE-bench Verified at 80.8%. GPT-5.4 takes SWE-bench Pro at 57.7%. The numbers are close enough that parity is arguable. But when you're the most expensive option and no longer the highest performing, parity amounts to a loss. Then there's the generation ahead. Claude Mythos Preview, still restricted from public access, scored 93.9% on SWE-bench Verified and 97.6% on USAMO 2026 math. Those results represent a full generational jump. OpenAI has announced nothing that competes with them.

The GPT-5 reception

GPT-5 didn't simply underwhelm. It generated active hostility from users. Developer forums and Reddit filled with complaints after launch. Slower responses, degraded reasoning, increased hallucinations compared to GPT-4o. The recurring assessment: "GPT-4o was sharp, focused, and reliable. GPT-5 feels unstable and inconsistent." GPT-5.2 compounded the problem. Users characterized it as "everything I hate about 5 and 5.1, but worse." Hallucination rates during certain periods were described as extremely high. GPT-5.2 Instant drew criticism for feeling "bland, refusing more, and hedging more," probably because OpenAI applied aggressive safety tuning as a reaction to the backlash rather than addressing underlying quality problems. The likely root causes include model transition complications and over-aggressive RLHF safety tuning. Multiple developers and researchers have concluded that OpenAI is also routing queries to smaller, cheaper models to control compute costs. The erratic quality variation between sessions, performing the same task twice and receiving dramatically different capability levels, aligns with that analysis. Users noticed the inconsistency.

The coding tools gap

Claude Code launched in beta in October 2025. Six months later, it leads the market in user satisfaction: 46% "most loved" rating, 91% customer satisfaction, NPS of 54. GitHub Copilot, which holds 29% market share (the largest), scores 9% on "most loved." Developers use Copilot because their GitHub subscription includes it. They use Claude Code because they choose to. Claude Code reached $2.5 billion in annualized revenue, one of the fastest product ramps in enterprise software history. More than half that revenue comes from enterprise customers. The JetBrains developer survey from April 2026 shows the adoption trajectory: 18% of developers worldwide use Claude Code at work (24% in the US and Canada), tied with Cursor and closing rapidly on Copilot's 29%. A year ago, Claude Code did not exist. OpenAI responded with a rebuilt Codex platform, launched in February 2026 as a standalone cloud coding agent. Usage grew from 5% of Claude Code's volume in late 2025 to about 40% by January 2026. Reuters reported that pressure from Claude Code directly caused OpenAI to redirect engineering resources toward the Codex relaunch. They're chasing a category Anthropic defined.

Google's pricing weapon

The Claude-versus-OpenAI narrative dominates discussion, but Google may represent the more fundamental threat to OpenAI's position. Google is making frontier-class AI free or nearly free. Gemini Code Assist became free for individual developers in March 2026. Gemini 3 Flash costs $0.50 per million input tokens, a fraction of what OpenAI or Anthropic charge for comparable performance. Over 70% of Google Cloud customers already use Gemini-powered tools. Google's advantage extends beyond pricing to distribution. Gemini runs on 1 to 5 billion devices across Search, Android, Workspace, Chrome, and Google Cloud. It has 8 million paid enterprise seats across 2,800 companies. When non-technical users encounter AI that isn't ChatGPT, it's almost always Gemini, embedded in products they already use daily. Andrew Ng, arguably the most influential AI educator in the world, told his Stanford CS230 class that Claude Code is his current favorite coding tool. He also observed that Gemini 3 "seems like another huge leap forward." He didn't reference OpenAI.

Internal dysfunction

Products can be repaired. Organizational culture is a harder problem. OpenAI has revised its mission statement six times in nine years. In February 2026, the word "safely" was quietly removed from "AI that safely benefits humanity." The change was buried in a tax filing. The company completed its for-profit conversion in October 2025, eliminating the clause "unconstrained by a need to generate financial return." Investors now hold board seats with direct profit-sharing arrangements. The majority of senior safety leadership has departed. The catastrophic risks lead stepped down less than nine months after the previous lead was reassigned without announcement. More than a dozen senior researchers left for Anthropic and Google DeepMind, several publishing public criticisms during their departures. Elon Musk's lawsuit goes to trial April 27 in Oakland. He seeks $135 billion in damages, wants Sam Altman and Greg Brockman removed, and wants OpenAI's nonprofit status restored. Microsoft, OpenAI's largest shareholder, is reportedly weighing its own legal action over alleged contract violations. One analysis summarized it plainly: OpenAI entered 2026 "confronting a pattern of broken commitments that had turned some of its closest allies into adversaries."

The financial reality

OpenAI projects $14 billion in losses for 2026, following $11.5 billion lost in 2025. Cumulative losses through 2028 are projected at $44 billion. The burn rate sits at 57% of revenue without declining. Anthropic's trajectory moves in the opposite direction. Their burn rate drops to one-third of revenue in 2026, projected at 9% by 2027. Anthropic is building a business that sustains itself. OpenAI is building one that requires continuous capital infusions to survive. OpenAI raised $122 billion at an $852 billion valuation. That figure looks impressive until you account for the need behind it. At a 57% burn rate on $25 billion in revenue, OpenAI spends $14 billion more than it earns. Every year. For the foreseeable future. The path to profitability runs through 2029, with $200 billion in annual revenue projected for 2030. That projection assumes OpenAI stops losing ground. Given the last twelve months, that's a substantial assumption.

The case for OpenAI, examined honestly

Dismissing a company with 800 million users and Microsoft's full financial support would be premature. OpenAI is being woven into Windows, Azure, Office 365, and the broader Microsoft developer ecosystem. That distribution channel is something Anthropic cannot match and even Google cannot easily replicate. The consumer user base also is a data flywheel, hundreds of millions of daily interactions informing OpenAI about what people want from AI. The problem is that none of this is converting into product superiority. Microsoft distribution places ChatGPT in front of users without making them prefer it. The data flywheel should yield better models, but GPT-5 was worse than GPT-4o by many users' accounts. That trajectory resembles IBM's path, not Apple's.

What recovery would require

OpenAI isn't finished. They have 800 million users, deep Microsoft integration, and sufficient capital to operate for years. But operating isn't the same as winning, and they are not currently winning. They need a model release that erases the memory of GPT-5. The kind of capability jump that Mythos represents for Anthropic. And they need to resolve what the company stands for, because "we used to prioritize safety but now we prioritize revenue" doesn't build enterprise trust. Anthropic has its own safety tensions, the Pentagon relationship, the loosened commitments, but at least the contradictions are visible. OpenAI buries its changes in tax filings. The window for course correction hasn't closed. But the numbers move in one direction, and they move fast. Anthropic's enterprise share shifted from 50/50 with OpenAI to 60/40 in roughly ten weeks. That rate of change is unprecedented in enterprise software. Claude is the tool developers prefer. Gemini is the tool everyone uses whether they realize it or not. ChatGPT is the tool everyone recognizes by name. That last advantage is genuine, but it's the kind that depreciates. Brand recognition without product superiority is a wasting asset.

Gloss What This Means For You

If you’re choosing an AI stack, treat this as a prompt to re-run evaluations rather than defaulting to OpenAI: benchmark Claude, Gemini, and GPT side-by-side on your real workloads, latency, and cost. If you’re buying for an enterprise, watch vendor reliability signals—model regressions, routing behavior, and policy shifts can matter as much as raw benchmark scores. And if you’re building products on top of these models, design for portability (multi-model routing, abstraction layers, and fallback options) so a single vendor’s turbulence doesn’t become your outage.