Sebastian Mallaby is not an AI hype blogger. He is a senior fellow at the Council on Foreign Relations, author of a seminal biography of Alan Greenspan, and one of the few economists who has spent serious time studying the venture capital industry from the inside. When he went on record saying OpenAI has a "50-50 chance of going bust by next summer," the question is not whether the claim is sensational. The question is why a sober economist said it — and what analysis sits underneath the coin-flip metaphor.
The 50-50 Bet
Mallaby's statement was precise: "I would say there's a 50-50 chance OpenAI would go bust in the next 12 months — not because the technology fails, but because the business model doesn't work fast enough." That qualifier is doing significant work. The technology is not in question. The structural economics are.
The mechanism is straightforward and brutal. OpenAI's compute costs scale with usage. More users, more queries, more tokens processed — all of it translates directly to infrastructure spend. Revenue is growing, but not at the rate that closes the gap between what it costs to run the models and what enterprises are willing to pay per token. That gap is not a temporary startup problem. It is a structural feature of a market where the product is rapidly becoming a commodity.
US AI investment has reached $285.9 billion — larger than any previous technology wave at this comparable stage of development. Mallaby's point is that the scale of investment means the scale of failure will also be historic. This is not a "some startups will fail" situation. This is a "one of the three largest technology companies on earth might not exist in 18 months" situation. The venture industry has never absorbed a loss at that magnitude from a single bet.
What saves OpenAI, in Mallaby's analysis, is not the technology. It is the relationships — the enterprise contracts locked in at multi-year terms, the consumer brand recognition that still carries weight, the political connections that make OpenAI a fixture in regulatory conversations in Washington. None of those are sufficient on their own. Together, they form the kind of institutional inertia that might keep a fundamentally challenged business alive long enough for the economics to shift. Might.
A-Plus Technology, C-Minus Business Model
Mallaby's full assessment lands harder in context: "You have a company with A-plus technology and a C-minus business model. That combination produces one of two outcomes: the business model catches up, or the technology becomes someone else's asset." The second outcome is the one that should concern the people who have put $285.9 billion into the space.
The specific structural problem is that OpenAI's revenue model runs on two channels — API usage and ChatGPT subscriptions — both of which are easily replicated and increasingly competed on price. The API market is collapsing toward commodity pricing as every major cloud provider bundles competing models into existing enterprise relationships. The consumer subscription market faces constant pressure from free tiers and alternative products.
OpenAI cannot charge a meaningful premium because its products are converging with the rest of the market. Every successive model release narrows the quality gap between OpenAI's frontier models and the alternatives. Six months of performance advantage is no longer enough to anchor enterprise pricing.
"A-plus technology with a C-minus business model produces one outcome: the technology becomes someone else's asset."
Sebastian Mallaby — Council on Foreign RelationsMeta's open-source strategy is particularly damaging to OpenAI's position. Llama is now good enough for the majority of enterprise use cases — summarization, classification, document processing, customer service automation — and costs essentially nothing to run on existing cloud infrastructure. OpenAI cannot price against free. The playbook that worked in 2023, when ChatGPT's capabilities were genuinely discontinuous from anything else available, no longer functions in a market where open-source alternatives are within one generation of parity.
The talent problem compounds the business problem in a way that is difficult to escape. Meta is running a systematic campaign to recruit OpenAI engineers, offering compensation packages that OpenAI cannot match without revenue it does not yet have. As Demis Hassabis reportedly described Meta's equivalent campaign targeting Google DeepMind: "Sebastian, they've parked their tanks on the lawn. This is war." OpenAI is fighting the same war on a weaker balance sheet.
Project Mario: The Safety Battle Hassabis Lost
Mallaby disclosed details of "Project Mario," a secret internal initiative at Google DeepMind named, apparently, after the video game character. The name was deliberately mundane — an attempt to keep the work off the radar of Google's product and timeline teams.
Project Mario was DeepMind's attempt to build safety constraints directly into model architecture rather than applying them after training as RLHF-based guardrails. The architectural approach was technically more robust — constraints baked into the model's core structure are harder to circumvent and more generalizable across deployment contexts than inference-layer filters. The problem was timelines. The architectural approach added approximately 18 months to DeepMind's development cycle, and Google leadership did not believe the competitive landscape would pause for it.
Hassabis fought for Project Mario. He lost. DeepMind shipped Gemini with the same inference-layer safety approach used by every other major lab — because that was the approach that could ship on a competitive timeline. The architectural work was shelved.
Mallaby's reading of Project Mario is that it represents the structural problem with safety research at every major lab. Safety research does not compete with deployment timelines on equal footing. Deployment timelines are set by competitive pressure. Safety readiness is not a market signal. "Hassabis lost that battle," Mallaby observed, "because the war didn't pause for it." That is the sentence that should be quoted in every AI governance discussion happening right now.
The Motivation Map
Mallaby's most original analytical contribution is what he calls a "motivation map" of the people who might actually change the trajectory of AI development. The map is uncomfortable because it reorders the conventional narrative about who drives change and how.
He identified four categories of actors:
Mallaby's thesis is that meaningful policy change requires aligning categories one and two. Category three follows once they have sufficient cover. Category four is the greatest ongoing risk to any safety initiative because it operates silently and continuously, redistributing talent away from the work that most needs doing.
Most current AI governance discussions are focused on category three — what regulators should do. Mallaby argues this is exactly backwards. Regulators cannot move without commercial cover. Commercial actors will not move without credible pressure from researchers who have standing to make the technical case. The sequence matters.
Chernobyl for AI
Geoffrey Hinton, who left Google specifically to speak without institutional constraint, has described the possibility of a "Chernobyl for AI" — a single catastrophic failure that triggers massive regulatory overreaction. Mallaby's interpretation of Hinton's position is precise: Hinton does not believe the Chernobyl scenario is likely. He believes it is possible. And his point is that the industry is not building the institutions that would either prevent it or contain its consequences.
What a Chernobyl-equivalent looks like for AI is not a robot uprising — that framing is both inaccurate and counterproductive. The realistic scenario is a large-scale model deployment that causes measurable, attributable real-world harm at a scale that cannot be plausibly attributed to human error. Financial system disruption from a model deployed in trading infrastructure. A medical diagnosis system that introduces systematic errors across a hospital network. Critical infrastructure failure that traces back to an automated decision system. The harm is mundane in its mechanism and catastrophic in its scale.
The Chernobyl analogy is precise in one specific way: the nuclear industry had safety institutions — international coordination bodies, mandatory incident reporting, engineering standards, liability frameworks — before the major accidents. Those institutions were imperfect and did not prevent Chernobyl. But they contained the aftermath and drove the regulatory response. The AI industry is building products at nuclear scale and has nothing equivalent. As Mallaby put it: "The question isn't whether something will go wrong. Something always goes wrong at scale. The question is whether the institutions exist to absorb it."
Three Policy Fixes Nobody Is Building
Mallaby's policy prescriptions are unusually specific for an economist who works primarily in analysis rather than advocacy. That specificity is worth taking seriously.
First, compute auditing: mandatory disclosure of training compute by any model that exceeds a defined threshold. This creates a verifiable paper trail for capability claims. When a lab says its model is equivalent in capability to a previous generation, compute disclosure makes that claim checkable. Currently no such requirement exists in any jurisdiction. The absence is not an oversight — it is a feature preferred by labs that benefit from opacity around their actual capability trajectories.
Second, product liability for AI deployed in high-stakes contexts. Models used in medical diagnosis, legal analysis, or financial decision-making should carry product liability equivalent to physical products in the same domains. Software currently operates behind near-total liability shields. Removing those shields in specific high-risk domains creates a direct commercial incentive for safety investment — not because labs become more ethical, but because the cost-benefit calculation on safety testing changes fundamentally.
Third, international coordination on frontier model development — not a moratorium, not a ban, but a framework analogous to the Nuclear Non-Proliferation Treaty that creates mutual transparency and slows race dynamics without halting development. The goal is not to prevent AI development. The goal is to ensure that no major power is in a race condition so extreme that safety decision time is eliminated entirely.
None of these three proposals are on any legislative agenda in any major AI-developing country. Not in the United States, not in the European Union beyond what the AI Act addresses, not in the United Kingdom, not in China. The gap between the sophistication of the analysis and the emptiness of the policy pipeline is the most alarming data point in Mallaby's entire account.
| Policy Fix | Current Status | Who's Blocking It | What Would Change |
|---|---|---|---|
| Compute auditing | Not proposed | Labs (competitive disclosure) | Training transparency |
| Product liability for AI | Not proposed | Tech lobby | Commercial safety incentives |
| International coordination | Informal talks | Geopolitical competition | Race dynamic reduction |
| Open-source safety standards | Voluntary only | No enforcement mechanism | Baseline floor |
Mallaby's conclusion does not leave room for optimism about the current trajectory: "We have built the most consequential technology since nuclear power and we have responded with blog posts about responsible AI. That gap will close — either through policy or through incident." The coin-flip he assigned to OpenAI's survival is, in this reading, a microcosm of a much larger bet the industry is making about whether governance can arrive before the costs of its absence do.
Related Reading
On the demand economics underlying AI investment: The AI Bubble's Missing Demand Problem. On how enterprise adoption is reshaping AI economics: The CEO Sycophancy Trap. On what consumer AI looks like once governance catches up: The Anticipation Gap.