Skip to content

The Infographics Show AI Business

The $15,000 AI Bill. Your $20 Subscription is a DELUSION

The Infographics Show takes one uncomfortable number and builds an entire economic argument on top of it. A serious Claude Code power user chews through roughly 10 billion tokens a year, and at standard API rates that workload costs about $15,000. The same person on a flat rate Max subscription pays around $1,200. The missing $13,800 is a 92% subsidy, and it is not magic, it is venture capital deliberately eating losses to get a generation hooked before the price tags change.

Published Jun 9, 2026 18:11 video 24 min read Added Jun 14, 2026 Open on YouTube →

At a glance

The Infographics Show takes one uncomfortable number and builds an entire economic argument on top of it. A serious Claude Code power user chews through roughly 10 billion tokens a year, and at standard API rates that workload costs about $15,000. The same person on a flat rate Max subscription pays around $1,200. The missing $13,800 is a 92% subsidy, and it is not magic, it is venture capital deliberately eating losses to get a generation hooked before the price tags change. The video calls this the AI Uber moment, walks the same playbook from ride sharing into AI, and then shows why the math is actually worse this time: agentic workflows burn 5 to 30 times more tokens than old chatbots, the hardware rots in 18 months, the revenue is partly an accounting illusion, and the bill simply has not arrived yet.

This is an explainer that is really a warning. It argues, number by number, that cheap AI is a temporary promotional rate, that the correction is already penciled in, and that when it lands it will hit freelancers and small businesses first and hardest. Below is the full case, chapter by chapter, with every figure the narrator puts on the table and nothing softened.

The full walkthrough

The hook: a trillion dollar house of cards

The video opens by calling your bluff. You think the $20 AI subscription is the deal of the century. The narrator says it is a trap. A power user on a tool like Claude Code actually costs $15,000 a year to run, and you only pay a sliver of that because venture capitalists are footing the difference. You are living inside the AI Uber moment, a temporary illusion built to get you hooked before the prices reset. The money is running out. When this trillion dollar house of cards collapses, the tools you rely on every day will either vanish or cost you ten times more. The thesis, stated flat: the economics of AI are broken.

Chapter 1: the $20 illusion

It starts with your wallet. A serious Claude Code user runs through roughly 10 billion tokens a year. Tokens are the thought units of AI: every word it reads, every word it writes, every decision it makes spends a token. Run that 10 billion through a standard API and you would pay around $15,000 a year. That is the real unsubsidized price, no discounts, no incentives, just the raw compute cost. That same user on a flat rate Max subscription pays around $1,200 for the whole year for the identical workload. From $15,000 down to $1,200. A 92% hidden subsidy.

The narrator makes the absurdity concrete: imagine walking into a dealership, picking a car priced at $15,000, and being told you only owe $1,200 because someone, somewhere, covered the rest. It makes no sense, and the strangeness is the point. The explanation sits in OpenAI's own financial projections, leaked to The Information: the company is on track to lose $14 billion in 2026. Not revenue. Losses. A $22 monthly subscription covers about 1.7% of what an active power user actually costs to serve.

Which leads to the line the whole video pivots on: you are not a customer, you are bait. Every prompt, every line of generated code, every late night chat session is paid for by investors who are betting nobody will be able to live without the product once the real bill lands. Whole industries are being signed up at a loss. Law firms run document review at five cents on the dollar. Marketing agencies turn out campaigns at prices that would have been impossible eighteen months ago. Hospitals trial diagnostic tools at sticker prices no model provider could sustain at scale. Every deal is propped up by patient capital that expects ten times its money back. So why race to sign up more users if you lose money on each one? Because we have seen this exact playbook before, and we know how it ends.

Figure 1. The whole video in one bar. A power user pays $1,200 a year for a workload that costs about $15,000 a year to serve. The empty space above the $1,200 bar is the 92% gap that venture capital is quietly covering. A $22 monthly plan funds only 1.7% of an active power user.

Chapter 2: the ghost of Uber

Back in 2014, a black SUV pulled up outside your apartment in three minutes. The driver was polite, the car spotless, the trip to the airport cost eleven bucks. You wondered how it added up. It did not, and that was the point. It was never meant to. For the better part of a decade an entire generation lived inside what economists later named the Millennial Lifestyle Subsidy. Venture capitalists poured money into ride sharing, food delivery, co-working spaces, and meal kits on purpose, setting prices below cost to crush legacy competitors and build a habit. The plan was always the same: take over first, then raise prices until it turns a profit.

Uber's take rate, the slice of every fare the company keeps, tells the story. In 2022 Uber kept about 32 cents of every dollar a rider paid. By 2024 that had climbed to roughly 42 cents. Drivers got a smaller share, riders paid more, and the company eventually posted a profit. Now it is happening in AI, with the same investors, the same playbook, and the same pricing memo. Industry analysts expect consumer subscription tiers to roughly double in price over the next two years. Anthropic has rolled out new rate limits that gently push power users toward higher priced plans. Google is testing premium only Gemini features that used to be free. A 100% price hike is not a rumor, the narrator says, it is already penciled in on the calendar. Enterprise contracts follow the same curve: custom deals signed in 2024 are quoted much higher in 2026 renewals. Same product, multiple times the price, take it or leave it.

Then the video draws the line that makes AI worse than ride sharing. Ride sharing had to do one thing, move a car from A to B, and that cost does not explode as usage rises. If anything it gets more efficient with more drivers, more density, and better routing. AI works differently. The underlying math of thinking does not get cheaper the same way, it gets complicated fast. The narrator concedes the executives' favorite line is technically true: compute does get cheaper every year, chips are more efficient, models are leaner, and each individual word an AI generates is genuinely cheaper to produce than eighteen months ago. That is the part they want you to hear. The next chapter is the part they do not.

Chapter 3: the Claude Code math

Here is the trap inside the half truth. Modern agentic workflows, the kind that power Claude Code and ChatGPT's deep research tools, burn anything from 5 to 30 times more tokens than a simple chat session from two years ago. When you ask a code assistant to fix this bug, it does not write fifty words of response. It quietly spawns subtasks, rereads your files, checks its own work, writes draft after draft, throws most of them away, and runs tests in the background. A single user request can chew through hundreds of thousands of tokens before any answer appears on screen.

So the per word price falls while the words per request explode, and the total bill shoots upward anyway. The narrator names it: the token tax. It bankrupts scrappy AI startups burning through their seed rounds, and it threatens to wipe out one of the most profitable business models in the history of the internet. This is the engine fault under the whole industry. Cheaper tokens times vastly more tokens equals a bigger invoice, and the cheaper part is the only part anyone advertises.

Figure 2. Why agentic AI breaks the old subscription math. A single "fix this bug" does not return fifty words, it fans out into subtasks, file rereads, self checks, repeated drafts, and background tests. Each step spends tokens, and the request burns hundreds of thousands of them, 5 to 30 times a chatbot from two years ago. The per word price can fall and the total bill still climbs.

Chapter 4: the search penalty

For 25 years Google printed money, and the mechanism was brutally simple. A user types a query, Google returns ten blue links pulled from the open web, and the total cost in servers, electricity, and indexing is a fraction of a cent per search. The ads next to the results earn far more than that. Margin, the narrator calls it, one of the great financial miracles of modern times. Now Google is rebuilding the entire system on top of generative AI. A single AI powered search response, the kind that writes a paragraph long answer instead of showing links, costs significantly more to produce than a traditional keyword search. Multiply that across billions of queries a day. If Google fully replaces traditional search with AI overviews, the most reliable profit machine of the 21st century vanishes, and the margins that funded YouTube, Android, Waymo, and Gmail begin to dry up. Wall Street analysts have quietly mapped the worst case, and the numbers are catastrophic.

It gets worse, because the advertising model goes redundant too. When AI just hands you the answer, nobody clicks the links, so advertisers stop paying. Google faces a future where it serves more queries than ever, costs more to run than ever, and earns less revenue per query than at any point in its modern history. So why do it at all? Because tech giants are willingly cannibalizing their most profitable businesses on purpose, having decided that the only thing more dangerous than killing a cash cow is letting a competitor kill it first. Business school has a name for this: the innovator's dilemma. When a new technology threatens the core business, an incumbent can sit still and defend the existing cash engine while a rival builds the future, or cannibalize itself on its own terms and hope to grow revenue on the next platform before the old one erodes. Google, Microsoft, and Meta are all betting on the second path, betting that AI eventually replaces the current money makers. Nobody can prove it is true, and everybody is in too deep to back out. Which raises the obvious question: if the unit economics are this bad, how are these companies posting record AI revenues every quarter?

Chapter 5: the round trip scam

This is where it gets clever. Microsoft commits very publicly to investing $13 billion in OpenAI. The press release is slick, the headlines dramatic, stock prices rise, investors are happy. Read the fine print and a different story appears. A big chunk of that investment never hits OpenAI's bank account. It arrives as Azure cloud credits, essentially a gift card redeemable only at Microsoft's own data centers. OpenAI records the sum as capital raised. Microsoft logs the cloud usage as revenue. It is an investment and a sale at the same time. OpenAI has separately committed to spending up to $250 billion on Azure services, locking the loop in for years.

Now layer Nvidia on top. Nvidia announces tens of billions in commitments to OpenAI. OpenAI turns around and uses that capital to buy Nvidia GPUs. Nvidia posts a record quarter and its stock soars. The whole cycle takes a few months, almost no real money changes hands, and the same dollar simply gets a new name at each stop. Add Oracle, CoreWeave, and AMD to the list, each one investing and then selling services to the next, each one booking revenue as the dollar circulates. The technical name for this is round tripping. In Silicon Valley, the narrator notes, it is called strategic partnership.

Figure 3. The round trip. One dollar travels Microsoft to OpenAI as an "investment" (largely Azure credits), OpenAI to Nvidia to buy GPUs, and on through Oracle, CoreWeave, and AMD. Every stop records it as either capital raised or revenue earned, so the same money powers headline numbers at six companies while barely moving. Silicon Valley's term for it is strategic partnership.

Chapter 6: the hardware debt trap

The spending is the part that cannot be papered over. In 2025 big tech is projected to spend roughly $320 to $400 billion on AI infrastructure, and updated forecasts for 2026 push the figure toward $500 billion: data centers, GPUs, cooling systems, power delivery, entire grids reinforced to handle the load. Meanwhile total global consumer spending on AI services is only about $12 billion, according to the Menlo Ventures State of Consumer AI report. Hundreds of billions flow out while $12 billion comes in. The gap is the size of a midsized country's entire economy, and it is filled not with revenue but with debt: corporate bonds, structured credit, and private lending. Meta alone raised $30 billion in bond markets in late 2025, plus another roughly $30 billion through a Morgan Stanley arranged joint venture built to keep the liabilities off Meta's public balance sheet. Microsoft signed a 20-year power purchase agreement to restart Three Mile Island. Google partnered with NextEra Energy to reopen nuclear power plants. Those promises do not go away if AI revenue underperforms.

And the hardware itself does not last. A high-end Nvidia GPU, the kind powering most of this boom, has a useful life of just one to three years before the next generation makes it outdated, and it loses most of its book value the moment a new generation ships, which now happens roughly every eighteen months. A data center full of three-year-old chips is, in industry terms, dead weight. The narrator contrasts this with the original dot com bust. When that bubble popped in 2000, telecom companies left behind millions of miles of fiber optic cable buried in the ground. New companies bought it for pennies on the dollar and built YouTube, Netflix, and Spotify on top of it. The crash was brutal, but the wreckage was useful. This AI bubble will instead leave behind warehouses of useless silicon, locked into 20-year power contracts, and concrete shells in the middle of nowhere that nobody will know what to do with. Utilities will pass higher electricity rates to households for decades whether or not the AI revenue ever shows up. A gap of hundreds of billions cannot be papered over for long, and the companies running the race know it, so they are quietly slowing the bleeding before the public catches on. Most users have already felt it. They just have not connected the dots.

Figure 4. The structural hole. Big tech is pouring $320 to $400 billion into AI infrastructure in 2025, with 2026 forecasts nearing $500 billion, while global consumers spend only about $12 billion a year on AI. That sliver at the bottom right is the entire revenue base. The rest is filled with bonds, structured credit, and 20-year power contracts, against hardware that goes obsolete in 18 months.

Chapter 7: the stealth nerf

The slowing of the bleeding has a texture, and you have probably noticed it. An AI model that used to one shot your code now forgets your project halfway through. A chatbot that used to write five paragraphs cuts off at three. An image generator that rendered a flawless portrait in thirty seconds now spits out something with seven fingers and asks you to upgrade to the next tier. Nobody is imagining it, the narrator insists. The product is getting worse. When the numbers stop working, the easiest lever a provider can pull is to quietly water the service down.

The signs are easy to spot once you look. Message caps that used to refresh every five hours suddenly refresh every eight. The default model in an app gets quietly swapped from a flagship to a smaller, cheaper version. Memory features get rolled back. Advanced reasoning gets locked behind a higher price tier. A god model promised in launch keynotes is quietly swapped for a cheaper, less intelligent one. Reddit threads about AI tools fill with users swearing their assistant has gotten lazier, and engineers post side by side screenshots of the same product producing visibly worse output than six months earlier. Companies almost always deny it, and sometimes they release selected benchmarks: clean prompts, controlled conditions, optimized scenarios designed to show performance at its best. It buys time. It does not fix the bigger problem, which has already started taking out the first wave of an entire ecosystem.

Chapter 8: the 2026 mass extinction

Roughly 40% of AI startups launched in 2024 have already shut down or been acqui-hired by bigger players, according to CB Insights data. Acqui-hire is the polite term for a fire sale, a struggling company sold for cents on the dollar to a rival who is not really buying a business at all. The buyer takes the engineers, shuts down the product, and absorbs whatever talent it can. These were not garage hobby projects. They closed series A rounds with serious investors, they had revenue, they had paying customers, they had glowing TechCrunch profiles. Then within eighteen months the lights went off.

The reason is almost always the same: cost of goods sold. The money these startups pay to model providers like OpenAI, Anthropic, and Google is so high it wipes out any margin they could charge. A startup that wraps a polished interface around GPT-4 might charge $50 a month while the same customer's API usage costs the startup $80. Every active user is negative revenue, so the more successful the marketing, the faster the company bleeds out. And then there is the cliff above them: when a foundation model provider ships a new feature, it often kills ten startups overnight. ChatGPT launches native voice mode, and a half dozen voice agent startups that closed series A rounds last quarter say goodbye. Claude releases native PDF reading, and a whole crop of document tools becomes useless in a single product update. An ecosystem of independent AI companies is collapsing under compute costs nobody can profitably absorb. And when those startups die, the cloud providers lose the round trip revenue that made the foundation model investments look like good business in the first place. That is when the final phase begins.

The startup wrapper	Per customer, per month
Price charged to the customer	$50
API cost that customer generates	$80
Result on every active user	negative $30
Effect of growth	more marketing, faster bleed out
Effect of a new foundation feature	whole category killed overnight
2024 cohort already gone	roughly 40%

Figure 5. Why the wrappers die. A startup that resells a foundation model charges $50 and pays $80 in API costs, so every customer is a $30 loss and success only accelerates the failure. Add the risk that a provider ships your feature for free and erases your category, and about 40% of the 2024 cohort is already shut down or sold for parts.

Chapter 9: the great AI rug pull

Venture capital firms are no longer willing to cover losses on a promise of future glory. They want a path to profit in writing, with quarterly milestones, and they want it now. For foundation model companies that means one of two things. The first is a brutal, sudden repricing. A $20 consumer plan becomes a $100 plan, or it quietly disappears and is replaced by a pro tier that costs ten times more for the same features. A Claude Code user who paid $1,200 a year suddenly faces an invoice closer to the $15,000 the API actually costs. A freelance designer relying on a $10 image generation subscription gets an email explaining the plan is being moved to a new structure. Small businesses that built workflows on cheap AI face a choice: pay ten times more, or go back to doing it the old way.

The second option is worse: the services simply get shut down. We have already seen the first signs. Smaller AI companies have folded with thirty days notice, leaving customers scrambling to move years of work to whatever competitor is still standing. Specialized models for legal research, medical imaging, and customer support have been pulled because their economics never worked. An era of cheap AI ends with a thousand small invoices and a thousand small shutdown notices.

The deeper truth, the narrator says, is uglier than any single price hike. AI in 2026 is on track to become a luxury, not a basic product. The cheap versions trained an entire generation to need it, and the expensive version is the only one the balance sheets now allow to exist. Big companies that can afford the new pricing tier lock in their advantage. Freelancers, small businesses, and the early adopters who created the buzz get priced out first. An economy built on the idea of cheap intelligence is about to slam into the reality of expensive intelligence. Productivity assumptions made in 2024 will not survive in 2027. The promised AI revolution will arrive, just not for everyone, and not at the price they were sold.

And the timing may be vicious. History says crashes do not take a year to play out. The dot com bust took two years from peak to trough, but the AI bubble has more leverage, more concentration, and more debt baked into its foundations. When it tips, it can move in months, maybe weeks. When the margins shrink, when the first big enterprise customer publicly walks away from a renewal, the confidence can vanish overnight. The tools millions rely on every day were never as cheap as anyone thought. They were held up by investor money that is finally starting to dry up. An AI age might still be coming, but the cheap AI age, the one that fooled a generation into rebuilding their working lives on top of it, is already over. The bill simply has not arrived yet. And when it does, the narrator closes, that price will never feel real again. The video signs off by handing you to its sequel: what happens to the economy if the $2 trillion AI bubble bursts.

Key takeaways

The unsubsidized cost of a Claude Code power user is about $15,000 a year for roughly 10 billion tokens, against the $1,200 a Max plan charges. That is a 92% hidden subsidy, and a $22 monthly plan covers only 1.7% of an active power user.
The subsidy is funded by losses. OpenAI is projected to lose $14 billion in 2026. The strategy is the Uber playbook: price below cost, build a habit, then raise prices. Analysts expect consumer plans to double within two years.
AI breaks the ride sharing analogy because agentic workflows burn 5 to 30 times more tokens than old chat sessions. Cheaper per word times far more words equals a rising bill. This is the token tax.
Generative search threatens Google's core margin: AI answers cost more per query and kill the ad clicks, so Google may serve more queries, pay more to run them, and earn less per query than ever.
The headline revenues are partly an accounting loop. Round tripping moves the same dollar through Microsoft, OpenAI, Nvidia, Oracle, CoreWeave, and AMD, booked as both investment and revenue at each stop.
The spending dwarfs the income: $320 to $400 billion in 2025 infrastructure (toward $500 billion in 2026) against about $12 billion in global consumer revenue, the gap filled with debt and 20-year power contracts against GPUs that go obsolete in 18 months.
The early warning is stealth degradation: longer message cap refreshes, flagship models swapped for cheaper ones, reasoning locked behind higher tiers.
About 40% of 2024 AI startups are already dead or acqui-hired because a wrapper charging $50 can pay $80 in API costs, and a single new foundation feature can erase a whole category overnight.
The endgame is repricing or shutdown. AI becomes a luxury, the early adopters get priced out first, and because of the leverage the correction could play out in months, not the two years the dot com bust took.

Chapters

Timestamps are clickable. Click one and the player jumps there and keeps playing while you read.

0:00 The Trillion Dollar House of Cards
0:36 Chapter 1: The $20 Illusion
2:38 Chapter 2: The Ghost Of Uber
5:03 Chapter 3: The Claude Code Math
5:56 Chapter 4: The Search Penalty
8:08 Chapter 5: The Round Trip Scam
9:35 Chapter 6: The Hardware Debt Trap
11:58 Chapter 7: The Stealth Nerf
13:21 Chapter 8: The 2026 Mass Extinction
15:02 Chapter 9: The Great AI Rug Pull

Notable quotes

You are not a customer. You are bait. narrator, 1:35

A $22 monthly subscription covers about 1.7% of what an active power user actually costs to serve. narrator, 1:28

Every single deal is being propped up by patient capital that expects 10 times returns. narrator, 2:11

It was never meant to. For the better part of a decade, an entire generation lived inside what economists later called the Millennial Lifestyle Subsidy. narrator, 3:00

A 100% price hike isn't a rumor. It's already penciled in on the calendar. narrator, 4:50

Each individual word an AI generates is genuinely cheaper to produce than 18 months ago. And that's the part they want people to hear. narrator, 5:18

A single user request can chew through hundreds of thousands of tokens before any answer shows up. narrator, 5:30

The total bill is shooting upward. It's known as the token tax. narrator, 5:38

Tech giants are willingly cannibalizing their most profitable businesses on purpose. narrator, 6:23

In Silicon Valley, it's called strategic partnership. narrator, 9:24

A data center full of three-year-old chips is in industry terms dead weight. narrator, 11:23

The crash was brutal, but the wreckage was useful. narrator, 11:31

Every active user is negative revenue. The more successful marketing, the faster a company bleeds out. narrator, 14:36

An economy built on the idea of cheap intelligence is about to slam into the reality of expensive intelligence. narrator, 16:40

The bill simply hasn't arrived yet. And when it does, that price will never feel real again. narrator, 17:45

Resources mentioned

The Infographics Show, the channel that produced this explainer, narrated by Josh Risser. Full source list for the episode on Pastebin.
Claude Code and the Claude models from Anthropic, the primary case study for token consumption and the $15,000 figure.
ChatGPT and its deep research tools from OpenAI, cited for high token agentic workflows, and the leaked $14 billion 2026 loss projection.
The Information, the publication that received OpenAI's leaked financial projections.
Uber, the take rate case study (32 cents to 42 cents per dollar, 2022 to 2024) and the Millennial Lifestyle Subsidy.
Google and Gemini, the search penalty and premium feature testing, with YouTube, Android, Waymo, and Gmail named as ad funded products at risk.
Microsoft and Azure, the $13 billion investment and $250 billion commitment at the center of the round trip.
Nvidia, Oracle, CoreWeave, and AMD, the other nodes in the round tripping cycle.
Meta and Morgan Stanley, the $30 billion bond raise and the $30 billion off balance sheet joint venture.
Three Mile Island (Microsoft's 20-year power purchase agreement) and NextEra Energy (Google's nuclear restart partner).
Menlo Ventures State of Consumer AI report, source for the $12 billion global consumer spending figure.
CB Insights, source for the 40% AI startup shutdown rate, and TechCrunch, the profiles those startups once earned.
The dot com comparison: fiber optic infrastructure that later powered Netflix, Spotify, and YouTube, versus AI's stranded silicon.

The one thing to walk away with

The cheap AI you use every day is a promotional rate, not a price. The number that matters is the gap between what you pay and what your usage actually costs to serve, and right now venture capital is quietly closing that gap on your behalf. The video's whole argument is that this cannot last: the spending dwarfs the revenue, the hardware rots faster than it can earn back, the headline numbers are partly an accounting loop, and the patient capital is losing patience. When the subsidy ends, the bill does not get smaller, it just gets handed to you. The smart move is to know your real unsubsidized cost before the invoice arrives, because once it does, the narrator warns, the old price will never feel real again.

Full transcript

You think your $20 AI subscription is the deal of the century. In reality, it's a trap. A power user on tools like Claude Code actually costs $15,000 a year to run. But you're only paying a fraction of that because venture capitalists are footing the bill. You're living inside the AI Uber moment, a temporary illusion built to get you hooked before the price tags change. But the money is running out. When this trillion dollar house of cards collapses, the tools you rely on every day will either vanish or cost you 10 times more. The economics of AI are broken. Chapter 1, the $20 illusion. It all starts with your wallet. A serious Claude Code user runs through roughly 10 billion tokens a year. Tokens are basically the thought units of AI. Every word it reads, every word it writes, every decision it makes relies on a token. If you paid for that usage through a standard API, those 10 billion tokens would cost you around $15,000 a year. That is the real unsubsidized price. No discounts, no incentives, just the raw compute costs. Now, that same user on a flat rate max subscription pays around $1,200 for an entire year for the same workload. From 15,000 down to 1,200. A 92% hidden subsidy. Imagine walking into a dealership, picking out a car priced at $15,000, and being told that you only owe $1,200 because someone somewhere else covered the rest. It doesn't make sense, and that's what makes this model so strange. But the answer lies in OpenAI's own financial projections leaked to The Information. The company is on track to lose $14 billion in 2026. Not revenue, losses. A $22 monthly subscription covers about 1.7% of what an active power user actually costs to serve. You are not a customer. You are bait. Every prompt typed, every line of code generated, every late night chat session is being paid for by investors and they are betting that nobody will be able to live without this product when the real bill finally lands. Whole industries are being signed up at a loss. Law firms running document review at 5 cents on the dollar. Marketing agencies are turning out campaigns at prices that would have been impossible 18 months ago. Hospitals trying diagnostic tools at sticker prices that no model provider could actually sustain at scale. Every single deal is being propped up by patient capital that expects 10 times returns. If companies are losing money on every user they sign up, why are they racing to sign up more? Because we have seen this exact playbook before and we know how it ends. Chapter 2. The ghost of Uber. Back in 2014, a black SUV would pull up outside your apartment in 3 minutes. The driver was polite, the car spotless. The trip to the airport cost you 11 bucks. You would wonder how any of it added up. It didn't. And that was the point. It was never meant to. For the better part of a decade, an entire generation lived inside what economists later called the Millennial Lifestyle Subsidy. Venture capitalists poured money into ride sharing, food delivery, co-working spaces, and meal kits on purpose. They set the prices below cost to crush legacy competitors and build a habit. The plan was to take over first and then raise prices until it made a profit. Uber's take rate, the slice of every fare a company keeps, tells the story. In 2022, Uber kept around 32 cents of every dollar a rider paid. By 2024, that figure had climbed to roughly 42 cents. Drivers got a smaller share. Riders paid more. The company eventually posted a profit. Now it's happening in the AI sector. It's the same investors, the same playbook, and the same pricing memo. Industry analysts expect consumer subscription tiers to roughly double in price over the next 2 years. Anthropic has rolled out new rate limits that gently push power users toward higher priced plans. Google is testing premium only Gemini features that used to be free. A 100% price hike isn't a rumor. It's already penciled in on the calendar. Enterprise contracts are following the same curve. Custom deals signed in 2024 are being quoted much higher in 2026 renewals. It's the same product. It's just costing multiple times the price. Users need to take it or leave it. Ride sharing only had to do one thing. Move a car from point A to point B. The cost of doing that doesn't explode as usage rises. If anything, it gets more efficient. More drivers, more density, better routing. AI works differently. The underlying math of thinking doesn't get cheaper in the same way. It gets complicated fast. AI executives continue to say that compute is getting cheaper every year. The unit economics will work out over time. It's not exactly a lie. It's more like a half truth. The price of running a query through a model has dropped year over year. Chips are more efficient. Models are leaner. Each individual word an AI generates is genuinely cheaper to produce than 18 months ago. And that's the part they want people to hear. Here's the part they don't. Chapter 3, the Claude Code math. Modern agentic workflows, the kind that power Claude Code and ChatGPT's deep research tools, burn through anything from 5 to 30 times more tokens than simple chat sessions of 2 years ago. When you ask a code assistant to fix this bug, it doesn't write 50 words of response. It quietly spawns subtasks. Then it rereads your files. It checks its own work. It writes draft after draft. Throws most of them away, and then quietly runs tests in the background. A single user request can chew through hundreds of thousands of tokens before any answer shows up. A model might be slightly cheaper per word than before, but it's also producing far more words per request. The total bill is shooting upward. It's known as the token tax. It bankrupts scrappy AI startups burning through their seed rounds. It's threatening to wipe out one of the most profitable business models in the history of the internet. Chapter 4, the search penalty. For 25 years, Google's printed money, and it's been brutally simple. A user types in a query, Google returns 10 blue links pulled from the open web. The total cost to Google, servers, electricity, indexing, is a fraction of a cent per search. And yet, the ads next to those results generate much more than that. Margin is one of those great financial miracles of modern times. Now, Google is rebuilding that entire system on top of generative AI. A single AI powered search response, the kind that writes a paragraph long answer instead of just showing you some links, costs significantly more to produce than a traditional keyword search. Now multiply that across billions of queries a day. If Google fully replaces traditional search with AI overviews, the most reliable profit machine of the 21st century vanishes. The margins that have funded YouTube, Android, Waymo, and Gmail begin to dry up. Wall Street analysts have quietly mapped out the worst case scenarios. And the numbers are catastrophic. And it gets worse. The advertising models become redundant, too. When AI just gives you an answer, nobody clicks on the links, so advertisers will stop paying. Google is staring at a future where it serves up more queries than ever before, costs more to run than ever before, and earns less revenue per query than at any point in its modern history. Tech giants are willingly cannibalizing their most profitable businesses on purpose. They've decided the only thing more dangerous than killing a cash cow is letting a competitor kill it first. Business school has a name for this, the innovator's dilemma. When a new technology threatens the core business, incumbents face two choices. Sit still and defend the existing cash engine while a competitor builds the future, or cannibalize it themselves on their own terms, hoping that they can build revenue on the next platform before the old one erodes. That's the path companies like Google, Microsoft, and Meta are effectively betting on with AI. They're betting that AI will eventually replace the current money makers. Nobody can prove that's true. Everybody is in too deep to back out. If unit economics are this bad, how are these same companies posting record AI revenues on Wall Street every single quarter? Chapter 5, the round trip scam. That's where things get clever. Microsoft commits very publicly to investing $13 billion into OpenAI. The press release is slick, the headlines dramatic, stock prices rise. It makes investors happy. But read the fine print and a different story shows up. A big chunk of that investment never actually hits OpenAI's bank account. It arrives in the form of Azure cloud credits. It's essentially a gift card that can only be redeemed at Microsoft's own data centers. OpenAI records that sum on its balance sheet as capital raised. Microsoft logs the cloud usage as revenue. It's an investment and a sale at the same time. OpenAI has separately committed to spending up to $250 billion on Azure services, locking the loop in for years to come. Now layer Nvidia on top of that. Nvidia announces tens of billions in commitments to OpenAI. OpenAI then turns around and uses that capital to buy Nvidia GPUs. Nvidia's quarterly revenue posts a record and their stock price soars. The whole cycle takes a few months and almost no real money has actually changed hands. It has simply been given a different name at each stop. Add Oracle, CoreWeave, and AMD to the list. Each company invests and then sells services to the next and records revenue as the same dollar flows through the cycle. The technical name for this is round tripping. In Silicon Valley, it's called strategic partnership. Chapter 6, the hardware debt trap. In 2025, big tech is projected to spend roughly 320 to 400 billion on AI infrastructure. Updated forecasts for 2026 push that figure toward 500 billion. Data centers, GPUs, cooling systems, power delivery, entire grids are being reinforced to handle it. Meanwhile, total global consumer spending on AI services is only about 12 billion. According to Menlo Ventures State of Consumer AI report, hundreds of billions are flowing out while only 12 billion going in. The gap is the size of an entire midsized country's economy. It's being filled not with revenue, but debt, corporate bonds, structured credit, and private lending. Meta alone raised $30 billion in bond markets in late 2025. There was another roughly $30 billion through a Morgan Stanley arranged joint venture set up to keep liabilities off of Meta's public balance sheet. Microsoft has signed a 20-year power purchase agreement to restart Three Mile Island. Google has partnered with NextEra Energy to reopen nuclear power plants. These promises don't go away if AI revenue underperforms, but the hardware itself doesn't last. A high-end Nvidia GPU that powers most of this boom has a short life of just 1 to 3 years before the next generation makes them outdated. It loses most of its book value the moment a new generation hits a market, which now happens roughly every 18 months. A data center full of three-year-old chips is in industry terms dead weight. Compare that to the original dot com bust. When that bubble popped in 2000, telecom companies left behind millions of miles of fiber optic cable buried in the ground. New companies bought it for pennies on the dollar and built YouTube, Netflix, and Spotify on top of it. The crash was brutal, but the wreckage was useful. This AI bubble will leave behind warehouses full of useless silicon, locked up into 20-year power contracts and concrete shells in the middle of nowhere. No one will know what to do with them. Utilities will pass higher electricity rates on to the households for decades, no matter whether the AI revenues show up. A gap of hundreds of billions of dollars cannot be papered over for long. Companies running this race already know it, so they're quietly taking steps to slow the bleeding before the public catches on. Most of the users have already felt it. They just haven't connected the dots. Chapter 7, the stealth nerf. An AI model used to one shot your code. Now it forgets your project halfway through. A chatbot used to write five paragraphs at a stretch. Now it cuts off at three. An image generator that used to render a flawless portrait in 30 seconds now spits out something with seven fingers and it asks for an upgrade to the next tier. Nobody's imagining these things. The product is getting worse. When the numbers stop working, the easiest lever a provider can pull is to quietly water the service down. The signs are easy to spot. Message caps that used to refresh every 5 hours suddenly refresh every 8. The default model in an app gets quietly swapped from a flagship to a smaller, cheaper version. Memory features get rolled back. Advanced reasoning gets locked behind a higher price tier. A god model promised in launch keynotes is quietly being swapped out for a cheaper, less intelligent version. Reddit threads about AI tools are full of users who swear their assistant has gotten lazier. Engineers are posting side by side screenshots showing the same product producing visibly worse output than 6 months earlier. Companies almost always deny it. Sometimes they'll release selected benchmarks, clean prompts, controlled conditions, optimized scenarios designed to demonstrate performance at its best. It buys them some time, but it doesn't fix the bigger problem. A deeper issue has already started taking out the first wave of an entire AI ecosystem. Chapter 8, the 2026 mass extinction. Roughly 40% of AI startups launched in 2024 have already been shut down or acqui hired by bigger players according to CB Insights data. That is the polite term for a fire sale where a struggling company is sold for cents on a dollar to a rival. The buyer isn't really buying a business. They're getting the engineers, shutting down the product, and absorbing whatever talent they can absorb. These weren't hobby projects in someone's garage. These were companies that closed series A rounds with serious investors. They had revenue. They had paying customers. They had glowing TechCrunch profiles. Then within 18 months, the lights went off. The reason is almost always the same. Their cost of goods sold, the money they pay to model providers like OpenAI, Anthropic, and Google, is so high it wipes out any margin they could hope to charge. A startup that wrapped a polished interface around GPT-4 might charge 50 bucks a month, but the API usage that the same customer generates can cost the startup $80. Every active user is negative revenue. The more successful marketing, the faster a company bleeds out. When a foundation model provider releases a new feature, it often kills 10 startups overnight. ChatGPT launches native voice mode. Say goodbye to half a dozen voice agent startups that closed series A rounds last quarter. Claude releases native PDF reading. A whole crop of document tools became useless in a single product update. An ecosystem of independent AI companies is falling apart under the weight of compute costs that nobody can profitably absorb. When startups die, cloud providers lose round trip revenue that made foundation model investments look like good business in the first place. And that's when a final phase begins. Chapter nine, the great AI rug pull. Venture capital firms are no longer willing to cover losses in the hope of future glory. They want to see a path to profit in writing with quarterly milestones. And they want to see it now. For foundation model companies, that means one of two things. The first is a brutal sudden repricing. A $20 consumer plan becomes a $100 plan, or it quietly disappears and is replaced by a pro tier that costs 10 times more for the same features. A Claude Code user who paid $1,200 a year suddenly faces an invoice closer to $15,000 that an API actually costs. A freelance designer who relies on a $10 image generation subscription gets an email explaining that their plan is being moved over to a new structure. Small businesses that built workflows on cheap AI face a choice. Pay 10 times more or go back to doing it the old way. The second option is worse. The services simply get shut down. We've already seen the first signs. Smaller AI companies have folded with 30 days notice, leaving customers scrambling to move years of work to whatever competitor is still standing. Specialized models for legal research, medical imaging, and customer support have been pulled because their economics never worked. An era of cheap AI ends with a thousand small invoices, a thousand small shutdown notices. A deeper truth is uglier than a price hike. AI in 2026 is on track to become a luxury, not a basic product. The cheap versions trained an entire generation to need it. An expensive version is the only one that balance sheets now allow to exist. Big companies that can afford a new pricing tier will lock in their advantage. Freelancers, the small businesses, and the people who powered early adoption, the ones who created the buzz, will be priced out first. An economy built on the idea of cheap intelligence is about to slam into the reality of expensive intelligence. Productivity assumptions made in 2024 will not survive in 2027. A promised AI revolution will arrive, just not for everyone, and not at the price they were sold. History says crashes don't take a year to play out. The dot com bust took 2 years from peak to trough. The AI bubble has more leverage, more concentration, and more debt baked into its foundations. When it tips, it can move in months, maybe weeks. When the margins shrink, when the first big enterprise customer publicly walks away from a renewal, that confidence can vanish overnight. The tools millions rely on every day were never as cheap as anyone thought. They were being held up by investor money that is finally starting to dry up. An AI age might still be coming. A cheap AI age, one that fooled an entire generation into rebuilding their working lives on top of it, is already over. The bill simply hasn't arrived yet. And when it does, that price will never feel real again. The confidence that made the whole AI industry feel inevitable is starting to crack. What once looked like unstoppable momentum is beginning to show the first cracks of pressure beneath the surface. Suddenly, the question shifts from how big can this get to who is going to take the hit when it doesn't. Find out in what happens to the economy if the $2 trillion AI bubble bursts.