Skip to content

Veritasium Science Philosophy

This Paradox Splits Smart People 50/50

You walk into a room. Two boxes. One is open with $1,000 you can see. The other is sealed. A supercomputer that has correctly predicted thousands of people before you has already decided what is inside the sealed box: $1 million if it predicted you would take only the sealed box, nothing if it predicted you would grab both. The boxes are set. The prediction is locked. Now choose. Take both, or take only the mystery box. This is Newcomb's paradox, and the unsettling thing is not that it is hard.

Published Mar 9, 2026 25:40 video 25 min read Added Jun 14, 2026 Open on YouTube →

At a glance

You walk into a room. Two boxes. One is open with $1,000 you can see. The other is sealed. A supercomputer that has correctly predicted thousands of people before you has already decided what is inside the sealed box: $1 million if it predicted you would take only the sealed box, nothing if it predicted you would grab both. The boxes are set. The prediction is locked. Now choose. Take both, or take only the mystery box.

This is Newcomb's paradox, and the unsettling thing is not that it is hard. It is that the answer feels obvious to everyone, and yet smart people land on opposite answers in almost equal numbers, each convinced the other side is being silly. Veritasium runs the experiment on its own staff and on strangers on the street, watches them split right down the middle, then carefully lays out both arguments with the actual expected utility math. The one-boxers walk away millionaires. The two-boxers have an argument that is airtight. Both are right, and that contradiction is the whole point.

This is not a magic trick to be debunked. It is a clean wedge that pries open three of the deepest questions there are: does free will exist, what does it actually mean to be rational, and is there an ideal way to act in life. The video follows the puzzle out from a parlor game to nuclear deterrence and ends somewhere genuinely useful. Below is the entire argument, both camps, every number, in the order Veritasium builds it.

The setup, stated exactly

The cold open is a confession. One Veritasium staffer admits there is a problem he cannot bring up without starting a fight, that it has infiltrated every single Veritasium meeting for two months, that he even argued with Derek about it. Then the setup, delivered with no spin.

You enter a room. On the table sit a supercomputer and two boxes. One box is open and contains $1,000. There is no trick. You can see the cash. The other box is sealed and you cannot see inside. You know one more thing: this supercomputer is extremely good at predicting people. It has correctly called the choices of thousands of people facing the exact problem you are about to face. It has been right almost every time.

Your options are only two. Take both boxes, the sealed one plus the visible $1,000. Or take only the sealed mystery box and leave the $1,000 on the table.

What is in the mystery box depends on a prediction the computer already made, before you ever walked in:

If it predicted you would take only the mystery box, it put $1 million inside.
If it predicted you would take both boxes, it put nothing inside.

The prediction is finished. The money is already in the box or already not. The computer is not trying to trick you and not trying to cheat you out of anything. Its single goal is to predict correctly. And it does not matter at all how it predicts. Replace the supercomputer with a super intelligent alien, a cunning demon, or a team of the world's best psychologists. All that matters is that the predictor is extremely accurate and made the call before you entered the room. Pause and decide.

Pick one box

Veritasium asks staff and strangers, and the answers come back instantly and confidently in both directions. "I should just take two boxes, like, obviously." "I'm just gonna take the $1 million and go with it." "Of course you take two boxes." "I would not get the two boxes." One person works through it out loud and lands on: "This is seeming less paradoxical than I thought because I should just go in and take the mystery box only." The reply, off camera, is a baffled "No! What?"

So there are two tribes. One-boxers take only the sealed box. Two-boxers take both. The American philosopher Robert Nozick captured exactly why this is famous: "To almost everyone, it is perfectly clear and obvious what should be done. The difficulty is that these people seem to divide almost evenly on the problem, with large numbers thinking that the opposite half is just being silly."

The puzzle is named for its inventor, the physicist William Newcomb, and popularized by Robert Nozick in 1969. It splits people for real, not just in theory. When The Guardian polled over 31,000 readers in 2016, the result was 53.5% one-boxers to 46.5% two-boxers. A coin flip, give or take. The video then does the responsible thing and gives the strongest version of each argument rather than picking a winner.

The one-boxer case, in the words of a one-boxer: "I'm a reasonable guy and I like money, so I'm gonna do whatever gets me the most money." So weigh the two decisions properly. Call C the probability the computer predicted you correctly, which means 1 minus C is the probability it got you wrong. Now compute the expected utility of each choice.

Two-box:  EU = C × $1,000      + (1 − C) × $1,001,000
One-box:  EU = C × $1,000,000  + (1 − C) × $0   =  $1,000,000 × C

If you try to two-box, there is a C chance the computer saw it coming and left the mystery box empty, so you walk with $1,000, and a 1 minus C chance it guessed wrong, left the million in, and you scoop $1,001,000. If you one-box, there is a C chance the million is there and a 1 minus C chance the box is empty. Set the two expected values equal and solve for the break-even accuracy. The crossover sits at C = 0.5005, just a hair over 50%. In other words, the instant the computer is even slightly better than a coin flip, one-boxing has the higher expected payout. And this computer is far better than a coin flip; it nailed thousands of people before you. So the one-boxer rests the case, takes the sealed box, and says "Here's my $1 million. Take that, Casper." The math is not vibes. It is a weighted average that tips to one box for any predictor worth the name.

Figure 1. The one-boxer's own math, drawn out. Once the predictor beats a coin flip by even a sliver, at C = 0.5005, one-boxing's expected payout overtakes two-boxing and keeps climbing toward the full million while two-boxing falls toward the bare $1,000. The whole one-box case lives in where these two lines cross.

Pick both

Now the other side, and it is just as clean. A two-boxer hears the one-box reasoning, finds it surprising, and says the answer is also obvious, only it is the opposite. "Here's how I think about the problem in a way that actually makes sense."

The supercomputer has already set up the boxes. Whatever you decide now does not change whether there is zero or a million dollars in the sealed box. The contents are fixed. So lay out the four possible worlds you could be in and ask what your choice does inside each.

	Mystery box has $0	Mystery box has $1,000,000
You one-box	you get $0	you get $1,000,000
You two-box	you get $1,000	you get $1,001,000

Read it column by column. If the sealed box is empty, two-boxing nets you $1,000 instead of nothing. If the sealed box holds the million, two-boxing nets you $1,001,000 instead of a flat million. Either way, whatever is already in that box, two-boxing leaves you exactly $1,000 richer. This is strategic dominance: one option beats the other in every possible state of the world, so you take it. "So, give me those boxes." Anyone in their right mind picks both.

Even the people who lean one-box feel the pull of this. One staffer admits "the two-boxer argument makes a lot of sense to me. Once you explain it, I'm like, okay, yeah, I can see why exactly you're right." But then he adds the thing that keeps him one-boxing: "I can also see that just having those thoughts in your brain are what might allow the computer to give you nothing. I think I'm grateful that I just don't have those thoughts." That single line is the whole tension. The dominance argument is correct, and being the kind of person who is moved by it is exactly what gets your box left empty.

What decision theory actually is

So you have two perfectly reasonable approaches that give two completely different answers. Veritasium's framing is the key that unlocks everything that follows: your choice reveals something fundamental about how you make decisions. It comes down to two statements, both of them true.

As far as you know, basically everyone who took one box walked away a millionaire, and everyone who took two walked away poor.
The supercomputer made its prediction before you even knew about the problem. The boxes are already set up, so your decision now cannot change whether the million is in there.

Both true. The split happens because each statement carries a hidden assumption, and which one you privilege quietly decides your camp. The two assumptions correspond to two formal schools of decision theory, the field that studies how to choose under uncertainty.

The one-boxers run on evidential decision theory, EDT. As one of them puts it, the expected utility calculations are based on probabilities that use the prior evidence of how accurate the computer is. The thousands of people it predicted correctly are evidence enough that when you reach for one box, the million is waiting. EDT asks: given that I choose this, what should I now expect to be true about the world? Choosing one box is strong evidence that the million is there, so choose it. Veritasium notes its own audience leans this way hard. They polled viewers and got more than 24,000 responses, and two thirds came back one-boxers.

The two-boxers run on causal decision theory, CDT. The motto: "I believe that whatever I do now can't influence and change the past. I only take into account things that I can actually influence." Since the prediction and the box contents were set before you learned about the problem, your choice cannot cause the million to appear. So CDT swaps in a different probability. Call P the probability, fixed back when the computer predicted, that you were going to one-box. Then:

One-box (CDT):  EU = $0      + $1,000,000 × P
Two-box (CDT):  EU = $1,000  + $1,000,000 × P

The $1,000,000 × P term is identical in both rows because your choice cannot move P now; that ship sailed. So two-boxing is just one-boxing plus a guaranteed extra $1,000, no matter what the computer predicted. Of course you two-box. The two camps are not doing the same math badly. They are doing different, internally correct math, because they disagree about which probability is the right one to plug in. That is the real paradox.

Figure 2. The fork. Everyone agrees on the facts. The split is upstream of the math, in which true statement you let drive: the predictor's track record, which makes your choice evidence and sends you down EDT to one box, or the fixed past, which says your choice causes nothing and sends you down CDT to two boxes.

On the street, the arguments get heated and human. A two-boxer to a one-boxer: "You guys think that your decision, whatever you think now, is gonna change the past. That's called wishful thinking." A one-boxer holding firm: "Your little thought does not change God's mind, bro." And the cleanest distillation of the disagreement, from a defiant two-boxer: "I'm not losing $1 million. It was never in the room, man. You're gonna walk into a room and there's either money in the room or there isn't money in the room. Your question is, do you pick it up?" Veritasium's verdict is not a tiebreak. Because both assumptions are true, both camps have valid answers. Which raises the real worry: if there is no right answer, is this just a meaningless game?

What it says about free will

It is not meaningless, because the puzzle quietly forces you to take a stance on three of the biggest questions there are. The first is free will.

Notice the only way to truly win, to walk out with the full $1,001,000, is to be the kind of person the computer reads as a one-boxer and then two-box at the last second anyway. But if the predictor is good enough, that move is impossible. Push it to the limit: suppose the predictor is 100% accurate. Then there is genuinely nothing you can do between entering the room and choosing that changes what was already predicted. Your future is, in a real sense, readable in advance. Does a perfect predictor mean free will does not exist?

One of the Veritasium hosts goes there directly, and it is the most personal moment in the video: "Maybe free will doesn't exist. I come down in this point of like free will is an illusion, but our world operates in a way that is indistinguishable from free will being real, and therefore, you have to act as though it's real, as though it's 100% real." The argument then gets a hard practical edge. If free will is an illusion, you might be tempted to say a criminal is not at fault, so swap prison for gardening classes. But that changes the environment: now everyone knows you can kill someone and just do the gardening. The conclusion holds even if determinism is true: "Whether we do or don't have free will, you have to live as though it exists." The metaphysics may be open. The way you must act is not.

What it means to be rational

The second question is what it actually means to be rational, and here the video lands its sharpest point.

The two-boxer made the textbook rational choice, refusing to believe his thoughts can rewrite the past, and ended up with $1,000 while the one-boxers shop for private islands. The needling name for this is the Why Ain'cha Rich? argument: if you are so smart, then why ain'cha rich? If winning means money, one-boxers win, every time.

The two-boxers have a real answer, not a dodge. In their 1978 paper, philosophers Allan Gibbard and William Harper argue the rational choice is both boxes, while openly admitting two-boxers fare worse. Their move: the game is rigged. "If someone is very good at predicting behavior and rewards predicted irrationality richly, then irrationality will be richly rewarded." You are being punished for being rational, which does not make rationality wrong. Veritasium calls this a bit of a cop out, and offers the more interesting reading: Newcomb's paradox shows that sometimes, in order to be a rational person, you must act irrationally.

This is the crux, and the video sharpens it with a distinction borrowed from the prisoner's dilemma: there is a difference between a rational person and a rational act. Most of the time a rational person does rational acts. Sometimes they come apart.

In the prisoner's dilemma, two players each choose to cooperate or defect:

Both cooperate: three coins each.
You defect, they cooperate: you get five, they get nothing.
Both defect: one coin each.

No matter what your opponent does, you are better off defecting, exactly the dominance logic of two-boxing. So defection is the rational act. But play the game repeatedly, as you do across a life or a society, and it flips: now you are better off cooperating. Hence the split-level insight: a rational society is full of cooperators, while a rational person, in the single shot, is a defector. You might expect a rational society to be made of rational people, but rationality at one level is not compatible with rationality at the other. Back to the boxes: the rational act is to two-box, but a rational society would be full of one-boxers.

The two camps	One-boxer	Two-boxer
Takes	only the sealed box	both boxes
Decision theory	Evidential (EDT)	Causal (CDT)
Key belief	my choice is evidence of what is in the box	my choice cannot cause an already fixed past
Core argument	expected utility, for any predictor better than a coin flip	strategic dominance, beats one-box by $1,000 in every world
Probability used	C, the predictor's track record	P, fixed at prediction time, unmovable now
Walks away with	~$1,000,000	~$1,000
Guardian 2016 poll	53.5%	46.5%
Veritasium audience	~67% (two thirds)	~33%
Defended by	Robert Nozick's framing, the Why Ain'cha Rich argument	Gibbard and Harper, 1978 (the game is rigged)

Figure 3. The two camps side by side. Note the verdict row: the camp with the worse single argument by the rules of rational choice is the camp that ends up rich. That is the whole knot Newcomb's paradox ties.

The three ways to flip a two-boxer

The self-described "fervent two-boxer" then names the only three conditions that would convert him to one-boxing. They are worth keeping because each isolates exactly what the standard puzzle denies.

Backward causation. If your choice now could actually change the past. Suppose the computer predicts by opening a tiny wormhole to peek at the future. Then choosing one box literally causes the million to have been placed there. He one-boxes.
Multiple trials. If the game repeats. Every round, your choices build a reputation; one-box consistently and you get predicted as a one-boxer, so the million arrives this round or a later one.
Precommitment. If you can talk to the computer and make your case before it predicts. Then he 100% one-boxes, because staying true to his word matters to him, and the computer would know that. As another puts it, "If I put my word on it, I'll take the one box."

That third lever, precommitment, is where the video turns from a thought experiment into something with stakes.

Mutually assured destruction

The same structure, a precommitment to a worse option that produces the better outcome, runs the entire logic of the Cold War.

On 29 August 1949, the Soviet Union detonated the RDS-1 bomb in its first nuclear test, igniting a furious arms race. By the mid-1960s the US held over 30,000 warheads and the USSR just over 6,000, each side more than able to destroy the other. US Secretary of Defense Robert McNamara did not push for disarmament. He recommended a strategy of assured destruction: deter a deliberate attack by maintaining a highly reliable ability to inflict an unacceptable degree of damage upon any single aggressor. This became mutually assured destruction, MAD. Attack first and the other side surely retaliates, ending in total annihilation. The commitment to retaliate is what stops the attack in the first place.

Then the video puts you in the chair. You are the US president. You have publicly committed to retaliate if attacked. Word arrives that the Soviets have launched. It is not a system error; it is real. Launch now and at best everyone in the US and USSR dies, at worst a nuclear winter kills nearly everyone on Earth. Do you push the button? The honest answer, from nearly everyone asked: "Everyone in the whole world dies, then I probably don't launch." But you precommitted. "Yep. I don't like the outcome of everyone on Earth dying, so I'm gonna just not."

There is the Newcomb structure exactly. The rational person, in the moment, does not launch, because launching helps no one now. But a country full of leaders known to flinch invites the first strike. So which leader do you elect? The chilling answer: "You want someone who maintains the posture of always pushing that button, and then you want someone who secretly will not actually push that button." The risk is obvious: if anyone finds out, the deterrent collapses.

The pure form is the game of chicken. Two cars speed at each other. The worst outcome is neither swerving, you both die; you win if the other swerves and you do not. The dominant strategy is brutal and visible: rip your steering wheel out and throw it out the window where your opponent can see it. Now you cannot swerve, you are a mad dog headed straight, and the opponent's best move becomes swerving. Making yourself unable to back down is what wins.

Cinema nailed this in 1964's Dr. Strangelove. The Russians build a perfect doomsday device: detect a nuclear attack or any tampering and it automatically triggers an explosion large enough to kill everyone on the planet. The tamper switch is not to stop enemies disabling it, it is to stop the Russians themselves from having second thoughts, removing their own ability to flinch. The whole point is to be so automatic and devastating that the US would never even consider attacking. And it only works if everyone knows the device exists, which, in the film's bitter joke, is the one thing the Russians forgot to announce.

Precommitment is the ultimate strategy

In both Newcomb's paradox and MAD, the best outcome flows from a precommitment to a worse option. That is what secures at least $1 million in the one case and a tense but stable peace in the other. The commitment is the thing. So maybe being rational is not about choosing well in the moment. It is about choosing well which rules you will live by.

The video gives this its most philosophical voice: the question is not how to act, it is what rules one ought to follow, and how one even decides which rules to follow. Imagine you were a robot who could rewire your own programming to obey one set of rules rather than another. What rules would you wire in? You would make yourself the kind of creature that always acts in line with the commitments it would have been good to form, even before you knew the specific problem. Then, dropped into a Newcomb case, you think: if I could have precommitted, the good precommitment would have been to be a one-boxer, and since I already wired myself to live up to the commitments I would have made, I am in effect already committed to one-boxing, even though I did not realize it.

This reframes the one-shot puzzle as an iterated problem in life. Treat it not as a single isolated case but as one instance among every future predictor, every future case, the steady building of your own reputation. You always want to live up to the commitments you have made, so even facing a brand new dilemma you stick to those ideal precommitments and act as the best version of yourself. That is what flips a committed two-boxer, on camera, to one-boxing, prompting the wry aside that it is rare to convince anyone to switch on the Newcomb problem.

The closing logic ties straight back to the prisoner's dilemma. Even if you never meet another generous supercomputer, life does not end when you leave the room. You should always defect in a one-shot prisoner's dilemma, because betrayal only gains. But across the many rounds of life and society, it pays to cooperate. So being the kind of person who sticks to an ideal precommitment is simply beneficial. Maybe you were a one-boxer all along; it just took a reframe and a new perspective.

And the single cleanest statement of the whole thing comes near the end: the core of Newcomb's paradox is deciding whether a strong correlation that you know is not causal should still matter in your decision. Your one-boxing does not cause the million; it correlates with it through the predictor. The entire fight is whether a correlation you cannot trace to a cause deserves a seat at the table when you choose. That is not a parlor question. Teasing causation from correlation is exactly how you tell whether a drug really works or whether the benefit is random chance, a problem that reaches far beyond thought experiments.

Key takeaways

Both answers are defensible, and that is the point. One-boxing and two-boxing each follow from a true premise. The puzzle does not have a hidden trick that breaks the tie; it exposes a real fork in how rational people reason.
The split is a split between decision theories. Evidential decision theory treats your choice as evidence about the world and one-boxes. Causal decision theory only counts what your choice can cause and two-boxes. Same facts, different probability plugged into the same expected utility formula.
The math favors one-boxing past a coin flip. Break-even predictor accuracy is C = 0.5005. Any predictor better than random makes one-boxing's expected payout higher.
Dominance favors two-boxing in every fixed world. With the boxes already set, two-boxing nets exactly $1,000 more than one-boxing no matter what is inside.
It cracks open free will. A sufficiently accurate predictor leaves nothing you can do to deviate from the prediction. The practical takeaway holds either way: you must live as if free will is real, even if it is an illusion.
Rational person and rational act can diverge. As in the prisoner's dilemma, the rational single act (defect, two-box) is not what a rational society would be built from (cooperators, one-boxers).
Precommitment is the unifying strategy. Newcomb, mutually assured destruction, chicken, and Dr. Strangelove all reward binding yourself in advance to a choice that looks worse in the moment. Throw the steering wheel out the window.
It is really about correlation versus causation. One-boxing correlates with the million without causing it. Whether a non-causal correlation should sway your decision is the live question, and it is the same question behind every drug trial.

Chapters

Timestamps are clickable. Click one and the player jumps there and keeps playing while you read.

0:00 What is Newcomb's Paradox?
3:24 Pick 1 Box!
5:24 Pick Both!
6:38 What is decision theory?
11:27 What does Newcomb's Paradox say about free will?
13:25 What does it mean to be rational?
16:49 Mutually Assured Destruction
20:02 Precommitment Is The Ultimate Strategy

Notable quotes

To almost everyone, it is perfectly clear and obvious what should be done. The difficulty is that these people seem to divide almost evenly on the problem, with large numbers thinking that the opposite half is just being silly. Robert Nozick, quoted at 2:01

Here's my $1 million. Take that, Casper. Thank you very much. Gregor Cavlovic, 3:13

I'm always better off by picking both boxes. This is known as strategic dominance. Casper Mebius, 3:48

I can also see that just having those thoughts in your brain are what might allow the computer to give you nothing. I think I'm grateful that I just don't have those thoughts. Henry van Dyck, 3:55

Your little thought does not change God's mind, bro. Henry van Dyck, 6:44

Free will is an illusion, but our world operates in a way that is indistinguishable from free will being real, and therefore, you have to act as though it's real, as though it's 100% real. Derek Muller, 11:47

If you're so smart, then why ain'cha rich? Casper Mebius, 13:39

Sometimes in order to be a rational person, you must act irrationally. Casper Mebius, 14:25

The best strategy in this game is to visibly take the steering wheel out of your car and throw it out the window so that the opponent can see that you've done that. Derek Muller, 19:00

Maybe being rational isn't deciding about what to choose in the moment, but it's about deciding what rules you're going to live by. Casper Mebius, 20:30

The core of Newcomb's paradox is deciding if a strong correlation that you know isn't causal should matter in your decision. Derek Muller, 24:10

Resources mentioned

Newcomb's paradox, invented by physicist William Newcomb and popularized by Robert Nozick in 1969.
Robert Nozick, the philosopher whose framing of the even split made the problem famous.
The Guardian's 2016 poll of over 31,000 readers: 53.5% one-boxers, 46.5% two-boxers.
Evidential decision theory and causal decision theory, the two frameworks that divide the camps.
Allan Gibbard and William Harper's 1978 paper arguing for two-boxing while conceding two-boxers fare worse.
The Why Ain'cha Rich? objection to two-boxing.
The prisoner's dilemma and the game of chicken, the game theory parallels.
Mutually assured destruction, Robert McNamara, and the RDS-1 Soviet nuclear test of 1949.
Dr. Strangelove (1964) and its doomsday device.
The video credits Dr. Arif Ahmed, Dr. Adam Elga, Dr. Kenny Easwaran, Dr. Peter Slezak, Dr. David Wolpert, Dr. Scott Aaronson, and Dr. Michael Huemer for their expertise. The causal expected utility calculation is based on a post by Michael Huemer. Full references at ve42.co/NewcombRefs.
Veritasium, Derek Muller's science channel, and the episode's sponsor Brilliant, whose Bayesian probability course covers the drug-trial version of correlation versus causation.

The one idea to walk away with

Newcomb's paradox is not asking you to find a clever escape. It is asking you to notice that two reasonable, well defined ways of reasoning, evidence and causation, can pull a perfectly smart person in opposite directions with total confidence. The trap is thinking your side is obvious and the other side is silly. The way out is to stop deciding the single act and start deciding the rule. Be the kind of person who keeps the precommitments worth keeping, and you tend to find that the money was in the box, the peace held, and the steering wheel was already out the window. Live the iterated game, not the one shot, and the paradox dissolves into a way to live.

Full transcript

There is a problem that I can't bring up without starting a fight. No, what? It just seems so obvious to me. Now I'm all screwed up, man. (Casper laughs) It has infiltrated every single Veritasium meeting in the last two months. It's trivial. (laughs) I didn't think you would fall for this side. Just makes sense. Let's go! That's crazy! And I even argued with Derek about it. There's no way you're trying to convince me. I don't care. So, here's the setup. You walk into a room, and there's a supercomputer and two boxes on the table. One box is open, and it's got $1,000 in it. (cash register ka-chings) There's no trick. You know it's $1,000. The other box is a mystery box, you can't see inside. You also know that this supercomputer is very good at predicting people. It has correctly predicted the choices of thousands of people in the exact problem you're about to face. Now, you don't know what that problem is yet, but you do know that it has been correct almost every time. Now, the supercomputer says you can either take both boxes, that is the mystery box and the $1,000, or you can just take the mystery box. So, what's in that mystery box? Well, the supercomputer tells you that before you walked into the room, it made a prediction about your choice. If the supercomputer predicted you would just take the mystery box and you'd leave the $1,000 on the table, well, then it put $1 million into the mystery box. But if the supercomputer predicted that you would take both boxes, then it put nothing in the mystery box. The supercomputer made its prediction before you knew about the problem and it has already set up the boxes. It's not trying to trick you, it's not trying to deprive you of any money. Its only goal is to make the correct prediction. So, what do you do? Do you take both boxes or do you just take the mystery box? Don't worry about how the supercomputer is making its prediction. Instead of a computer, you could think of it as a super intelligent alien, a cunning demon, or even a team of the world's best psychologists. It really doesn't matter who or what is making the prediction. All you need to know is that they are extremely accurate and that they made that prediction before you walked into the room. Pause the video now if you want to think about it. (soft playful music) Got your answer? So, I should just take two boxes, like, obviously. I'm gonna say I'm just gonna take the $1 million and go with it. Of course you take two boxes! I would pick both boxes, I think. I would not get the two boxes. I think I'm taking both boxes. (interviewee laughs) Okay. This is seeming less paradoxical than I thought because I should just go in and take the mystery box only. No! What? There are two camps, one-boxers who would take only the mystery box, and two-boxers who take both. But as American philosopher Robert Nozick wrote, "To almost everyone, it is perfectly clear and obvious what should be done. The difficulty is that these people seem to divide almost evenly on the problem, with large numbers thinking that the opposite half is just being silly." This is known as Newcomb's paradox, named for its inventor, William Newcomb. The Guardian newspaper polled over 31,000 people about this problem in 2016. 53.5% were one-boxers and 46.5% were two-boxers. Now, if you find it hard to see why anyone would pick the opposite side, well, here are the arguments for each camp. Look, I'm a reasonable guy and I like money, so I'm gonna do whatever gets me the most money. So, let's go weigh the outcomes of both of these decisions. First, I'm gonna say that the probability that the computer predicted my decision correctly is gonna be C, so the computer got it right. And because of that, the probability it got it wrong is gonna be 1 - C. So, let's look at what happens if I try to two-box. There is a C chance of me getting $1,000 and a 1 - C chance of me getting $1,001,000. If I add these two together, I get a weighted sum, which is gonna tell me how much I can expect to get if I try to two-box. This is also known as expected utility or the EU of two-boxing. And I can just simplify this expression a tiny bit. So, let's look at what happens if I try to one-box now. There's a C chance of me getting $1 million and there's a 1 - C chance of me getting nothing. So, we can cancel this out, simplify this to just $1 million times C. If I equate these two expressions, I'm gonna get the C at which these two expected utilities are equal, and it turns out that the C at which this happens is 0.5005, or 50.05%, which means if the computer is better at predicting than what is basically random, then the expected utility of one-boxing is gonna be higher. Now, I know that the computer is much better at predicting than that because it accurately predicted thousands of people before me, which means I'm sticking with my one box. Here's my $1 million. Take that, Casper. Thank you very much. So, I should go in and take the mystery box and leave the other. 'Cause I'm assuming that its prediction is pretty good, it's a supercomputer. Okay, so you're saying it's not paradoxical because the choice is obvious. I'm very surprised. Why? Because to me, the answer is also obvious, and to me, the answer is you take both boxes. So, here's how I think about the problem in a way that actually makes sense. You know that the supercomputer has already set up the boxes, so whatever I decide to do now, it doesn't change whether there's zero or $1 million in that mystery box, and that gives us four possible options that I've written down here. If there is $0 in a mystery box, then I could one-box and get $0 or I could two-box and get $1,000, but there could also be $1 million in a mystery box. And in that case, I would get $1 million if I one-box or I would get $1,001,000 if I two-box. So, I'm always better off by picking both boxes. This is known as strategic dominance where I always pick the dominant strategy, which in this case is to two-box. So, give me those boxes. The two-boxer argument makes a lot of sense to me. Once you explain it, I'm like, "Okay, yeah, I can see why exactly you're right." But I can also see that just having those thoughts in your brain are what might allow the computer to give you nothing. I think I'm grateful that I just don't have those thoughts. It's so funny because I was totally expecting you to go two box. No way. No way, man. It seems like there are two perfectly reasonable approaches that give two completely different answers, and that's because your choice actually reveals something fundamental about how you make decisions. It comes down to these two statements. First, as far as you know, basically everyone who has taken one box has walked away a millionaire, and everyone who has taken two has walked away poor. Second, the supercomputer made its prediction before you even knew about the problem. The boxes are already set up, so your decision now can't change if the million is in there or not. Both of these statements are true, but there's a hidden assumption in each that divides people. We pick both and the computer picks both? You just get the $1,000. I think I would just pick the mystery box. Just the mystery box, probably. Yeah. I might be taking the mystery box. Just the mystery box? Yeah, it might be. Okay, why? I don't know, I guess the supercomputer is right, no? Here's the hidden assumption for us one-boxers. My expected utility calculations are based on probabilities that are using prior evidence of how accurate the supercomputer is, because the thousands of people that it accurately predicted before me is evidence enough for me that when I go for one box, there's gonna be $1 million waiting in there for me. This is based on something called evidential decision theory. And using this decision theory, you get these expected utilities, and my choice is obvious from there. And it turns out a lot of you actually thought the same way. We polled our audience, got more than 24,000 responses, and it turns out two thirds of you are one-boxers. I know, it's crazy. I don't trust this. It's looking to be like you're less and less rational with these results, Casper. So, your argument is kind of that if you one-box, it will have predicted that you were gonna one-box and you'll walk out with the money. That is pretty convincing. Casper, I'm starting to really doubt this two box side of things. How could you not be a two-boxer? Gregor has this really funny way of thinking about things, but I make my decision based off something else, something a little more rational because I believe that whatever I do now can't influence and change the past. I only take into account things that I can actually influence. And clearly, whatever I do now, whatever I think now is not gonna change whether that $1 million is gonna be in the mystery box or not because it was already set up before I learned about the problem. This is known as causal decision theory, where you only take into account things that you can actually cause. And so, with this, your expected utility calculation changes, and that's because you need to use a different probability, one where you could actually cause that $1 million to be in the mystery box or not. So, right before the supercomputer made its prediction, there was some probability that it thought I was going to one-box. I know, it's weird, but bear with me. So, let's say that probability is P, then that's the probability that I'm going to use in my expected utility calculation. And the expected utility to one-box is just gonna be 0 + $1 million times P. That's pretty good, but the expected utility for two-boxing is gonna be $1,000 + $1 million times P. But that's just the same as the expected utility for one-boxing plus an extra $1,000. So, no matter what the computer predicted, my expected utility is always higher by picking both boxes. So, of course, you're gonna two-box. Anyone in their right mind would pick both boxes. It's made the prediction before you're in there, whether I have facing this thing being 90%, 100%, it actually doesn't really matter. I think you guys are imposing your will on- Yes, it does! He's cooking, bro. Like, yeah, you guys think that your decision, whatever you think now is gonna change the past. That's called wishful thinking. Yeah, like, come on, I'm two boxes. I'm two boxes. It only makes sense. I'm back with Casper. Exactly, welcome to camp two-boxers. (laughs) I'm not losing $1 million. It was never in the room, man. You're gonna walk into a room and there's either money in the room or there isn't money in the room. Your question is, do you pick it up? Of course. We're trying to do you a favor here! You're not doing me a favor 'cause your decision making does not affect... Like, your little thought does not change God's mind, bro. Henry, if you convinced yourself that you're a one-boxer, you've convinced the machine. I don't believe that. What do you mean you don't- I don't believe you convincing yourself should impact what the machine thinks of you. Because often, you're just walking into a room and it's already made the prediction. You can't impact it, you know? So, whatever you think is more important, whether that's the evidence of the supercomputer's accuracy or the fact that the boxes are already set up, well, that affects how you calculate the expected utility. And because both of those assumptions are true, both camps have valid answers. But if there's no right answer, then is this just a meaningless problem? Well, not really. Because it actually reveals a surprising amount about three important questions. Does free will exist? What does it mean to be rational? And is there an ideal way to act in life? For example, the only way you're going to win this game is to already be the kind of person to one-box, but then two-box at the last second anyways. That's the only way you're gonna get $1,001,000. But some would argue that that itself is impossible. If the predictor is so good, let's say it's 100% accurate, then that's not even possible. Would you say that's true? Then a follow-up question is, if such a perfect predictor would exist, does that mean that free will doesn't exist? Because you're saying there's nothing you can do in between walking into that room and making your decision that ends up changing what was predetermined. That's right, and maybe this reveals where I'm coming from, and I think where I'm coming from is maybe free will doesn't exist. I come down in this point of like free will is an illusion, but our world operates in a way that is indistinguishable from free will being real, and therefore, you have to act as though it's real, as though it's 100% real. Interesting. If we think that free will is not real and it's an illusion, and then you have someone who's committing crimes and then you wanna say, "Well, that's not his fault." Therefore, instead of putting, you know, murderers in jail for 25 years, we're just gonna give them some gardening classes or so. Like, the problem is, that then changes the environment where everyone knows you can kill someone and you can go to, like, do the gardening. So, you can't change the system based on the knowledge that it's an illusion. Whether we do or don't have free will, you have to live as though it exists. So, you've still gotta make a choice, one box or two boxes? Which brings us to our second question, what does it mean to be rational? I'm the guy who acted rationally and doesn't believe his thoughts can influence the past. But your rational choice will have given you a $1,000. I know, yeah, it's tough. It's tough. Let's see what I can buy with $1 million. None of these look great. I think we can be more creative. Private island sounds pretty nice. All right, Casper, what do you wanna go buy with 1,000, man? (Henry laughs) This is known as the "Why Ain'cha Rich?" argument, which boils down to one super annoying question, if you're so smart, then why ain'cha rich? You know, if winning is getting more money, then of course the one-boxers are gonna end up better off than the two-boxers. But maybe it's not about who wins, but about what's rational. In their 1978 paper, philosophers Gibbard and Harper argue that the rational choice is to pick both boxes. Although they do admit that two-boxers will fare worse. They instead say that the game is rigged. And "if someone is very good at predicting behavior and rewards predicted irrationality richly, then irrationality will be richly rewarded." But I think that's a bit of a cop out, because really, Newcomb's paradox reveals something surprising, that sometimes in order to be a rational person, you must act irrationally. There's one question of what's a rational person. There's another question of what's a rational act. Most of the time, rational people do rational acts, sometimes they just don't. And I think this is analogous to the situation in the prisoner's dilemma. In the prisoner's dilemma, you and another player compete for money by either cooperating or defecting. If you both cooperate, (audio chimes) then you get three coins each. (coins clinking) But if you defect and your opponent cooperates, then you get five coins and they get nothing. And if you both defect, (buzzer buzzes) you get one coin each. (coins clink) So, no matter what your opponent does, you are always better off by defecting. But if you play this game not once, but repeatedly, then everything changes. All of a sudden, you're better off by cooperating. What's a rational society? A rational society is full of cooperators. What's a rational person? Maybe a rational person is a defector. And normally, you might expect a rational society is made up of rational people, but I think it's familiar that rationality at one level isn't compatible with rationality at the other level. So, while the rational act is the two box, a rational society would actually be full of one-boxers. Now, I'm a fervent two-boxer, but there are three ways that you can get me to one-box. The first is if my choices now can actually change the past. For example, say the supercomputer makes its prediction by opening up some tiny wormhole to see the future. Well, in that case, if I choose one box now, that actually causes the $1 million to be placed in there in the past, so I one-box. Second, if there are multiple trials, because now with every game, each of my choices builds up a reputation, so if I one-box, then I'll be predicted as the one-boxer and so I get the $1 million either in this round or a later one. And third is, if I can pre-commit. If I can talk to the computer to make my case before it makes its prediction, then I will 100% one-box because staying true to my word is important to me, and the supercomputer would know that. If I put my word on it, I'll take the one box. Yeah, 100% one box, yeah. But there are some realistic scenarios where staying true to a worse option could have deadly consequences. On the 29th of August, 1949, the Soviet Union detonated the RDS-1 bomb as part of their first nuclear weapons test. This sent the US and the USSR into a furious arms race. By the mid-1960s, the US had over 30,000 nuclear warheads, and the USSR had just over 6,000. Both sides were more than capable of destroying the other. (explosion booms) The US Secretary of Defense at the time, Robert McNamara, didn't advocate for disarmament. Instead, he recommended a strategy of "assured destruction," where the US should be able to deter a deliberate nuclear attack by "maintaining a highly reliable ability to inflict an unacceptable degree of damage upon any single aggressor." This strategy eventually became known as mutually assured destruction, or MAD. If either country attacked first, the other would surely retaliate and lead to total annihilation of both sides. So, having that commitment to retaliate is beneficial. It stops the attack from happening in the first place. Now, say you are the US president during the Cold War. You have publicly committed to retaliate if the US is ever attacked. But you've just received word that the Soviets have launched their missiles. It's not a system error, it is a real attack. If you launch now, then at best, everyone in the US and USSR dies. And at worst, you get a nuclear winter that kills nearly everyone on the planet. Everyone in the whole world dies, then I probably don't launch. But you did pre-commit. Yep. I don't like the outcome. No, it's terrible. I don't like the outcome of everyone on Earth dying, so I'm gonna just not. When the country is electing their leader, which leader do you want to elect? Do you want to elect someone who's crazy and is always gonna press that button, or do you wanna elect someone who makes the seemingly rational choice of saving people and not press that button? I think you want someone who maintains the posture of always pushing that button. That's right. And then you want someone who secretly will not actually push that button. There is an inherent risk there, which is that if anyone finds out, you're exposed. There's this other game theory interaction, the game they call chicken. You're both driving your cars at each other. The worst thing is if neither of you swerves, because then you both die, but you win if the other person swerves and you don't. The best strategy in this game is to visibly take the steering wheel out of your car and throw it out the window so that the opponent can see that you've done that. Now they know you cannot swerve. You're this mad dog that's just going straight ahead. And now they realize, "Now my best action is just to swerve." And that similarly, like what you want with the nuclear deterrent as they set up in "Dr. Strangelove." In the 1963 film, "Dr. Strangelove or: How I Learned to Stop Worrying and Love the Bomb," the Russians built the perfect doomsday device. As soon as it detects a nuclear attack or any tampering, it automatically triggers a large enough nuclear explosion to kill everyone on the planet. The tampering kill switch isn't there to prevent enemies from disabling it. It's to prevent the Russians themselves from having second thoughts. Now, the whole point of the device is to be so devastating and automatic that the US would never even think about launching an attack. But this only works if everyone knows that the device exists, which is the whole point of the movie. In both Newcomb's paradox and MAD, the best outcome follows from a pre-commitment to a worse option. That's what gets you at least $1 million in the former and a tense but stable peace in the latter. It's the commitment that's important. So, maybe being rational isn't deciding about what to choose in the moment, but it's about deciding what rules you're going to live by. The question isn't how to act. The question is what rules one ought to follow, or how does one even decide what rules to follow? Sometimes it's put in the form of, if you knew that you were a robot with programming, that you could set and you could rewire yourself to make yourself obey one set of rules rather than another, the question is, what sort of rules would you wire yourself to obey? And what you would do is you would make yourself into the kind of creature that sort of always acts in line with the commitments that would've been good to form had you even known about the problem. When you're in a situation like the Newcomb case, you would end up finding yourself think, "If I had been able to make a pre-commitment, what pre-commitment would've been the good one to make? The good pre-commitment to make would've been to be a one-boxer. And since I've already wired myself up to be the kind of person that lives up to all the pre-commitments I would've made, then I'm already, in effect, committed to one-boxing, even though I didn't realize it." I love this approach, because for me, that kind of makes it an iterated problem, but maybe more an iterated problem in life. If I don't look at it as a single case and I sort of almost think about it as for every future predictor, for every future case, or like, almost building your own reputation, right? Exactly. Like, you always wanna live up to the commitments you've made, so even if you haven't heard of it before, you'd wanna stick to those ideal pre-commitments so that you are acting in line with the best version of yourself. Yeah, that would convince me to be a one-boxer. It's rare to convince anyone to switch on the Newcomb problem. (laughs) The thing is, even if I never run into another generous supercomputer again, life doesn't end after I walk out of that room. Like, I should always defect in a one-shot prisoners dilemma because I can only gain by betraying the other player. But when I play multiple rounds of the game, like in life or in society, everything changes. Suddenly, it pays to cooperate. So, being the kind of person that sticks to an ideal pre-commitment is beneficial. So, maybe I was just a one-boxer kind of guy all along. All it took was a little reframing and a new perspective. (screen beeping and chirping) The core of Newcomb's paradox is deciding if a strong correlation that you know isn't causal should matter in your decision. So, the question is, what do you do? Do you pick both boxes or do you just pick the mystery box? Might be taking the mystery box. Mystery box. I also pick both boxes, so. (laughs) I'll take the mystery box. Okay, so you would pick just the mystery box and probably get the $1 million. So that's based off, I guess that it's almost always been correct, so there's this correlation. Do we think that correlation is the same as causation? But how can you tease out what's causation and what's just correlation? That's a difficult problem that has applications far beyond thought experiments. For example, how can you tell whether a drug really works or if any beneficial effects are simply due to random chance? (screen chirping) Big questions like these combined with hands-on learning is what I love about today's sponsor, Brilliant. It forces me to slow down and actually think instead of just nodding along with an explanation. Brilliant helps you build skills in math and coding with step-by-step interactive lessons. You're not just watching, you're actively solving problems and testing your ideas as you go. I found their courses to be both fun and effective. So, whether you want to explore how to master the math of probability, start coding in Python, or really understand how AI works, Brilliant will help you achieve your learning goals. And their lessons start you at the right level based on your background, so no matter if you're 10 or 110, the practice sets and reviews are customized to help you advance at your ideal pace. So, how can you tell if a drug really does work? Well, Brilliant's course on Bayesian probability goes through that exact problem, and the best part is, you build up your intuition while you figure out the answer. By training your problem solving muscles, you can take what you learn on Brilliant and reason through lots of others situations. So, to learn on Brilliant for free for a full 30 days, scan this QR code or go to brilliant.org/veritasium. I will put that link down in the description. For viewers of this channel, Brilliant offers 20% off an annual Premium subscription. That gives you unlimited daily access to everything on the site. So, I wanna thank Brilliant for sponsoring this video and I wanna thank you for watching. </content> </invoke>