If Prime Numbers Become Increasingly Rare, Then Why Do They Keep Showing Up In Pairs?
Veritasium tells the full story of the twin prime conjecture, the 2000 year old claim that prime pairs two apart like 11 and 13 never run out even as primes thin away. It walks both lines of attack: Viggo Brun adapting the sieve of Eratosthenes into a war of main term against error term, and the bounded gap program that hit a 'one half barrier' everyone believed was a wall. The hero is Yitang Zhang, a former Subway bookkeeper who in isolation pushed past that barrier by one part in 584 and proved infinitely many prime pairs sit within 70 million of each other. James Maynard and Terence Tao's Polymath collaboration then showed the half barrier was a mirage and drove the gap to 246, where it stands today.
Published Jun 14, 202641:30 video27 min readAdded Jun 16, 2026Open on YouTube →
At a glance
On the 17th of April, 2013, the Annals of Mathematics received a 50 page manuscript from an unknown lecturer who had once kept the books at a Subway sandwich shop, and it cracked a problem Edmund Landau had called unattackable. The video tells the full story of the twin prime conjecture, the 2000 year old claim that primes two apart, like 11 and 13, never run out even as primes themselves thin away to nothing. It walks through both great lines of attack: Viggo Brun adapting the sieve of Eratosthenes to fight a war of main term against error term, and the bounded gap program that ran into a "one half barrier" everyone thought was a wall. The hero is Yitang Zhang, who in isolation pushed past that barrier by one part in 584 and proved that infinitely many prime pairs sit within 70 million of each other. Then James Maynard and Terence Tao's Polymath collaboration showed the half barrier was a mirage and drove the gap to 246, where it stands today.
The twin primes, and why they should not exist
The twin primes are prime numbers separated by just one number in between, so they differ by two: 11 and 13, or 17 and 19. The twin prime conjecture makes a bold claim about them, that there are infinitely many. You never run out, no matter how far up the number line you walk.
The reason this is hard to believe is that primes get rarer as you climb, and twin primes get rarer still. The video makes this precise. If you look at the gaps between consecutive primes they seem chaotic at first, but averaged out a clean trend appears: the average gap between two primes near a number N grows roughly as the natural logarithm of N. Around 100 the average gap is about 4.6. Around 1000 it is about 6.9. The logarithm grows painfully slowly, but it never stops growing, so as N runs to infinity the average gap also runs to infinity. That is exactly the wrong news for anyone hoping to keep finding primes a mere two apart.
Figure 1. The average gap between consecutive primes grows like ln(N). It rises forever, which is why twin primes "should" eventually vanish, yet it rises so slowly that there is always room for two primes to sit right next to each other. The whole conjecture lives in that tension.
And yet they keep turning up. After a million you quickly hit the twins 1,000,037 and 1,000,039. Past a billion there is 1,000,000,007 and 1,000,000,009. The largest known twin prime in the video is 2,996,863,034,895 times two to the power 1,290,000, plus or minus one, a pair of numbers each 388,342 digits long. Printed in a book, each number would run about 260 pages. As far as anyone has ever looked, the twins keep coming. But checking is not proving. You cannot physically walk the number line to infinity, so brute force is out. You need an argument.
Hardy and Littlewood: a guess so good it is almost a proof
About 100 years ago it felt like the finish line was close. In 1923 the English mathematicians G. H. Hardy and J. E. Littlewood worked out how many twin primes there ought to be. They started from one of the crown jewels of the subject, the prime number theorem, which says the odds of a large number near N being prime are roughly one over ln(N).
The video runs the example cleanly. The odds that 137,037 is prime come out to about 8.5 percent. The odds that 137,039 is prime are also about 8.5 percent. Treat the two as independent and multiply, and the odds of both being prime are about 0.7 percent. That independence assumption is a fib (primes are not independent), but set it aside and the odds of a number N and its partner N plus two both being prime work out to one over ln(N) times one over ln(N plus two). For large N the plus two is noise, so the chance that any pair of size N is a twin prime is roughly one over ln(N) squared.
To count all twin primes up to some number you add up those odds at every position, which is to say you integrate one over ln(N) squared from the first prime, two, on up. Hardy and Littlewood then bolted on a correction factor out front to repair the independence fib, and the finished formula is startlingly accurate. Plotted out to one trillion it tracks the true count so tightly that by a trillion the estimate is off by only 0.001 percent.
The trouble is the word "estimate." This is a heuristic, not a proof. It tells you how many twins to expect but cannot promise they never stop. Terence Tao's framing in the video is exactly right: for all we know there is a "fast conspiracy" in which every time a number N decides to be prime it strikes a secret deal with its neighbor N plus two telling it not to be prime anymore. A heuristic cannot rule that out. Only a rigorous, airtight proof can, and that is brutally hard.
Viggo Brun and the sieve: counting what is left
The first serious assault came from a 29 year old Norwegian, Viggo Brun, working in quiet isolation in Norway during the early years of the First World War with Europe's math centers out of reach. His goal was modest in statement, savage in difficulty: prove that the count of twin primes always keeps increasing. His tool was 2000 years old.
That tool is the sieve of Eratosthenes. To find the primes up to 100, lay them in a 10 by 10 grid. Cross out 1, since it is not prime. Circle 2 and cross out every multiple of 2. Circle 3, cross out its multiples. Do the same for 5 and 7. After those four quick passes, every number still standing is prime. You can stop at 7 because of a neat fact pointed out in the video by Eratosthenes himself: you only need to sieve by primes up to the square root of your range. The next prime is 11, but every multiple of 11 below 100 (22, 33, 55, 77) was already struck by a smaller factor, and 11 times 11 is 121, already off the board. So sieving by 11 removes nothing new.
The video offers a second, prettier way to see the sieve: let each prime emit a wave whose wavelength equals its value. The 2 wave lands on every second number, the 3 wave on every third, and so on. Wherever any wave lands, the number is composite. The numbers that dodge every wave are exactly the primes.
Figure 2. The wave picture of the sieve. The 2 wave strikes every even number, the 3 wave every third, the 5 wave every fifth. Composites get hit; the survivors (7, 11, 13) are primes. Brun did not care which numbers survived, only how many, and that subtle shift is what turned a 2000 year old trick into a weapon against the twin primes.
Brun did not care which numbers survived. He only needed the count. So he counted the crossings out and subtracted from the total. The bookkeeping is the famous principle of inclusion exclusion. Start with 100 numbers. Remove multiples of 2, leaving 50. Remove multiples of 3 (about 33, ignoring the leftover one third of a number, since you do not cross out fractions). Do the same for 5 and 7, and the count overshoots into the negatives because some numbers, like 6, got struck twice (once as a multiple of 2, once as a multiple of 3). So add the double counted overlaps back: 6, 10, 14, 15, and the rest. The count climbs to 28. But now numbers like 30, equal to 2 times 3 times 5, were added back too many times, so subtract the triples (42 is 2 times 3 times 7, 70 is 2 times 5 times 7). Then add back the quadruple, 2 times 3 times 5 times 7 equals 210, which is off the board and contributes nothing. Finally add the four sieving primes themselves and subtract one for the number 1. The count lands on exactly 25 primes below 100.
That messy alternating sum rearranges into a clean product: roughly N times one minus one half, times one minus one third, times one minus one fifth, times one minus one seventh, and so on for every sieving prime P up to the square root of N. Each new prime factor just tacks on one more "one minus one over P" term.
Brun adapts the sieve for twins, and meets the error term
Now the twist that makes it a twin prime tool. To find pairs where both N and N plus two are prime, Brun sieved twice as hard. When sieving by 5, you remove the N divisible by 5 as usual, but you also remove the N where N plus two is divisible by 5, which strikes 23, 28, 33, and so on. So where the ordinary sieve removes one in every P numbers, the twin sieve removes two. Every term after the first picks up a 2 in the numerator, and the predicted count of twin primes ends up growing like N over ln(N) squared, in perfect agreement with the Hardy-Littlewood heuristic. It looks like Brun won. He did not, and the reason is the deepest idea in the video.
Remember the fractions you threw away. One hundred divided by three is not 33, it is 33 and a third, and you discarded that third. Each discarded fraction is a rounding error of at most one. The killer is how many of them there are. With one sieving prime there is one error term. With two primes there are three. With three primes there are seven. The pattern is two to the power K minus one error terms when you sieve by K primes, so the total error for ordinary primes grows like two to the power K. For the twin sieve, where every prime contributes a factor of two, the error grows like four to the power K. As Andrew Granville puts it in the video, all of analytic number theory is the fight between the main term you believe is the truth and forcing the error term to be provably smaller.
Figure 3. Why the naive twin sieve fails. The main term (top) crawls up like N over ln(N) squared. The error term (rising steeply) grows like four to the K. Subtracting the error from the main term gives the only thing you can actually prove, the lower bound, and it plunges below zero. A negative lower bound proves nothing, so the conjecture stays out of reach.
Brun's escape was to weaken his own sieve. Instead of sieving all the way up to the square root of X, he sieved only up to X to the power one tenth, far fewer primes, which kept the error count small enough to control. There was a price. Sieving by fewer primes means some survivors are not actually prime, just numbers with few prime factors (up to nine in Brun's case). So what Brun actually proved is striking but not the real thing: there are infinitely many pairs of numbers two apart where each number has at most nine prime factors. The trade off illustrated in the video is concrete: to find all primes up to 10 billion the square root method sieves up to 100,000 with a flood of error terms, while Brun's method sieves only up to 10 (that is 10 billion to the one tenth) at the cost of letting near primes through.
Brun's nine was hammered down over the decades, to seven, then to three. In 1973 the Chinese mathematician Chen Jingrun reached the wall: infinitely many primes P such that P plus two has at most two prime factors. As Granville says, this is as close as you can get to the twin prime conjecture without actually getting there. Stuck, as close as possible and no further.
The other road: small gaps between true primes
There is a second way to sneak up on the twins, and it flips the trade. Instead of insisting the gap is exactly two and relaxing the primality (Brun and Chen's road), you insist both numbers are genuinely prime and relax the gap. Just bring two real primes close together.
Since consecutive primes sit about ln(N) apart on average, the natural question is: can you prove primes sometimes come closer than that average gap? For decades mathematicians chipped away. By 1988 the gap had been pushed down to about a quarter of the average, meaning at huge scales where the average gap is 100, primes must sometimes fall within about 25 of each other. Then in 2005 Daniel Goldston, János Pintz and Cem Yildirim, known as GPY, shocked everyone. They proved bounded gaps of zero percent of the average. You can make the fraction as small as you want, one tenth, one hundredth, one millionth of the average gap, and infinitely often primes come that close. Their method also flirted with something far bigger, an absolute, fixed bounded gap, not just a shrinking fraction. The prize there is obvious: we believe the true infinitely repeated gap is two, and a fixed bound would be the first real foothold toward it.
But the GPY tool hit a wall, and in 2005 the American Institute of Mathematics gathered the field's leading experts (GPY themselves, Granville, Kannan Soundararajan) for a week in California with the single goal of proving a bounded gap. The verdict, which the young graduate student in the video witnessed firsthand, was that it was impossible. Soundararajan demonstrated why the method could not do it. People packed up and moved on to other problems. But one person was not in that room: Yitang Zhang.
The stencil, the averaging machine, and the one half wall
Zhang grew up in China, moved to the U.S. around age 30 for a math PhD, but graduated without recommendation letters and could not land an academic job. He lived in his car for a time and worked odd jobs for seven years, including at a Subway where he kept the books and sorted receipts, driving to the local library in his spare time to read number theory journals. In 1999 a friend helped him land a lecturer post at the University of New Hampshire, and he could finally do math full time. He had told himself as a child that one day he would solve a major problem, and by 2010 he had picked his: bounded gaps between primes.
To see what he was up against, the video rebuilds GPY's machine. Picture a stencil, a strip with holes punched in it, say diameter six with holes at positions 0, 2 and 6. Lay it on the number line starting at 12 and read off which holes land on primes. At 12 you see 12, 14, 18, none prime, so you record zero. Slide it one step to 13, 15, 19, two primes, record two. Slide again to land on 23 and 29, two primes again. If you could prove the stencil keeps catching at least two primes forever, you would have proven infinitely many prime pairs within a bounded gap of six. Two problems block this: you do not know exactly where the primes are, and you cannot slide a stencil along an infinite line by hand.
The fix for the first problem is to stop tracking exact primes and track averages. Over a stretch from a large X to 2X, the prime number theorem tells you how many primes to expect on average. GPY built the stencil version of this, an averaging machine: feed it a stencil and a range, and it returns the average number of primes the stencil catches per position. The video works a toy example with the stencil above over the range 20 to 40: total 14 primes across 21 starting positions, an average of about 0.67 primes per spot.
Here is the crucial logic. An average alone does not guarantee any single position caught two primes at once. An average of 0.67, or even exactly 1.0, could come from every position holding exactly one prime, spread out evenly. But push the average above one and the pigeonhole kicks in: if you have more primes than positions, then even after giving every position one prime, at least one prime is left over and must double up somewhere. So if you can force the weighted average above one, you have guaranteed that at least one position holds two primes. That is the whole game.
And the infinite line problem dissolves for free. The proof holds for any large X, so swap X for 2X, then 2X for 4X, then 4X for 8X, on forever. Prove the average exceeds one for arbitrary X and the bounded gap follows all the way to infinity.
The catch is that the raw machine never breaks one, because the stencil lands on too many barren positions (all even numbers, say, which carry no primes except 2). So GPY weighted the positions, downranking spots whose numbers are divisible by 2, 3, 5, 7 or other small primes and therefore unlikely to be prime. The weighted average shot up, but stopped just short. To assign those weights they needed to know how primes distribute across arithmetic progressions, sequences like 3, 7, 11, 15 with a fixed step size. A theorem from the 1960s (the Bombieri-Vinogradov theorem, which Granville calls a substitute for the Riemann hypothesis on average) gives exactly that, but only for step sizes up to the square root of X, that is X to the one half. That exponent one half is the level of distribution, written theta. With theta capped at one half, GPY's best possible weighted average came out to two times theta, which is exactly one. They could approach one, never cross it. If anyone could push theta to even 0.5000001, GPY's method would snap from zero percent gaps to true bounded gaps. The 2005 meeting tried for a week to break past one half and concluded it could not be done.
Figure 4. The GPY stencil. Holes punched at 0, 2 and 6 slide along the number line; at this position two of them land on the twin primes 23 and 29. You cannot track every prime, so you track the average count per position instead. The instant that weighted average exceeds one, the pigeonhole guarantees some position always catches two primes, which proves a bounded gap out to infinity.
The backyard, the deer that never came, and one over 584
From 2010 Zhang spent two years internalizing the GPY argument, working day and night, and by the summer of 2012 he was exhausted with nothing to show. He went to clear his head at a friend's place in Colorado. One evening, waiting to leave for a concert, he stepped into the backyard alone to watch for the deer that usually wandered through. No deer came. He paced and thought, and the answer arrived. He had no paper, but he knew it would work.
GPY needed prime counts across arithmetic progressions of every step size. Zhang narrowed his attention to a special class of step sizes, those built only from small prime factors (the smooth numbers), and found he could reorganize the error terms so that most of them canceled. That cancellation let him nudge theta past one half by the smallest of margins, one part in 584. It was enough. He spent the next year writing it up and on 17 April 2013 mailed the 50 page proof to the Annals of Mathematics.
The journal's reaction, told vividly in the video, is the heart of the story. The Annals receives a claimed proof of the Riemann hypothesis every other day, so the editors planned to spend an afternoon finding the mistake. Instead, flipping through as experts do, they kept hitting the spots where such a proof always dies, and Zhang had handled every one. Granville's image is of laying carpet in a room: you know that one corner is going to defeat you, but Zhang had cut it to fit. By the end of the week they had reconstructed the proof and it was right.
Zhang's stencil had 3.5 million slots spread across a span of 70 million, and by proving two of them always catch primes, he proved a bounded gap of 70 million. Infinitely many pairs of primes lie within 70 million of each other. The number is huge, but it is finite, and that was the impossible part. Mathematicians did not believe it. Soundararajan, reading the paper and realizing it was probably correct, first guessed it was one of his colleagues writing under a pseudonym to avoid embarrassment if it failed. It was not. Zhang was real, an unknown lecturer, and a year later he received a MacArthur Fellowship, the "genius grant." Granville's reflection is that mathematics culture works: an outsider sent in an argument, the field took it seriously, gave it the time it was due, and made him a hero. And maybe, Granville adds, it was precisely Zhang's isolation that saved him, because he never absorbed the group think that the problem was impossible.
Polymath, Maynard, and the mirage of one half
Once everyone knew a bounded gap was possible, they swarmed. Terence Tao launched an open online collaboration, Polymath, to optimize Zhang's method, and the record fell almost daily. One participant remembers a conference where everyone kept refreshing their screens to see who held the world record now. Polymath ground the gap from 70 million down to 4,680.
Meanwhile a young postdoc, James Maynard, fresh from a PhD at Oxford under Roger Heath-Brown and now working with Granville in Montreal, had a completely orthogonal approach. His advisor had warned him not to work on the problem full time because he was "pretty confident you're going to fail." Maynard ignored it and within months made it work, dropping the gap to 600. His method did more: it could trap three primes in a bounded window, not just two, with the bound widening as you demand more primes. And critically, his approach had nothing to do with the exponent one half. As the video puts it, one half was a pure mirage, a red herring, never a fundamental limit at all.
The mechanism is clean. Where GPY's average was chained to two times theta, Maynard's average grows roughly like theta over two, times the natural logarithm of K, where K is the number of slots in the stencil. So Maynard needed only some theta greater than zero, any tiny positive level of distribution, and then enough slots. Going up to one half just gives you better numbers; you could prove bounded gaps with theta as small as 0.01. The wall everyone broke their heads on for a decade was not load bearing.
In a twist the video lingers on, Tao had independently found the same idea. He mentioned it to Ben Green, who was meeting with Granville, who realized his postdoc Maynard was doing exactly the same thing. They put the two in touch. Tao, a Fields medalist and heavyweight, told the fresh PhD: you take it, it is your idea, you go with it. By early 2014 Maynard had joined the Polymath effort, and the combined ideas pushed the record to its current value: infinitely many pairs of primes differ by no more than 246. In 2022 Maynard received the Fields Medal, mathematics' highest honor, for his work on prime gaps.
The video closes the loop with the four minute mile. Everyone believed it was impossible until Roger Bannister ran it in 1954, and then John Landy broke it just 46 days later, and 10 people had broken it by the end of 1956, all unlocked by knowing it was possible. Zhang was the Bannister of bounded gaps, and the only reason he ran the impossible race was that he had not been told he could not win.
How low can it go, and will the twins ever be caught
Sharper results exist, but they are conditional. The Elliott-Halberstam conjecture assumes primes spread evenly across arithmetic progressions, equivalently that the level of distribution theta can be pushed as high as one. In 2013 Maynard showed that if Elliott-Halberstam holds, the gap drops to 12. A year later Polymath showed that under an even stronger version it drops to 6. But six is not two, and twelve is not two, and none of it is unconditional. Without assuming anything, the record stands at 246.
Will the twin prime conjecture itself fall? Tao gives the honest non answer in the video. One trick for guessing how long an open problem will stay open is to assume you are randomly placed in time and that the problem is roughly halfway through its lifespan, so it will stay open about as long as it already has. But that fails for the twins because nobody agrees whether the problem is 125 years old or 2000 years old. Tao calls guessing a fool's game. It will take a really big idea, he says, but maybe only one big idea. The video's final note is almost a moral: if we had known for certain the bounded gap was impossible, we would have skipped a century of invention. Sometimes it pays not to know everything.
Key takeaways
The twin prime conjecture says prime pairs two apart never run out, even though the average gap between primes grows without bound like ln(N).
Hardy and Littlewood's 1923 heuristic is breathtakingly accurate (off by 0.001 percent at a trillion) but it is only a heuristic, not a proof, and cannot rule out a "conspiracy" that ends the twins.
The sieve approach is a war between a slow main term and a fast error term. The naive twin sieve's error grows like four to the K and swamps the count, driving the provable lower bound negative.
Brun weakened the sieve to control the error, proving infinitely many pairs two apart where each number has at most nine prime factors. Chen Jingrun reached "P and P plus two with at most two factors" in 1973, the closest approach that is not the real thing.
The bounded gap road insists on real primes but relaxes the distance. GPY's stencil and averaging machine reduce everything to forcing a weighted average above one.
The "one half barrier" (the level of distribution theta) stopped GPY cold, capping their average at two times theta, which is exactly one.
Yitang Zhang pushed theta past one half by one part in 584 using smooth step sizes, proving a bounded gap of 70 million from near total obscurity.
Maynard and Tao showed one half was a mirage. Maynard's method needs only theta greater than zero plus enough stencil slots, and the Polymath collaboration drove the unconditional gap to 246.
Under the Elliott-Halberstam conjecture the gap falls to 12, or 6 under a stronger form, but unconditionally 246 still stands, and whether the twins are infinite remains open.
Chapters
Timestamps are clickable. Click one and the player jumps there and keeps playing while you read.
00:00 What are twin primes?
03:08 How To Find Prime Numbers
06:11 The Sieve of Eratosthenes
11:26 The Closest We've Come To Proving Twin Primes
15:54 Searching For A Bounded Gap
18:47 How A Subway Worker Changed The Game
29:20 Finding An Upper Bound
32:04 Maynard and Polymath
36:42 Will we solve the twin prime conjecture?
Notable quotes
Oh, damn, he did it.
on the Annals editors reading Zhang's proof, 00:46
For all we know, there could be this fast conspiracy that every time a number N decides to be prime, it has some secret agreement with its neighbor N plus two saying you're not allowed to be prime anymore.
Terence Tao, 03:12
That's all of analysis, in general analytic number theory in particular is the fight between the main term that you think is the truth, if it actually is the truth, and getting the error term to be provably lower order.
Andrew Granville, 07:58
This is as close as you could get to proving the twin prime conjecture without actually getting there. We're stuck, like we're as close as we can get and no further.
on Chen Jingrun's 1973 result, 09:21
And the upshot of this week was basically that it's impossible.
on the 2005 AIM meeting, 16:43
I imagined there would be a day that I would solve a major math problem.
Yitang Zhang, 11:39
I thought it was probably one of the people I knew under a pseudonym, trying to avoid being embarrassed by being wrong.
Kannan Soundararajan on first reading Zhang's paper, 17:54
One half was a pure mirage. It was a red herring.
on Maynard's method, 35:33
It clearly needs a really big idea, but maybe it only needs one big idea.
Terence Tao, 41:27
Resources mentioned
Veritasium, Derek Muller's science channel, which produced and presented the video.
Brilliant and its Koji AI tutor, the video's sponsor.
Where it stands
Everything in the video that is presented as proved, is proved: Brun's sieve bound, Chen's theorem, GPY's zero percent gaps, Zhang's 70 million, Maynard's 600, and the Polymath record of 246 are all peer reviewed mathematics. The honest gap is between those results and the conjecture itself. A gap of 246 is not a gap of 2, and Maynard's and Polymath's tighter numbers of 12 and 6 are conditional on the Elliott-Halberstam conjecture, which is itself unproven. So the twin prime conjecture remains open, and as Terence Tao says, it will likely take one genuinely new idea to close the distance from a finite bound to a gap of exactly two. The video's framing is accurate: this is a story of how far ingenious approximation can carry you, and of an outsider who got further than the experts because he never learned where the wall was supposed to be.
Full transcript
On the morning of the 17th of April, 2013, the journal Annals of Mathematics received a curious email. It claimed to contain a 50 page proof relating to one of the oldest unsolved problems in mathematics, a problem that great mathematician Edmund Landau called unattackable. But this proof didn't come from a famous professor, it came from an unknown. Someone who had once spent years working at a Subway restaurant.
"So they're like, okay, surely this isn't gonna work out, but whatever. You know, we'll send it to a referee."
They expected to find a mistake in an afternoon, but they didn't. So they went through it again, closely studying the fragile parts where a proof like this normally falls apart, but still nothing. Soon they realized they were witnessing a breakthrough.
"Oh, damn, he did it."
So how did he do it? What did the experts miss and what was the problem he was working on? He was working on a new way to attack one of the hardest unsolved problems in number theory, the twin prime conjecture.
The twin primes are prime numbers separated by just one number, like 11 and 13 or 17 and 19. As you go up the number line, primes become rarer and twin primes become rarer still. But the twin prime conjecture claims that there are infinitely many of them, you never run out. But is it true?
Well, one way to approach this problem is to look at the gaps between consecutive primes as you go up the number line. Now at first they seem chaotic, but if you average them out, a clear trend emerges. The average gap between two primes grows roughly as the natural logarithm of the number N. So for example, the average gap between primes around 100 is approximately 4.6. Primes around 1000 is 6.9. Logarithms grow very slowly, but they do keep growing forever. So as N approaches infinity, the average gap between primes also goes to infinity, which is not particularly encouraging if you expect to always be able to find primes that are just two apart.
But if you start checking large numbers, then after a million you quickly find the twin primes, 1,000,037 and 1,000,039. Past a billion, there's 1,000,000,007 and 1,000,000,009. In fact, we have found a twin prime that's as large as 2,996,863,034,895 times two to the power of 1,290,000 plus or minus one. That is a pair of numbers each 388,342 digits long. If you wanted to print those in a book for some reason, you would need around 260 pages per number. All of this is to say that as far as we've looked, we've kept finding twin primes. But of course, that is not how you solve the twin prime conjecture, because you cannot physically check all the numbers out to infinity. So we need another way.
And around 100 years ago, it felt like mathematicians were getting close with a more sophisticated method. In 1923, English mathematicians Hardy and Littlewood figured out how you can estimate how many twin primes there should be. To do it, they started with one of the crowning achievements of number theory, the prime number theorem, which tells you that the odds of a large number near N being prime are roughly one over the natural logarithm of N.
So let's do a quick example to see how they use this. Say you wanna find the odds that 137,037 is prime. Well, you just plug that in and find that it has about an eight and a half percent chance of being prime. Of course, you could use the same trick to find the odds that 137,039 is prime, which is also around 8.5%. So what are the odds that both of them are prime? Well, you just multiply those odds together, that gives you around a 0.7% chance. Now we are simplifying a little here because we're assuming that primes are independent, which they're not. But putting that to one side, the odds of both a large number N and N plus two being prime is one over ln(N) times one over ln(N) plus two. For large N the plus two is insignificant. So we get the odds of any pair of numbers of size N being a twin prime are roughly one over ln(N) squared.
But of course, the problem with this approach is that the odds of a pair being a twin prime decrease as you go to larger numbers. So to count all twin primes up to N, we add up the odds at every single position. In other words, we integrate this expression from the first prime two up to that number. Now Hardy and Littlewood refine this argument further to account for the fact that primes aren't really independent. But all that added is just this correction factor upfront. So the final expression looks like this. Now let's plot how many twin primes this predicts up to 1 trillion. We see that it keeps growing and if we fill in the actual count, you'll notice that it is extremely close, to the point where by 1 trillion our estimate is only off by 0.001%.
"The only issue though, is that this is just an estimate or what mathematicians call a heuristic. It can tell you roughly how many twin primes to expect, but it can't guarantee that they never stop."
As Terry Tao puts it, for all we know, there could be this fast conspiracy that every time a number N decides to be prime, it has some secret agreement with its neighbor N plus two saying you're not allowed to be prime anymore. So what we need is a rigorous airtight mathematical proof, but finding such a proof it turns out is extremely difficult.
One of the first mathematicians to really try was a 29 year old Norwegian, Viggo Brun. Brun was working during the early years of the first World War, and with travel disrupted and Europe's mathematical centers largely out of reach, much of his work happened in quiet isolation back in Norway. His goal was simple: to try and prove that the number or count of twin primes always increases. And to do this, he took a 2000 year old prime counting tool and tried to adapt it for twin primes.
Now the normal version of this tool is called the sieve of Eratosthenes, and it works something like this. Say we want to find the primes up to 100. First we remove one because that's not prime. Then we circle the first prime, that is two, and we remove all of its multiples. We do the same for three. Circle it and remove its multiples, and then repeat that for five and seven. And with these four very quick operations, every single number that's left on your 10 by 10 grid is a prime number. Now we stop the sieve at seven, and we could keep going, sieve by 11 or 30, but it turns out that we don't have to. So why is that? Well, the next prime is 11. So let's check the numbers below 100 that are divisible by 11. 22 is two times 11, but since it's divisible by two, we already caught it. We also have 33, but that is divisible by three. And a similar logic holds for 55 and 77. But what about 11 times 11? Well notice that is 121, which is not in our range. So every multiple of 11 below 100 was already caught by a smaller prime. So sieving by 11 would cross out nothing new.
"So Eratosthenes has this idea that you only need to sieve out the primes that are at most square root the size of the number."
But there's also a second way to visualize this sieve. Take the number line and let each prime send out a wave with the wavelength equal to the size of that number. So the wave from two lands on every second number and the wave from three on every third and so on. Now notice that wherever a wave lands, that number has to be composite. But the numbers that escape every preceding wave, well those have to be prime. And so you end up with this beautiful visualization of the sieve. And both of these methods make it incredibly easy to identify the primes.
But Brun didn't care about which primes were left, he just needed to know how many were left. And to do this, he counted all the numbers that were crossed out and then subtracted this from the total, since whatever is not crossed out must be prime. So let's use this to find the number of primes below 100. We can keep a running count. So we'll start with 100. We're gonna throw out all the multiples of two. Well, that's 50 numbers left, okay? Then we're gonna throw away multiples of three. Well, how many multiples of three are there less than 100? 100 divided by three is not a whole number. It's like 33 and a third, but we don't throw out fractions. So okay, forget that stupid little fraction. We do the same for the multiples of five and seven, but now the count has gone negative. Well, it turns out that some numbers got crossed out twice, like six for example, it's a multiple of two and three. So we crossed it out once when we removed multiples of two and again when we removed multiples of three. So we have to add it back once. The same thing happens for multiples of 10, which is two times five, 14 which is two times seven, 15 which is three times five and so on. So we add these overlaps back and now the count rises to 28, but it's still not quite right.
"Now we've triple added back in the multiples of two times three times five, so that we need to subtract one more time."
The same thing happens for 42, which is two times three times seven, and 70, which is two times five times seven. There's a fourth triple, two times three times five times seven, which is 105, but that's already bigger than 100. So it contributes nothing here. But we'll write it in to keep the pattern complete. After the triples, we add back the quadruple, two times three times five times seven, which is 210. That's also bigger than 100, so it contributes zero as well. And finally, we add back the four sieving primes themselves and subtract one because it was included in the count. So the count lands at 25 primes. This process of alternating between subtracting and adding numbers back in is called inclusion exclusion. And it makes it super easy to find out how many primes there are up to some given number.
Now this equation still looks a little bit like a mess, but actually we can rearrange it into this form. Now notice that every prime factor we added just adds an additional term in this series. We've got the two that goes over here, the three goes over here, five goes over here, and seven goes over here. And if we were trying to find primes for some higher number and we wanted to add 11 as a prime factor, it would just add one additional one minus one over 11 term. Now in general, if we keep the sieve going up to some prime P, which is less than the square root of N, then the count becomes approximately this.
But this is still just the count of primes. What Brun really wanted was the count of twin primes. And so this is the part where he adapted his sieve, to find a case where both N and N plus two are prime. For example, when you're sieving by five, you remove the numbers where N is divisible by five, just as before. But in addition, you need to remove the numbers where N plus two is divisible by five. So you also remove 23, 28, 33 and so on, because in those cases, N plus two is 25, 30, 35 and so on. So this means that while in the ordinary sieve a prime P removes about one out of every P numbers, the twin prime sieve removes two. And this changes the count to this, where the numerator for all the terms after two ends up getting a two. Now, if we plot the count of twin primes that this predicts, we see that it grows roughly like N over the natural logarithm of N squared, just as for our heuristic. And so it seems like Brun did it, but unfortunately that is not the case.
"We said that 100 divided by three is 33 and a third, but forget about that little error, those little errors, it's very difficult to control those errors because there are so many of those little errors. Each one of them is at most one, it's a rounding error."
Take the traditional sieve of Eratosthenes again. When we're sieving with just one prime, we just have an over two. So one rounding error. With two primes, two and three, there are three separate rounding errors. But with three primes, two, three, and five, there are already seven separate rounding errors, each for one term in the inclusion exclusion. And so you might start to see the general pattern here. One prime gives two to the power one minus one terms, two gives two to the power two minus one terms, and three primes gives two to the power three minus one terms. In general, if you sieve by K primes, you get roughly two to the power K minus one error terms, but we can drop the one because it doesn't really make much of a difference. So that means that if you're sieving for normal primes, the error grows roughly like two to the power K. But if you're sieving for twin primes, it gets even worse. And now the error grows roughly like four to the power K. And so this causes a lot of trouble very, very quick.
"And then here for the twin primes, it's even worse. The main term grows a bit more slowly, but the error term grows much faster. So it still starts off slow. And so you can quickly see that once these error terms start dominating, you get into trouble. So the issue he was facing is he had this main term and the main term, you know, it grew in the way he wanted to, but then it just got overtaken and dominated by those error terms."
"Exactly. And that's all of analysis, in general analytic number theory in particular is the fight between the main term that you think is the truth, if it actually is the truth, and getting the error term to be provably lower order."
So for twin primes, the main term grows roughly like this, something like N over the natural logarithm of N squared. But that's not the true count, because to get the true count, we must add and subtract the error terms, which gives us this upper and lower bound. And now you can see the problem. Because the true count can be anywhere in this range, and as you can see, a big swath of it is negative. So to prove the conjecture, what we must do is we must show that this lower bound is always positive and growing. But that is where we run into a problem with the sieve we've been using. Because the more primes we sieve by, the more those error terms start to accumulate and we just can't show this.
"And so what Brun eventually realized is that if you weaken the sieve, if you don't sieve all the way up to square root X, but to up to something less than that, and you are very careful about your inclusion exclusion, you can actually take care of those error terms."
So run a sieve by fewer factors, only up to N to the power one over 10. And as a result, he gained enough control over the error term. But this chooses a trade off. Say you wanted to find all the prime numbers up to 10 billion, then using the square root method, you would need to sieve up to the square root of 10 billion, which is 100,000. But that gives you a lot of error terms. But with Brun's method, you would only need to sieve up to 10 billion to the power one over 10, which is just 10. But there's a catch. Because Brun didn't sieve by all the primes, there were some survivors that weren't prime at all, but numbers with many prime factors, in Brun's case up to nine. So what he actually ended up proving is that there are infinitely many pairs of numbers two apart, where each number has at most nine prime factors.
"Brun's techniques were improved and improved and improved, and we went from nine prime factors to seven prime factors to three prime factors. Until in 1973, a Chinese mathematician Chen Jingrun comes along and proved that there are infinitely many prime P, where P plus two has at most two prime factors."
"This is as close as you could get to proving the twin prime conjecture without actually getting there. We're stuck, like we're like as close as we can get and no further."
"Right. So that's one approximation, one mechanism for approximating twin primes. There's another. So the other mechanism is what's the smallest gap between two consecutive primes? So instead of saying they differ by two and one of them's prime and the other one, well, we're gonna try to reduce the number of prime factors that it has. Instead we're gonna say both numbers must be prime, but let's reduce the distance between them."
Now remember, on average consecutive primes sit about the natural logarithm of N apart. And so the question became, can we at least prove that primes come closer than that average gap? For decades, mathematicians kept chipping away at this problem. By 1988, the gap had dropped all the way down to roughly a quarter of the average gap. This means that say the average gap is 100 at some enormous scale, then primes must sometimes come within about 25 of each other. But then in 2005, Goldston, Pintz and Yildirim proved a result that shocked the mathematical community.
"And they announced the proof of a spectacular result, 0% bounded gaps, of 0% of the average."
"Wait, what?"
"They proved that you can make the fraction as small as you want."
This means the gap could be one tenth or one hundredth, or even one millionth of the average gap. It could be as arbitrarily small as you like, and infinitely often primes get that close.
"When I was young, we didn't know there were gaps of size less than say one tenth log X for infinitely many prime pairs, and Goldston and Yildirim came up with a method to attack that."
But the method also came close to something even bigger, an absolute bounded gap between two primes.
"And so that was like a big thing."
"What was the big desire to go from 0% or arbitrarily small to a concrete bounded gap?"
"Well, because we think that the bound between two consecutive primes infinitely often is two, and right now we're showing that it's log X, and X grows."
If they could get to bounded gaps, then they had another method to attack the conjecture. The only problem was that it seemed like their tool ran into a wall. And so in 2005, the American Institute of Mathematics convened a meeting, gathering all the world's leading experts on this problem. GPY, Andrew Granville, Kannan Soundararajan, all gathered for a week in California with one explicit goal, to prove a bounded gap between primes.
"I was a young graduate student, I was very lucky to get to go to this meeting. You're surrounded by the world's top experts on this subject. We spent an entire week. And the upshot of this week was basically that it's impossible. And Soundararajan shows how it's not possible to do this thing. And so as far as I was concerned, that was that, and I, you know, this wasn't gonna be my thesis problem, I'd have to do something else and I went off and did other things."
But there was one person who was not at this meeting, Yitang Zhang.
Zhang grew up in China, and around the age of 30 moved to the U.S. to get his PhD in mathematics, but he never got any recommendation letters and so struggled to find a job. He ended up living in his car for a time and ultimately working odd jobs for seven years, including at Subway, where he kept the books and sorted receipts. Yet while doing all of that, in his spare time he would drive down to the local library to read books and journals on number theory. Then in 1999, he got a lucky break. One of his friends helped him get a job as a lecturer at the University of New Hampshire. So now he could spend all his time doing what he loved most, math. Zhang said that when he was a kid, he "imagined there would be a day that I would solve a major math problem." And by 2010, he had identified his major problem to focus on, bounded gaps between primes.
But to understand what he was working on, we need to understand what exactly it is that GPY had built. See, they wanted to find two primes within a bounded gap. And to do this, they imagined a stencil of sorts with some holes in it. So let's do the same, and for the sake of this example, let's say the stencil has a diameter of six and holes at spot zero, two and six. Next we place the stencil anywhere on the number line, say starting at 12. And we ask a simple question, how many primes do we see? In this case, we see 12, 14, and 18, none of which are prime. So we write down zero. Next we shift the stencil over one spot and repeat, 13, 15, and 19, two of which are prime. So we write down two. And then we keep doing this. If you slide it a bit further, we catch two primes again, 23 and 29. Now, if you keep moving the stencil across the number line and it keeps catching two primes, then you have proven that infinitely often pairs of primes exist within a bounded gap, six in this case, since that's the diameter of our stencil.
But there are two problems with this approach. The first is that we don't know exactly where all the primes are. And the second is that of course you can't keep sliding a stencil down the number line forever because well, it's infinitely long. Fortunately, there is an easy way to get around that first problem. See, while we don't know exactly where all the primes are, we do know how they behave on average. For example, say we take some stretch of the number line from some very large number X to two X, then we can't predict exactly which numbers in this stretch will be prime. But what we can do is put this into the prime number theorem and estimate how many primes to expect on average. And GPY realized they could do a similar thing for their stencil method, to build an averaging machine of sorts. All you do is you set up the stencil you want, input your range, and out comes the average number of primes it caught per position.
So let's do a quick example without the machine to see how this helps. Let's use the same stencil from before and use the range 20 to 40. Now, in reality, you would use a much larger range, but for simplicity, let's stick to this. To find the average, you just place the stencil at the start of your range, and you'll look at which numbers you see, 20, 22 and 26. None of which are prime and so the total prime count is zero. Then we shift the stencil over one spot and see 21, 23 and 27. One of which is prime, so we increase the total by one. Now let's keep shifting the stencil and keep adding all the primes we see. No primes, so add zero, two primes, add two, and so on. Now, if you keep shifting the stencil all the way until the end, you find a total of 14 primes across 21 starting positions. And so the average is 14 over 21, or about 0.67 primes per location.
Now in this case where we checked every spot by hand, we could see that the stencil caught several positions with two primes in them. But the averaging machine doesn't tell us this. All that spits out is an average, and that average alone can't guarantee that it caught two primes at once, because you could just as well imagine spreading all of these 14 primes over 14 individual positions. And the same would be true if the average was 0.8, 0.9, or even one. I know it's unlikely, but it could be that every position only caught one prime to bring the average to one. But what if the average was larger than one? What if we found 22 primes in total? Well, even if every position had caught one prime, you would still have one prime left over and that needs to go in one of those other positions. So in other words, if the average is above one, then that guarantees that at least one position caught two primes. And so that's the game. Get the average number of primes above one.
But that still leaves one question. Even if you could do that, then how does that help you prove that you keep getting bounded gaps all the way up to infinity? Well, remember you just proved that for any large number X, so there's nothing stopping you from replacing that X with two X, and that two X with four X, and the four X with eight X, and so on all the way up to infinity. So if you can prove that the average gap is above one for any large number X, then with this clever setup, it immediately extends all the way out to infinity, and you would've proven your bounded gap.
Unfortunately, running the machine with just the current inputs, the stencil visits many positions where it catches zero primes, this drags the average down and means it will never get above one. So GPY made one final tweak to their machine. They realized that some starting positions are more likely to catch primes than others. For example, if the stencil only lands on even numbers, it will always give zero primes unless it includes two. So those positions should not be weighted equally to the others. Similarly, if you can divide one or more of the numbers in the stencil by three, five, seven, or other small prime factors, it's also more unlikely you have primes in your stencil. So they should also be weighted less. Now, GPY did a few simple checks like this for each position to assign it a weight, an estimate of how likely it is that this position contains primes. This turned their averaging machine into a weighted averaging machine. And as a result, that weighted average shot up.
With their updated machine they proved that if you pick any large number X, then in the range X to two X, you could get the weighted average arbitrarily close to one, but not above one. They were close, but they ran into a wall. A wall that came from their weights, because to figure out how to weigh different positions, they needed to know how primes are distributed in something known as an arithmetic progression. These are basically just series of numbers where they're all the same step size apart. So 3, 7, 11, 15, that's an arithmetic progression with a step size of four. Now, GPY needed to know how primes were distributed over many such arithmetic progressions with all kinds of different step sizes. Fortunately, they knew of a theorem from the 1960s they could use to do exactly this.
"It's the substitute for the Riemann hypothesis on average, and it turns out that that's what's actually necessary in a lot of these arithmetic applications."
But that theorem doesn't work everywhere. It only works for step sizes up to the square root of X, or X to the one half. The technical term for this one half is the level of distribution, or theta. It tells you how large the step sizes can be while still getting a reliable count of primes in those progressions. And mathematicians believed that the theorem worked beyond X to the half, but this was never proven, and that is where the trouble lies. When GPY ran their calculation with this ceiling, they found the maximum weighted average they could get was two times theta. In other words, they got really close to one, but they could never actually cross it.
"They showed in their method that if you could get beyond a half, you could do 0.5000001, then instead of just getting 0% of the average gap, they can get bounded gaps. The primes would be some fixed distance apart."
That's what the 2005 meeting was all about. They kept trying to push past that one half barrier, but they ended up concluding that it was impossible.
From 2010 on, Zhang spent two years hacking away at it as well, internalizing the GPY argument, working day and night just to try and find a way through. But by the summer of 2012, Zhang was exhausted, and had nothing to show for it. Hoping to clear his head, he visited a friend in Colorado, where one evening while they were waiting to leave for a concert, Zhang stepped outside alone into the backyard, looking out for the deer that usually roamed the property, but no deer came. So he was just walking and thinking when all of a sudden the answer came to him. He hadn't brought any notes or paper, but Zhang believed his idea would work.
You see, GPY had to count primes in many different arithmetic progressions with all sorts of step sizes. But Zhang focused on a special class of step sizes, ones built only from small prime factors, and he realized he could reorganize the error terms into a form where most of them get canceled out. This allowed him to push past the half barrier by a tiny fraction, just one over 584. Zhang spent the next year finalizing his proof, and then on the 17th of April, 2013, he sent off that curious email to the Annals of Mathematics.
"Now, the Annals gets a proof of the Riemann Hypothesis every other day, so they're like, okay, surely this isn't gonna work out, but whatever. And they're like, well, let's spend a few hours this afternoon. We'll find a mistake. We'll tell the annals, you know, sorry, it didn't work out. Here's where the gap is. And they start reading it, and then they flip back, because they're experts. So they can just flip through like, yeah, this'll work, this will work, but wait a second, you're gonna get stuck here. And they flip five pages and they're like, oh, that's interesting. That's how you're gonna handle that. Okay, but then you're gonna have another, it's like trying to lay down a carpet in a room. You're like, okay, I know that corner, that corner's gonna screw you up even if you managed to make it work over there. And then he goes over there, no, he cut it just right. It fits in that corner. Wait a second. And then they flip, you know, five more pages and by the end of the week, they've reconstructed the proof and everything's right."
"Wow."
Zhang's stencil had 3.5 million slots, spread across a span of 70 million. And by proving two of those slots always catch primes, he proved a bounded gap of 70 million. When the news broke, mathematicians were in disbelief.
"It was basically an unknown in the field. I actually thought when I started reading Yitang Zhang's paper and started realizing it was probably correct, I thought it was probably one of the people I knew under a pseudonym, trying to avoid being embarrassed by being wrong, if they were wrong."
Of course it wasn't, Zhang was real. And just over a year later he was awarded the MacArthur Genius Grant.
"To me, this is a really interesting aspect of mathematics in particular. I think it's one of the very few fields where we have a truly honest approach to success and what counts as success. He sent in an argument, people took it seriously. They looked at it. They didn't think that he had a true proof. They sat there, they gave him the time that the argument was due, and he was immediately made a hero as he should have been. To me, it shows that mathematics culture works. We're doing the right thing. I think the cover of like Scientific American or whatever in 2013 is the story Yitang Zhang and how he got this bounded gap between primes toiling in complete isolation. And maybe it's because he was in isolation that he didn't have the group think that the rest of us did. To know that this was not, we were all told it's an impossible problem."
After realizing a bounded gap was not an impossible dream, people reworked Zhang's method to optimize it. Terence Tao spearheaded an online group called Polymath, and they sharpened the method. So every month, week or day, that upper bound kept coming down. One of the attendees described being at a conference at the time: "as I remember, sort of day by day, everyone was refreshing their screens to see who had the world record now." And ultimately they got the number all the way down to 4,680.
Meanwhile, there's a young postdoc named James Maynard, who just got his PhD at Oxford with Roger Heath-Brown, another one of these analytic number theory world experts. And he's working with Andrew Granville in Montreal. And he has a completely orthogonal approach to this problem that he'd been making some very incremental progress on independently.
When Maynard started, his advisor explicitly told him, "I hope you won't work on this problem full time, because I'm really pretty confident you're going to fail." But Maynard ignored the warning, and came up with a different way to attack the problem, and within just a few months, he got it to work. Further bringing down the gap to 600. But his method also proved something else.
"He can get three primes in a bounded window. The bound has to change depending on how many primes you wanna put in that window, and his number is better. And the method has nothing to do with the exponent one half."
"One half was a pure mirage. It was a red herring."
"That is crazy."
One half was not a fundamental limit at all. See where GPY's average was stuck at two times theta, Maynard's average grew roughly like theta over two times the natural logarithm of K, where K is the number of slots in the stencil. So all Maynard needed was enough slots and it worked.
"You need any number greater than zero. When you go up to one half because you have one half, you'll get better numerics. But the bounded gaps you could do, just going up to, you know, 0.01, not 0.50101."
Now, curiously, Terry Tao independently has the same approach. He tells Ben Green about it, Ben Green is meeting with Andrew Granville, and says, "Hey, Terry's got this new idea that he thinks is gonna get even farther." And Granville says, wait a second. My postdoc, Maynard, is doing exactly the same thing. We gotta get these two guys to talk to one another. Terry's a Fields medalist, he's like a super heavyweight by this point, whereas James is this, you know, fresh PhD, and Terry says, you take it, I don't need this. You know, this, it's your idea, you go with it.
By early 2014, Maynard joined forces with Tao's Polymath group.
"It was clear that somehow 600 was just a proof of concept, and the same methods would give something smaller, but there were extra ideas that you could maybe use to squeeze everything out."
And so the current world record is that there's infinitely many pairs of primes that differ by no more than 246. And that for now is where it stops. In 2022, James Maynard was awarded the Fields Medal, mathematics' highest honor, for his work on prime gaps.
"Yeah, I think that's the basic outline, is the kind of two directions of approximation to twin primes, Chen's direction, Brun, Chen, whatever, versus Zhang and Maynard, and this story that Zhang wasn't at this meeting, so he didn't know it was impossible."
It reminded me of the four minute mile where everyone thought it was impossible. No one did it. And then for the first time ever, Roger Bannister broke it in 1954, and after knowing it was possible, just 46 days later, another runner called John Landy broke it as well. And by the end of 1956, 10 people had broken it, all from knowing that it was possible.
So what about pushing that gap down even lower? Is that possible? Well, mathematicians have found ways to bring it down even more, but all of these results are conditional. For example, there is the Elliott-Halberstam conjecture, which assumes primes are spread evenly across arithmetic progressions, or that the level of distribution can be taken as large as one. And in 2013, Maynard showed that if you assume this conjecture is true, then the gap plummets to just 12. A year later, the Polymath Group proved that if you assume an even stronger version of this conjecture, the gap drops all the way down to six. But without assuming anything, 246 still stands.
"Let me ask you, do you think we're gonna solve the twin prime conjecture?"
"I am totally convinced that humanity will eventually solve it. Often when I'm asked about some of these big famous problems like Twin Primes or Riemann or something like that, one way of kind of avoiding the question is that if you imagine you are randomly distributed in time, you'd expect to be sort of typically roughly halfway through when the conjecture's been open for. So an actual guess, if you have no other information, is to guess that a problem will be open for as long as it has been open for already. But obviously this doesn't work for twin primes, because we don't know whether it's 125 years old or whether it's like 2000 years old. So I think it's a fool's game to guess, but it clearly needs a really big idea, but maybe it only needs one big idea."
Although maybe, just maybe, not knowing this with certainty is for the best. Because if we knew for sure that this was impossible, then we would've likely missed out on most of these inventions and new methods over the past century. So sometimes it pays not to know everything.