At a glance
Nate B Jones is obsessed with Codex the way a kid is obsessed with a new PlayStation, and this video is his attempt to explain why before people bounce off the tool for the wrong reason, namely that the name sounds like it is only for coders. His claim is bigger than code. Codex changed how his computer feels, because his files, browser, folders, drafts, and screenshots now belong to an agent he can hand work to in plain English, not just an app he operates by hand. He maps a year of his own token usage and shows that the biggest jump came in the last month, when computer use plus a Codex 5.5 model unlocked a pile of his workflows at once, pushing him to 300 to 500 million tokens on an ordinary day. The throughline is that we are living through the first change in the computing paradigm in roughly forty years, from an application-first world where the human is the router to an agent-first world where the human sits above the machine and delegates. The practical payload is a repeatable delegation loop you can run today: pick one annoying, valuable job and give Codex five things, a goal, sources, a standard, a permission boundary, and the proof that it is done.
Codex makes the computer feel different
Jones opens with the feeling, not the feature list. He keeps wanting to grab people and say "no, no, no, you have to see what this thing just did," because Codex is not just giving him better AI answers, it is making his computer feel different. His files, his browser, his folders, his drafts and screenshots all belong to Codex now. The weird little personal systems he used to wire together by hand, Codex moves across all of them. That is why his token dashboard has gotten ridiculous lately. Not because he is chatting more, but because he is handing Codex bigger jobs.
He draws the line precisely. Before Codex, most of his AI work still ended up looking like chat unless it was code: draft this, summarize this, clean this up, help me think through this. Useful, but it was still him asking for help. With Codex he started doing something else. He started giving his computer jobs. Find the transcript, read the folder, compare the versions, render the Word file, check that it opens, open the browser, use the site, keep going until there is something real for me to inspect. That is the shift that blows his mind. Not that Codex writes code, but that it makes the computer feel like something you can hand work to.
Then the disclaimer he wants out of the way early, stated as a refusal rather than a hedge: this is not a take-a-side video, he is not asking you to join team OpenAI, it is just a deep dive into why Codex works for him. And if you have avoided it because the word Codex sounds like code, that is exactly why you should stay. The name is bad, honestly. It sounds like a developer tool. Developers see it first only because coding has a clean working environment, clean tests, files, diffs, logs, that made it easy for an agent to engage. But the habit Codex teaches is much bigger than code. If you write, research, make documents or Excel spreadsheets, run a tiny business, organize projects, manage content, or just spend your day switching apps and opening a dozen Chrome tabs, the point is not that Codex can write software. The point is that Codex helps you get all of that done on the computer you already use. And yes, it is on Windows now too.
He previews the whole tour: the Chief of Staff thread, goals, multiple threads, computer use, plugins, skills, drafting several artifacts at once, using websites, checking work, and turning repeated corrections into reusable workflows. He also frames the video as a learning thread, an invitation to people already pushing the tool to tell him what he is missing in the comments, because the real playbook is still being written by the people who use it hard.
The token dashboard as a work receipt
On May 20th his local Codex log showed 510 million tokens in one day. He knows that sounds insane, so he heads off the wrong conclusions immediately. This is not all of his AI tokens, just what ran under his Codex Max account. It is not a surprise billing story, he is not paying extra. The point is that the way he uses his computer changed completely, and he thinks we are sleeping on it. More of his computer stopped being app-by-app manual work and started running through agents. Most of the work he does now goes through agents and Codex, not apps directly, and when he does open an app it feels like a hassle. His files, browser sessions, documents, code, and terminal output all route through Codex.
The number itself is not the point, and he is blunt about that: do not wake up and decide "Nate said burn half a billion tokens," that would be a dumb target. The number matters because it shows the computer itself is changing. We have computed for decades in bits and bytes, and now we are moving to tokens, and he says he can prove this is the biggest shift. He mapped his token burn back over a year, and behaviorally it is very clear the biggest jump came in roughly the last month, as computer use plus the 5.5 model in Codex unlocked a huge number of his workflows at once.
So when he cites the giant number, half a billion, 800 million, whatever, he is not bragging about volume. If that only meant he was typing more prompts, it would be embarrassing. It went up because the unit of work changed in scale. He stopped asking AI only for answers and started asking Codex to carry more of the job: go find the source files, go read the transcript, go make the artifact, go render the document, go check the package, go inspect the browser, go keep working until the goal is done. When he says Codex 10x'd parts of his workflow, he does not mean he got 10x smarter. He means the size of the job he is willing to hand the machine changed. The chart is not a scoreboard or a vanity metric. It is a receipt that reflects how the work has changed.
The unit of work gets bigger, and the new compute model
He names the deeper question: what is the compute model, and what is changing? For most of our lives computers have been application-first, and that was itself a revolution. He remembers DOS, when the app became the unit of work. In the 1990s he could write a document without writing code, open a browser, he name-checks Netscape Navigator, open a spreadsheet and just do the work. Huge productivity gains. But the human moved between the apps, the human remembered why each one was open, the whole experience was built around the human first.
He grounds the new model in a story. He made a TikTok recently pointing out that his computer feels like it belongs to Codex as much as to him, because sometimes he cannot use it: it is burning literally 100 million tokens an hour and you can hear it hissing in the background. It runs at max memory capacity while he records, and he does not mind, because it is doing ten things at once that he could never do at once. He just hands out assignments, goes for a walk, touches grass, comes back, and ten things are done.
That is the paradigm shift, and he dates it: the first change in the computing paradigm in about forty years. We are moving from a world where humans sit at the center of the compute paradigm to one where humans sit above it and delegate to agents who run the compute for us. Codex is a way into that future, not the only one, and he is explicit that he is not claiming Anthropic will not get there, he knows they will. Underneath Codex sit the old primitives, files, source notes, templates, the applications themselves, and Codex can drive all of them with agents. The mechanism, in plainer words, is a state machine, which is a fancy way of saying an agent in a loop that remembers what it is doing and can work the whole computer. Tokens are the cost of letting that agent compute for you, so the more of your work runs through agents, the more your computer activity becomes token activity. That is the simplest answer for half a billion tokens a day, and he insists it is not an anomaly: he is easily doing 300, 400, 500 million tokens a day without trying hard. The goal is never to burn tokens or be wasteful. The goal is to put an active layer between your intent and the machine, so the intelligence can start to scale for you.
| Application-first (the last 40 years) | Agent-first (the shift now) | |
|---|---|---|
| Unit of work | the app (DOS, the document, the browser) | the job, handed off in plain English |
| Who routes | the human moves between apps human is the router | the agent runs the loop human delegates |
| Who remembers | you, why each app is open and what is current | the thread, the goal, folders, artifacts, standard |
| Cost is measured in | bits and bytes | tokens |
| Your posture | operating the machine, hands on keys | sitting above it, assigning and checking |
| Example move | open the app, do the step, switch apps | go find the files, make the artifact, keep going until done |
Chief of staff threads
The first thing that made Codex click was that he stopped treating every thread like a random chat. Most people use AI as a pile of separate conversations: one chat for a draft, one for a bug, one for a note, one for a random question. The problem is that the human becomes the router. You have to remember where everything is, what matters, what the next move was, which version is current, what standard the work is supposed to meet. That does not scale, because our brains get tired.
The better pattern is one thread that stays pointed at the work. It knows the goal, the folders, the current artifacts, and the standard, and then it can spin out smaller jobs without making you re-explain the whole project every time. That is what he means by a Chief of Staff thread. It is not magic memory, he is careful here: you still have to give it sources, still have to correct it, still have to make it show receipts. But used this way it stops feeling like a chatbot and starts feeling like a home base for the work.
Threads, goals, and subagents
The next change was getting serious about goals and threads, which sounds small until you use it on a real project. A normal chatbot tends to stop the moment it has produced something that looks like an answer. Codex gets far more useful when you give it the actual objective: not "help me with this," but "read these sources, produce this artifact, check it against the standard, and do not stop at the first plausible draft, keep going." That changes the relationship. You are not asking for a response, you are assigning a job out.
He clears up a common confusion. A thread is not one agent doing every step itself. A thread is the run that owns the job, and a subagent is a smaller helper inside that job, used for a narrow piece of work so the main thread does not get buried in noise. One thread can plan the goal, using subagents for discovery, source checking, scouting, and reading through messy material. When the goal is cleaner, you can send it to another thread to execute. The execution thread owns the deliverable but still uses subagents inside the job: one scouts a site, another checks sources, another inspects output, another summarizes a noisy folder. The thread owns the job as a whole, the subagents handle contained pieces. Once that clicks, thread mode stops looking like a bunch of chats and starts looking like a way to separate planning, execution, and checking. And with the Chief of Staff pattern you manage most of this just by talking to your Chief of Staff. You do not hand-assign work to individual agents anymore, that is not how it works.
Computer use, plugins, and skills
The power is not one magic prompt, it is the setup around the model. He breaks the setup into named parts. Computer use is literal: it can see a screen, click, type, and use an app. Tools and connectors call real systems, plugins let it reach the places where your work already lives. Skills let you teach it a reusable way to do a job instead of explaining the same process every single time, and that last part matters most. If you correct Codex once, that is just a chat you had. If you turn the correction into a skill, a checklist, a reusable instruction, the work begins to compound.
This is where the code label is most misleading. Developers understand it first because they already live in a world of tools, files, tests, and workflows. But that same pattern now applies to documents, reports, research, invoices, dashboards, meeting prep, family logistics, and customer support. All of it can use code patterns to get better with Codex. If the work lives on your computer, Codex can help you get it done using patterns it learned from code, and you do not have to know code to do it.
A heads-up dashboard for work: the big example
The workflow he picks to demonstrate a big loop is a heads-up dashboard for your whole work day. Imagine, instead of buying some SaaS that forces a fixed shape on you, plug into Slack, plug into email, and still does not cover everything, you build an exact heads-up display that surfaces what matters in your workplace, custom-tuned to your tools. You can do it now, and it is not that hard. The recipe has three explicit steps you tell Codex:
- Tell Codex about all the sources you use to do work: email, Slack, WhatsApp messages, the "carrier pigeon" messages, whatever it is. Those are all your sources.
- Tell it what matters to you, how you actually move the needle in your job, and have an honest discussion about how you refer to those sources and what is salient in each one.
- Ask it to design a dashboard that is live-updateable based on the sources it can pull from, via computer use or via an MCP server. Some sources have an MCP server, Slack has an MCP server skill, others it reaches with computer use in the browser, and that is fine.
The result is a personal heads-up display for work. At any point you can glance at it and know what matters in Slack, what matters in email, what you have to do, and what your prioritized list is, then go get it. Crucially, nobody built it with a seed round and a pile of VC money. You built it just for you, the way you work, with your data, and Codex can do that today. He notes the full readout lives on his Substack, and you can see examples over his shoulder because he actually built it on screen.
The dashboard is also a teaching device for two ideas: big loops and automations. You can wire an automation that updates the dashboard every 15 minutes or every half hour, your call. It checks all the data sources, runs the saliency analysis to see what really matters, then comes back and says "this is what I think matters, this is how I re-rank the priority, this is what to emphasize for work." It becomes your headquarters for work every day, and you custom-built it. He flags why this is new: as cool as the models were even two or three months ago, and as far into the long-running agentic revolution as we are, we did not have the computer availability and the computer use availability to unlock this. He picked it because it shows something only Codex could do today, and he expects other models to follow soon.
He highlights one feature that makes the loop reliable, the set-of-goal feature (the "set of gold" in his telling). It zeros Codex in on the goal you define and runs through walls until it reaches the done state. He loves it because he wants agents that do not stop early. He recalls the Ralph Wiggum loop era, back in January and February, when everyone was excited because Claude kept stopping on agent loops and "Ralph" made it keep going. With Codex you do not need that trick: you set a goal and it just keeps going.
The first useful Codex loop
His advice for newcomers is to not start by automating your whole life. Pick one loop that is annoying and valuable. He rattles off candidates: turn this transcript into a brief, organize this source folder, build a simple dashboard to track inbound email subscriptions, prepare my day from calendar, email, and Slack, draft three versions of this document and explain the difference, check this package and tell me what is missing. Then give Codex the five things: a goal, sources, a standard, a permission boundary, and the proof that it is done. That is the most basic way to set up a loop. It is not a fancy prompt or a hack, you are just setting up a loop, a real assignment with real sources and a way to check the results.
For people already using Codex, the next level is to hunt for the loops you keep repeating. Every time you find yourself giving the same correction, writing the same setup note, asking for the same kind of review, or checking the same kind of output, ask whether that should become a skill, a standing workflow, an automation, or a memory for Codex. That is the moment it stops being one-off help and starts becoming a system that evolves with you through a series of automated loops.
Boundaries, receipts, and responsible delegation
When he says Codex is blowing his mind, he does not mean he wants agents running around his life without rules, he means the opposite. The more powerful the tool gets, the more the boundaries matter. Do not paste API keys or passwords into the chat, learn to use a .env file, it is not hard and it keeps secrets out of the prompt. Do not give it write access just because read access would be useful. Do not let it send, publish, delete, or spend money unless you truly understand the workflow.
And when it produces something important, make it show the receipts. This is the part of Codex he finds most interesting, not just that it is powerful, but that it is easy to inspect. It will show you the files, the logs, the tests, the renders, and the command output, so you can build a habit of getting proof from your agent. That is what keeps the whole thing from turning into hype and wishful thinking. The tool matters because it lets you hand off more work responsibly, and the skill is learning to do that without getting sloppy.
The new computer literacy
He closes on why he made the video. Codex is changing how he works, and the story is not only for developers. He repeats the refusal: he is not asking you to pick a side in a platform fight between OpenAI and Anthropic, he is asking you to pay attention to what Codex lets you practice and see if it is useful. If you do knowledge work, write, research, manage projects, build documents, run support, plan your life, or spend your day moving between apps, this matters to you.
Codex, he says, is one of the first tools that lets you practice a new kind of computer literacy, the literacy of the future. Not typing, not prompting, but handing work to agents that can truly use the computer and then learning how to check what came back. That is why he built the token dashboard and why he wanted a real deep dive instead of a quick reaction. He points to the practical checklist, examples, and setup notes on his Substack, where there is an active community and a Slack already building with Codex, and he wants the comments to be useful too: if you use Codex, tell him how, if you have a better workflow, show it. The closing note is the same energy he opened with. Computing is changing, Codex is the tip of the spear, and he is excited to see what you build.
Key takeaways
- The real story is not that Codex writes code. It is that it makes the computer feel like something you can hand whole jobs to in plain English, so your files, browser, drafts, and screenshots come into range of an agent.
- The token number is a receipt, not a target. Jones hit 510 million tokens in a day, and routinely does 300 to 500 million, because the unit of work got bigger, not because he prompts more. Do not chase the number.
- We are in the first computing paradigm shift in about 40 years: from application-first, where the human is the router moving between apps, to agent-first, where the human sits above the machine and delegates to agents in a loop.
- A thread is the run that owns a job; a subagent is a narrow helper inside it. Separate planning, execution, and checking into threads, and manage it all through a single Chief of Staff thread that holds the durable context.
- The power is the setup around the model, not a magic prompt: computer use (see, click, type), tools and connectors, plugins, and especially skills, which turn a one-time correction into compounding, reusable instruction.
- The flagship example is a personal heads-up dashboard built from your real sources (email, Slack, WhatsApp) via MCP servers and computer use, refreshed by an automation that runs a saliency analysis every 15 to 30 minutes. You build it for yourself, with your data, no VC required.
- The repeatable delegation loop is five things: a goal (with a clear definition of done), sources, a standard, a permission boundary, and proof that it is done. The set-of-goal behavior keeps the agent going until done instead of stopping at the first plausible draft.
- Start with one annoying, valuable loop, not your whole life. Then graduate by spotting the corrections and reviews you repeat and turning them into skills, workflows, automations, or memory.
- More power demands tighter boundaries. Keep secrets in a .env file, withhold write and spend access unless you understand the workflow, and always make the agent show receipts (files, logs, tests, renders, command output).
- The new computer literacy is not typing or prompting. It is handing work to agents that can truly use the computer and then learning how to check what came back.
Chapters
Timestamps are clickable. Click one and the player jumps there and keeps playing while you read.
- 0:00 Codex makes the computer feel different
- 3:07 The token dashboard as a work receipt
- 4:45 The unit of work gets bigger
- 6:37 A new computing paradigm
- 8:12 Chief of staff threads
- 9:44 Threads, goals, and subagents
- 10:53 Computer use, plugins, and skills
- 12:04 A heads-up dashboard for work
- 16:08 The first useful Codex loop
- 17:09 Boundaries, receipts, and responsible delegation
- 18:34 The new computer literacy
Notable quotes
Codex is not just giving me better AI answers, it's making my computer feel different. Nate B Jones, 0:08
That is why Codex is blowing my mind. Not because it writes code, but because it makes the computer feel like something I can hand work to. Nate B Jones, 0:44
The number went up because the unit of work fundamentally changed in scale. I stopped asking AI only for answers, and I started asking Codex to carry more of the job. Nate B Jones, 4:09
It's the first change in the computing paradigm in like 40 years. We're moving from a world where humans were the center of the computing paradigm to where humans sit above the computing paradigm and we delegate to agents who run the compute for us. Nate B Jones, 6:45
The problem is that the human becomes the router. You have to remember where everything is. Nate B Jones, 8:25
If I correct Codex once, that's just a chat that I had. If I turn the correction into a skill, into a checklist, into a reusable instruction, the work begins to compound. Nate B Jones, 11:13
Give it a goal, give it sources, give it a standard, give it a permission boundary, and give it the proof that it's done. That's the most basic way to set up a loop. Nate B Jones, 16:52
The more powerful the tool gets, the more important the boundaries get. Nate B Jones, 17:13
Codex is one of the first tools that lets you practice a new kind of computer literacy, the computer literacy of the future. Not typing, not prompting, but handing work to agents that can truly use the computer and then learning how to check what came back. Nate B Jones, 18:52
Resources mentioned
- Nate B Jones, the channel (AI News & Strategy Daily), where this deep dive lives.
- Codex, OpenAI's agentic tool and the subject of the whole video, including the Codex Max plan he runs his token burn under.
- Anthropic and Claude, named as the other side he is explicitly not asking you to take a side against, and as a lab he expects to reach the same place.
- The Model Context Protocol (MCP) and MCP servers, how Codex reaches sources like Slack with a server skill rather than browser computer use.
- Slack, WhatsApp, and email, the everyday work sources you wire into the heads-up dashboard.
- Microsoft Word and Excel, examples of the documents and spreadsheets Codex renders and checks.
- Google Chrome and Windows, the dozen tabs and the platform Codex now runs on.
- MS-DOS and Netscape Navigator, his touchstones for the last big shift to application-first computing.
- The Ralph Wiggum loop, the early 2026 trick for making an agent keep going that Codex's set-of-goal behavior makes unnecessary.
- A .env file, his recommended way to keep API keys and secrets out of the prompt.
- Nate's Substack newsletter, where the full dashboard readout, the checklist, the examples, and the active building community (and Slack) live.
Where it stands
This is an enthusiasm piece, and Jones flags it himself, he is "obsessed" and "learning in public," so the honest footnotes are about claims versus hopes. The strongest, most durable idea is the delegation loop: a goal, sources, a standard, a permission boundary, and proof of done is a clean, tool-agnostic discipline that works whether or not Codex stays ahead, and the boundaries-and-receipts section is genuinely good security hygiene. The softest material is the framing around the token numbers. A 510 million token day is a vivid receipt, but he is careful to say it is not a target, and it is worth keeping that caveat front of mind, raw token burn is not a measure of value or even of efficiency, and a heavier loop is not automatically a better one. His "first paradigm shift in 40 years" line is a marketing-scale claim, persuasive as narrative, unfalsifiable as history, and the GUI era and the mobile and web eras would each have a claim to the title too. Finally, several specifics rest on his own setup and may shift fast: the exact model name (Codex 5.5), the "set of goal" feature, and which sources expose an MCP server are all moving targets in a tool this new. He is honest that other models will do this soon, which is the right read: the loop is the lasting lesson, the particular vendor is the example of the week.


