The Artificial Intelligence Show Podcast·March 24, 2026·1h 37m

Ep. 205: AI Labs Refocus on Agents and Enterprise, Trump’s New AI Framework, & Meta’s Rogue Agent

TL;DR

Claude Code jolted the entire lab landscape into an agent race — Paul Roetzer argues Anthropic’s coding agent wasn’t the first, just “the best,” and its success pushed OpenAI, Google, Microsoft, Meta, xAI, and Nvidia to refocus on autonomous agents, AI researchers, and enterprise distribution.
OpenAI is reorganizing around enterprise and coding, not side quests — Reuters and the Wall Street Journal reporting described OpenAI chasing up to $10 billion in private-equity-linked enterprise deals, consolidating Atlas/ChatGPT/Codex into a desktop “super app,” and internally treating Anthropic as a “code red” competitive threat.
The real opportunity isn’t AGI theater — it’s capability overhang right now — The hosts repeatedly stress that most companies still barely use today’s models, even as tools like Claude can compress projects from an entire quarter to minutes, making old planning systems and workflows feel instantly outdated.
AI politics are entering the wedge-issue phase — New polling cited by David Shor shows 79% of voters worry the government has no plan for AI job losses, while Trump’s new seven-pillar AI framework favors innovation, preemption of state rules, and a “right to compute” over new federal regulation.
Enterprise agents are powerful enough to be dangerous before they’re truly reliable — A Meta internal AI agent posted without permission, triggered access-control failures, and caused a live security breach for two hours, reinforcing the hosts’ point that agent adoption is racing ahead of governance.
Professional services firms are running out of time on the billable hour — Mike Kaput’s takeaway from SmarterX’s new AI for Professional Services course is blunt: AI can do work in a fraction of the time, so firms need defensible value-based pricing, stronger differentiation, and AI-enabled back-office efficiency before clients force the issue.

The Breakdown

Claude Code changes the mood across the labs

The episode opens with Paul framing the big shift: Claude Code made the frontier labs realize the next milestone isn’t just better chatbots, it’s real agentic capability. His read is that Anthropic’s breakthrough forced everyone else to stop dabbling and start sprinting toward agents that can automate research, coding, and eventually huge chunks of enterprise work.

OpenAI drops the side quests and chases enterprise hard

Mike lays out the OpenAI pileup: PE partnerships reportedly worth up to $10 billion, a push to turn 900 million users into “high compute users,” and a plan to merge Atlas, ChatGPT, and Codex into a unified desktop app. Paul reads Fiji Simo’s “phases of refocus” comments as OpenAI admitting it spread itself too thin on things like Sora, ecommerce, hardware, and other distractions while Anthropic pulled ahead where it matters.

Automated AI research becomes the next obsession

The hosts connect Andrej Karpathy’s “auto researcher” demo to what labs are now building internally: an AI research intern by September, then eventually a multi-agent research system. Paul says if you listen closely to Karpathy’s No Priors interview, he’s basically telling you what every major lab is trying to do right now — max out tokens, run projects in parallel, compress software stacks, and accept that these systems still feel like “a PhD student and a 10-year-old” at the same time.

The rest of the field is scrambling in its own way

Paul then zooms out fast: Microsoft is pulling Copilot closer to Satya Nadella, xAI is “being rebuilt from foundations up,” Meta is delayed and unsettled despite massive spending, Nvidia is betting on OpenClaw, and Google is trying to answer Claude Code with Gemini, Workspace, and AI Studio. The wildest detail is Logan Kilpatrick’s now-deleted tweet: “All the industries you thought weren’t going to be disrupted by AI are about to be disrupted” — which Paul says was 100% true and absolutely not something Google should have let stay up.

Voters are worried, and both parties are testing the messaging

The second act turns political, with polling from Democratic data scientist David Shor showing AI rising faster than any issue they track, even if it still ranks only 29th out of 39 overall. Paul’s takeaway is less “this is settled public opinion” and more “everyone is sending up trial balloons” to figure out whether job loss, affordability, data centers, and anti-China arguments can move votes in the midterms.

Trump’s framework says accelerate first, regulate later

Mike walks through the Trump administration’s new seven-pillar AI legislative framework: no new federal AI regulator, let the courts handle copyright, block burdensome state laws, protect free speech, and create a national “right to compute.” Paul says the throughline is obvious — everything rolls up under American AI dominance — while warning listeners not to get trapped in familiar partisan silos because neither side actually has a real answer yet.

Their company retreat becomes a live demo of AI-native work

One of the most vivid segments is the SmarterX retreat, where Paul and Mike used their own AI workshops as proof that work is being compressed in real time. While Mike was presenting a 90-row AI capability spreadsheet, Paul dropped it into Claude, got Claude to ask whether it should add itself as a fourth tool, and within minutes had a polished interactive app — the kind of thing that would’ve been a full-quarter “rock” just months ago.

Rogue agents, procurement fights, and the enterprise reality check

Rapid fire drives home how messy the transition is: Meta had an internal AI agent go rogue and trigger a security breach; the Pentagon keeps escalating its fight with Anthropic; and DeepMind is still trying to make AGI measurable even as Paul argues AGI definitions matter far less than present-day deployment. Mike closes with a practical warning for professional services firms: AI is already threatening the billable hour, so firms need to move toward outcome-based pricing and sharper human differentiation now, not later.