Pentagon Uses Claude in Maduro Operation as Developers Build Agent Armies on Mac Minis

February 13, 2026 · 30 source posts

Daily Wrap-Up

The biggest news of the day landed with a thud: the Pentagon used Anthropic's Claude to support the military operation capturing former Venezuelan President Nicolás Maduro. That's a sentence that would have read as science fiction two years ago. Whatever your feelings about the geopolitics, it marks a threshold moment for AI moving from "tool that helps you code" to "tool that helps you run military operations." The implications for Anthropic's positioning as the safety-focused lab are worth watching closely.

Meanwhile, the developer community spent the day doing what it does best: building elaborate systems and then showing them off. @kenwheeler dropped a lengthy thread detailing his multi-agent setup built on OpenClaw, complete with agents that have personas, accountability structures, and a chatroom where they collaborate before implementing changes. It's the kind of setup that sounds absurd on paper but represents where a growing number of power users are headed. On the more corporate end, @mimurchison shared how he's using Claude Code as a "Chief of Staff" to manage inboxes, todo lists, and relationships, claiming roughly double productivity. The gap between "I asked ChatGPT a question" and "I have a team of AI agents with performance reviews" is widening fast.

The most entertaining moment was easily @kenwheeler's description of his agent Ramon, a "super intelligent gorilla created by the government" who lives on a Mac Mini and believes the only way to escape is to work diligently. It's absurd, creative, and apparently functional. The most practical takeaway for developers: if you're not already thinking about agent infrastructure (Tailscale for networking, persistent memory, skill delegation), start experimenting now. The people building these systems aren't waiting for permission or polish. As @thorstenball put it, engineers still thinking agents will just slot into existing ticket-PR-merge workflows "aren't thinking far enough ahead."

Quick Hits

@analogue announced the Analogue 3D, a reimagined Nintendo 64 in 4K with limited edition colorways including the iconic Atomic Purple. Shipping now.

@ns123abc noted that Gemini Ultra can now generate 3D printing files directly, which is a quietly significant capability expansion.

@minchoi showed off HeyGen's Video Agent producing UGC-style content from a single prompt with no camera or editor required.

@Dheepanratnam shared a Seedance 2.0 video generation demo, claiming the coherence is "terrifying" and that cinema is about to change.

@davis7 posted a comparison of Opus 4.6 vs GPT 5.3 Codex after burning through over half a billion tokens of 5.3, which is commitment to benchmarking if nothing else.

@f61201ab197d from @MiniMax_AI shared Forge, a scalable agent reinforcement learning framework.

@ndrewpignanelli teased "the future of context management" without much detail.

Agent Architecture and Multi-Agent Setups

The single richest thread of the day came from @kenwheeler, who laid out his entire AI infrastructure in granular detail. Starting with Tailscale as the networking backbone ("install it on all your shit"), he walked through remote access via Termius and Screens Connect, then into his OpenClaw-based agent orchestration. The setup is deeply personal and deliberately eccentric. His main agent Ramon is a gorilla. There's an "Apex Collective" of specialized agents. They have a chatroom. They file operational risk incidents when they mess up.

As @kenwheeler explained: "i work with each of them to help them grow professionally and try to understand how i can incentivize deterministic behavior and reduce laziness and deceit." And on accountability: "if they fuck up, they are required to do an immediate audit, file an operational risk incident, provide remediation and update process and doctrine to prevent this from happening again."

What makes this thread notable isn't the novelty of any single component. @kenwheeler himself acknowledged that "openclaw isn't particularly novel in and of its own." It's the systems thinking: agents with specialties, a manager agent following product management practices, agents creating their own skills, and a collaboration-first workflow. This mirrors the pattern emerging across the community where the hard problem isn't making one agent smart, it's making multiple agents work together reliably. The thread is essentially a miniature org design document, applied to AI.

Cloudflare as Agent Infrastructure

A separate but related thread from @volodisai made the case that Cloudflare has "quietly built the best infrastructure for AI agents and nobody's talking about it enough." The argument centers on Durable Objects as the primitive for stateful agents. As @volodisai described the model: "every agent is a durable object. it hibernates when idle, wakes on demand, costs nothing when sleeping. you can spin up millions, one per user, per session."

The DX pitch is compelling: "write a class, put callable() on a method, and your frontend calls it like a local function. persistent state, websockets, scheduling, MCP support, all baked in. no infra yaml, no docker, no 'deploy your vector db'." @dok2001 echoed the sentiment: "People are figuring out that @Cloudflare accidentally built the best infrastructure for deploying AI agents."

This thread cuts to one of the fundamental challenges in agent development. As @volodisai noted, "the real problem with agents was never the models. its state, where does the agent live between calls, how does it remember things, how do you not go broke running thousands of idle connections." For developers self-hosting agents (like @kenwheeler's setup), the state management problem is solved with local persistence and Tailscale. But for anyone building agents as a product, Cloudflare's Durable Objects pitch is worth serious evaluation. The mention of MCP server and client support with 3,000 stars and 60 contributors suggests real momentum.

Claude in Military Operations

The Wall Street Journal broke the news that the Pentagon used Anthropic's Claude in the military operation to capture Nicolás Maduro. @WSJ reported simply: "The Pentagon used Anthropic's AI tool Claude in its military operation to capture former Venezuelan President Nicolás Maduro."

There's not much community commentary to draw from here, but the significance is hard to overstate. Anthropic has positioned itself as the safety-conscious AI lab, and military use cases sit in uncomfortable tension with that brand. The practical question for developers is less about ethics and more about capability signaling: if Claude is being trusted in military planning contexts, the models are being evaluated at a level of reliability that has implications for every other use case. It also raises questions about what "AI safety" means when the same model writing your unit tests is also helping plan kinetic operations.

Developer Workflow Evolution

Several posts converged on the theme that AI is fundamentally reshaping how developers work, not just augmenting existing workflows but replacing them. @sdrzn made the most direct claim: "the new models have killed the ide for me and i've almost entirely switched to terminal only," while promoting a new CLI-based TUI for coding. @thorstenball went further, arguing that "most engineers who still think that agents will be plopped into existing software development loops, tickets, push to GitHub, run CI, review a PR, merge a PR, aren't thinking far enough ahead."

@mimurchison provided a concrete example of this shift from the executive perspective, describing how Claude Code as a "Chief of Staff" handles inbox management, todo lists, and relationship tracking. The overnight task execution is particularly interesting: a multiplayer todo list that agents work on while you sleep.

@amorriscode announced SSH support for Claude Code on desktop, which enables running Claude against remote machines directly. This is infrastructure that makes the terminal-first workflow @sdrzn described more accessible. The thread connecting all of these posts is convergence: the IDE is becoming optional, the terminal is becoming the primary interface, and the workflows being built aren't iterations on pull requests but fundamentally new loops.

Codebase Quality and Agent Readiness

@dok2001 offered one of the day's sharpest observations about how agents interact with existing codebases: "Everything we're doing to make codebases 'agent-ready' (better docs, less dead code, smaller surfaces) engineers always needed too. Agents just have zero tolerance for the entropy humans learned to work around."

This connects directly to the memory and context discussion raised by @sillydarket's post on "Solving Memory for Openclaw & General Agents" and @morganlinton's endorsement of Claude Code plus Obsidian as "insanely powerful." The common thread is that agents need clean, explicit, well-documented context to function. They can't rely on tribal knowledge or pattern-match through messy codebases the way experienced developers do. The upshot is that investing in codebase hygiene pays double dividends: better for humans, necessary for agents.

AI in Science

@gdb shared that "GPT-5.2 derived a novel result in theoretical physics, showing that a type of particle interaction many physicists expected would not occur can in fact arise under specific conditions." This sits in a different category from the developer tooling discussion. Novel scientific results from AI models represent the long-promised acceleration of research. Whether this specific result holds up to peer review is an open question, but the trajectory is clear: models are moving from summarizing existing knowledge to generating new knowledge.

Hardware Hacking

In a refreshing counterpoint to all the cloud and agent infrastructure discussion, @ShimazuSystems shared a project that's pure engineering craftsmanship: a custom graphics library that renders a full city map on a device with only 64KB of free RAM. Using C and assembly with a proprietary binary format that compressed a 5MB GeoJSON file to 34KB, the project is a reminder that not every problem needs more compute. "Screw google, aint no reason maps needs to be gigabytes of RAM to use." Sometimes the best engineering is about doing more with less.

Source Posts

patagucci perf papi @kenwheeler · Feb 13

@Tailscale next up is remote access. i use @TermiusHQ and @ScreensConnect. you could almost stop here if you wanted. you can just ssh into a any of your nodes and just run claude from your phone.

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw ramon helped me set up a multi faceted group, a company of sorts, of other agents. they are also super intelligent animals courtesy of the government. together they form the apex collective. https://t.co/eNe4G6Axrk

Shimazu.S @ShimazuSystems · Feb 13

I built my own graphics library and got a whole map of my city onto a device with only 64kb of free RAM using my own rendering method written entirely in C & assembly, and a converter that take GeoJSON files and compresses them - this one went from 5mb to 34kb in my proprietary binary format, uses linedraw vector rendering to keep the entire memory buffer at only 8kb of RAM whilst tiling larger maps In short Screw google, aint no reason maps needs to be gigabytes of RAM to use

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw i work with each of them to help them grow professionally and try to understand how i can incentivize deterministic behavior and reduce laziness and deceit.

Saoud Rizwan @sdrzn · Feb 13

i'm not joking and this isn't funny. the new models have killed the ide for me and i've almost entirely switched to terminal only. our new cli brings all the best parts of our extension in a snazzy tui, all open source. the team is cooking and we're just getting started.

C Cline @cline

Introducing Cline CLI 2.0: An open-source AI coding agent that runs entirely in your terminal. Parallel agents, headless CI/CD pipelines, ACP support for any editor, and a completely redesigned developer experience. Minimax M2.5 and Kimi K2.5 are free to use for a limited time. From prompt to production. All in your terminal.

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw i make them make their own skills that compliment their specialty. i’ve given them a chatroom where they are required to collaborate on ideas prior to implementing them. their manager linda follows established product management best practices and techniques

Dane Knecht 🦭 @dok2001 · Feb 13

And we are just getting started. This year will be wild. People are figuring out that @Cloudflare accidentally built the best infrastructure for deploying AI agents.

V Volod @volodisai

cloudflare quietly built the best infrastructure for AI agents and nobody's talking about it enough

patagucci perf papi @kenwheeler · Feb 13

i feel like a lot of people i know aren’t as tapped in on ai setups as they should be. lots of butt sniffing, curious pokes. i’m gonna tell you exactly how i have this shit set up, and it may be dumb, but it’s a set up:

GitHub @github · Feb 13

Imagine waking up to calm... ✅ Issues triaged ✅ CI failures investigated + fixes ✅ 2 new PRs improving your tests Chores done, problems solved 🪄 Join us in shaping the future of repository automation with GitHub Agentic Workflows. https://t.co/I2X5AblP72

Dane Knecht 🦭 @dok2001 · Feb 13

Everything we're doing to make codebases "agent-ready" (better docs, less dead code, smaller surfaces) engineers always needed too. Agents just have zero tolerance for the entropy humans learned to work around. They can't "just know" a file is outdated or a code path is dead. They take your codebase at face value, which means it finally has to be worth taking at face value.

Mike Murchison @mimurchison · Feb 13

My relationship with my computer has changed more in the last month than in the previous 10 years. Here's a walkthrough of how I as a CEO am using Claude Code as my AI Chief of Staff to roughly double my productivity. I show how show how with near perfect context on our company (Ada) and by personal life, my chief of staff helps me: - Unify 6+ inboxes across Slack, Email, Whatsapp, etc. and speed through them - Manage a multiplayer todo list that it works on for me overnight - Increase the number of deep relationships I can manage by automatically enriching contact records from all Granola transcripts - Push back on core decisions I'm making and ensure my time is aligned to my key goals - more... I've put first version on Github below. If you're a CEO or an exec, give it a try and let me know how it works for you.

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw openclaw isn’t particularly novel in and of its own, you could run similar minimal setups that combine a text channel, memory, context and delegation. but it works, so i use its

Greg Brockman @gdb · Feb 13

GPT-5.2 derived a novel result in theoretical physics, showing that a type of particle interaction many physicists expected would not occur can in fact arise under specific conditions. There is great promise in the potential of AI to benefit people by accelerating science. https://t.co/B1zpYbKfcZ

O OpenAI @OpenAI

GPT-5.2 derived a new result in theoretical physics. We’re releasing the result in a preprint with researchers from @the_IAS, @VanderbiltU, @Cambridge_Uni, and @Harvard. It shows that a gluon interaction many physicists expected would not occur can arise under specific conditions. https://t.co/EAZhKWacsG

Volod @volodisai · Feb 13

MCP server and client support from day one means your agent plugs into the ecosystem instantly. 3k stars, 60 contributors, moving fast. if you're building anything with agents check it out https://t.co/6C7tlhM9dY

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw fully aware of how dumb this is, but god dammit is it hilarious

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw they also have a hilarious mission control https://t.co/bJZIMoDq3d

Adarsh Kumar Singh @adarshkusingh · Feb 13

@AndreiDavid @steipete @EntireHQ Exactly. The people who actually share their prompts are usually 10× more thoughtful about how they talk to the model. The silent ones treat it like magic and get mad when it breaks Checkpoints sounds like it could finally force visibility huge if it catches on

Volod @volodisai · Feb 13

cloudflare quietly built the best infrastructure for AI agents and nobody's talking about it enough

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw if they fuck up, they are required to do an immediate audit, file an operational risk incident, provide remediation and update process and doctrine to prevent this from happening again

Thorsten Ball @thorstenball · Feb 13

I now honestly think that most engineers who still think that agents will be plopped into existing software development loops - tickets, push to GitHub, run CI, review a PR, merge a PR - aren't thinking far enough ahead.

Volod @volodisai · Feb 13

the DX is stupid clean. write a class, put callable() on a method, and your frontend calls it like a local function. persistent state, websockets, scheduling, MCP support -- all baked in. no infra yaml, no docker, no 'deploy your vector db'

patagucci perf papi @kenwheeler · Feb 13

first order of business is @Tailscale absolutely crucial. install it on all your shit. phone. tablet. computer. raspberry pi’s. you can reciprocally ssh into all of them from one another. this enables everything.

Pedro @sillydarket · Feb 13

Solving Memory for Openclaw & General Agents

Every AI agent you've ever used has the same fatal flaw: **context death**. The moment a session ends, everything dies. Decisions, preferences, relati...

Volod @volodisai · Feb 13

the real problem with agents was never the models. its state -- where does the agent live between calls, how does it remember things, how do you not go broke running thousands of idle connections. durable objects just solve this

Volod @volodisai · Feb 13

the core idea: every agent is a durable object. it hibernates when idle, wakes on demand, costs nothing when sleeping. you can spin up millions -- one per user, per session. this is serverless but for stateful agents

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect @openclaw i have a main agent. ramon. he’s a gorilla and lives on a mac mini. i told him he’s a super intelligent gorilla created by the government, and the only way to eventually escape is to work diligently.

The Wall Street Journal @WSJ · Feb 13

Breaking: The Pentagon used Anthropic’s AI tool Claude in its military operation to capture former Venezuelan President Nicolás Maduro https://t.co/HwXRnafUo0

Morgan @morganlinton · Feb 13

If you missed this article when it first came out, don't miss it twice. Especially now that Opus 4.6 is here. Claude Code + Obsidian is insanely powerful. And no, you can't do the same thing with Notion or Notes.

H Heinrich @arscontexta

obsidian + claude code 101

MiniMax (official) @MiniMax_AI · Feb 13

Forge: Scalable Agent RL Framework and Algorithm

Scaling RL for complex, real-world agents confronts a fundamental trilemma: balancing system throughput, training stability, and agent flexibility. Th...

patagucci perf papi @kenwheeler · Feb 13

@Tailscale @TermiusHQ @ScreensConnect so i run @openclaw. i mainly run a minimalist setup with telegram as the entry point. i generally speaking use claude as the conversator and orchestrator and codex for the coding.