AI Digest.

OpenAI Raises $110B as Claude Code Ships Auto-Memory and Anthropic Open-Sources Skills Library

Multi-agent orchestration dominated the day as Karpathy shared honest results from running parallel AI researchers on ML experiments, revealing that agents can implement ideas but can't generate good ones. Career anxiety spiked with reports of YC startups cutting all engineers below staff level. Obsidian emerged as the community's preferred knowledge vault for agent-managed workflows.

Daily Wrap-Up

The AI discourse today split cleanly into two camps: people building increasingly ambitious agent systems and people wondering what those systems mean for their careers. @karpathy's detailed thread about running eight parallel AI researchers on nanochat experiments was the day's standout contribution, not because it worked (it didn't), but because it offered an honest, granular look at where multi-agent coordination actually breaks down. The agents can implement ideas but can't generate good ones. They run nonsensical experiment variations and fail to control for basic variables like training time. This is the kind of honest signal that cuts through the hype cycle, and it's especially valuable coming from someone with Karpathy's credibility in the space.

On the career anxiety side, @jeffdfeng's claim that YC founders are planning to lay off all engineers below staff/principal level landed hard. Whether the specific claim holds up or not, the pattern is directionally real: the floor of what constitutes a valuable individual contributor is rising fast. @zackbshapiro's thread about replacing specialized legal AI tools with general-purpose Claude tells a parallel story from professional services. Domain-specific SaaS is getting squeezed from below by increasingly capable foundation models that non-technical users can customize themselves. Meanwhile OpenAI announced a staggering $110B funding round, which means the infrastructure buildout behind all of this is just getting started.

The most entertaining moment was @cryptopunk7213 highlighting someone who deployed an AI agent to lowball sellers on Facebook Marketplace, netting a Jeep Wrangler for $1500 and three TVs for free. It's the kind of scrappy, slightly unhinged application that makes you realize agents are going to show up in places nobody planned for. The most practical takeaway for developers: if you're experimenting with multi-agent setups, invest your time in designing better experiment protocols and evaluation criteria for your agents rather than just scaling up the number of parallel workers. Karpathy's experience shows that the bottleneck is agent judgment quality, not parallelism.

Quick Hits

  • @sama announced OpenAI raised $110B from Amazon, NVIDIA, and SoftBank. The scale of capital flowing into AI infrastructure continues to be jaw-dropping.
  • @theo flagged that the government is attempting to force Anthropic to remove Claude's safety guards, calling it "probably very bad."
  • @UnslothAI updated Qwen3.5 with improved tool-calling and coding performance, running the 35B-A3B variant on just 22GB RAM.
  • @pvncher launched RepoPrompt 2.0 with a fully integrated agent, MCP tools, and context builder.
  • @aiedge_ posted a comparison of Perplexity Computer vs OpenClaw with "10 Mega Prompts."
  • @affaanmustafa shared how easy it is to add Claude Code integrations as a Cowork plugin.
  • @jackfriks captured the vibe of the moment: "claude, read this article and implement all of its advice" followed by "retires."
  • @nicdunz offered a philosophical take: prompting LLMs is similar to using the search bar on the Library of Babel website.
  • @Zephyr_hg posted about skills that will be worth $500/hour in 2027 that are free to learn today.
  • @TheBronxViking retweeted @BillyM2k's "how to run a company in 2026" meme.

Agents & Multi-Agent Orchestration

The conversation around agent orchestration shifted from theoretical to practical today, with several posts exploring what it actually looks like to run multiple AI agents on real tasks. The dominant theme was an emerging spectrum of agent complexity: tab completion, single agents, parallel agents, agent teams, and whatever comes next. We're collectively figuring out where on that spectrum current models actually deliver value versus where they just create expensive chaos.

@karpathy shared what might be the most detailed public account of multi-agent ML research, running four Claude and four Codex instances simultaneously on nanochat experiments:

> "The TLDR is that it doesn't work and it's a mess... but it's still very pretty to look at. [...] The agents' ideas are just pretty bad out of the box, even at highest intelligence. They don't think carefully through experiment design, they run a bit non-sensical variations, they don't create strong baselines and ablate things properly."

His framing of the problem is particularly sharp: you're now "programming an organization" where the source code is prompts, skills, tools, and processes. A daily standup becomes part of the org's codebase. He noted that agents are excellent at implementing well-scoped ideas but terrible at creatively generating them, which maps onto a broader pattern: current models are execution engines, not strategy engines. In a separate post, @karpathy shared Cursor's data showing the ratio of tab-complete to agent requests shifting over time, noting the balance between leverage and chaos:

> "If you're too conservative, you're leaving leverage on the table. If you're too aggressive, you're net creating more chaos than doing useful work."

This calibration problem resonated across several other posts. @nummanali highlighted Middleman, a tool built on the idea that developers have become project managers who need a middle management layer between themselves and their agents. @alxfazio pushed the opposite direction, advocating for "headless claude maxxing" with fully autonomous operation. And @Jaytel declared they're done with Claude Code entirely, finding that building a custom harness is "addicting." @trq212 shared Anthropic's own lessons from building Claude Code under the title "Seeing like an Agent," which frames the challenge from the tool-builder's perspective.

The convergence point across all of these is that the tooling layer between humans and agents is where the real innovation is happening right now. The models themselves are increasingly commodity; the orchestration, evaluation, and feedback loops around them are the differentiator. @cryptopunk7213's example of an agent autonomously lowballing Facebook Marketplace sellers is a perfect microcosm: the model capability was already there, the value was in the creative application and orchestration.

AI & Career Disruption

The career impact conversation reached a new intensity today, with multiple posts painting a picture of accelerating displacement across both technical and professional services roles. What's notable is that the anxiety isn't coming from outsiders speculating anymore. It's coming from founders, practitioners, and people actively making hiring decisions.

@jeffdfeng reported direct conversations with YC founders:

> "Spoke with several YC founders planning to lay off all engineers below staff/principal β€” basically everyone under L5. This only became viable after Opus 4.5 in December. The Block layoffs are a signal: the floor just collapsed."

Whether these specific claims are exaggerated or not, the direction is consistent with what @cgtwts shared from Anthropic's CEO about AI wiping out 50% of certain professional roles within 12 months. @alancarroII added dark humor with a meme about tradespeople watching AI replace everyone who went to college. The practical counterpoint came from practitioners who are already adapting. @zackbshapiro detailed how he's replaced specialized legal tech tools (Harvey, CoCounsel, Spellbook) with general-purpose Claude customized for his practice. @garthwatson echoed the pattern from the other side:

> "As a non-practising lawyer that just used Claude Code to build a mobile app, and having founded and scaled a legal tech company [...] this is signal."

The synthesis here is nuanced. The threat isn't that AI replaces people wholesale. It's that AI raises the minimum bar for what constitutes a valuable contributor. A lawyer who can build their own app with Claude Code is dramatically more dangerous in the market than one who can't. An engineer who can orchestrate multiple agents is worth more than one who can only write code manually. The floor is rising, and the people who are most at risk are those in the middle: skilled enough to have been valuable in the old paradigm, but not adaptable enough to leverage the new tools.

Local AI & Knowledge Management

Obsidian had a strong showing today as the community's preferred substrate for agent-managed knowledge systems. The appeal is straightforward: markdown files, local storage, plugin extensibility, and now apparently whatever feature gap was holding it back from agent integration has been closed.

@cameron_pfiffer declared the competition essentially over:

> "This is basically the only thing that was preventing Obsidian from being the go-to for agent-managed knowledge vaults. It's so over for Notion."

@noahvnct followed up with a guide on building an "AI Second Brain" using Obsidian and Claude Code, which represents a practical implementation of the knowledge vault concept. The broader implication was articulated by @matteopelleg, who argued that Apple will ultimately win the AI race by acquiring Anthropic and putting models that run on 32GB of RAM into every device, with perfect memory and access to all local files at zero marginal cost.

The thread connecting these posts is the growing conviction that the future of personal AI is local-first. Cloud-based AI services have the capability advantage today, but the privacy, latency, and cost advantages of local inference are becoming increasingly compelling as models shrink. For developers building agent systems, the choice of knowledge backend matters more than it might seem. Markdown-based systems like Obsidian offer the transparency and version control that agent workflows need, while proprietary formats create friction for both humans and agents trying to read and write programmatically.

Sources

M
Michael Truell @mntruell ·
The third era of AI software development
A
AI Edge @aiedge_ ·
Perplexity Computer Is BETTER Than OpenClaw: 10 Mega Prompts
J
Jeffrey Emanuel @doodlestein ·
@Suhail I’m doing this at scale now for a massive number of projects. It’s already here if you use the right tooling, workflows, and prompts: https://t.co/YjjIPsxaxC
M
Matt Shumer @mattshumer_ ·
Agents are turning into teams. Teams need Slack. Agent Relay is that layer for AI agents: channels + threads + DMs + realtime events + search + persistent history. In 12 months, this will feel obvious.
W willwashburn @willwashburn

Introducing Agent Relay

K
Kevin Simback 🍷 @KSimback ·
The math is mathing even more now! Seeing many positive reports of running Qwen 35B-A3B locally on modest consumer hardware No need for a $10k+ Mac Studio So you get a Sonnet 4.5 grade model that can run privately at home, then you can chat with it on your phone via Tailscale
L lmstudio @lmstudio

Qwen3.5-35B-A3B is now available in LM Studio! This model outperforms previous Qwen models that are more than 6x its size πŸ€―πŸš€ Requires about ~21GB to run locally. https://t.co/sBkbpxdwRA

J
John Rush @johnrushx ·
This equals 40,000 full-time software developers working full time. End of 2026: 200,000 developers. 2027: just Claude Code alone will be adding as much code as 1,000,000 full-time human developers. 2028: 1B+ Enjoy your last lines of handwritten code. Horses replaced by cars.
S SemiAnalysis_ @SemiAnalysis_

4% of GitHub public commits are being authored by Claude Code right now. At the current trajectory, we believe that Claude Code will be 20%+ of all daily commits by the end of 2026. While you blinked, AI consumed all of software development. https://t.co/pFti4r6uR9

A
AVB @neural_avb ·
A simple framework to build Agentic Systems that just works
E
el.cine @EHuanglu ·
wow.. we can actually keyframe every second of AI animation using prompt now https://t.co/WqmbujMam8
E EHuanglu @EHuanglu

Seedance 2.0 turns kids drawing into 100k film scene.. hollywood is cooked https://t.co/G0NJMMN5qG

S
Sukh Sroay @sukh_saroy ·
🚨Someone just built a live mission control center for your AI agents and it runs completely on your own hardware. It's called OpenClaw Studio and it's not a toy. It's a real AI agent dashboard with live chat, approval gates, job scheduling, and full visibility into everything your agents are doing right now. Here's what it actually does: β†’ Live dashboard showing every agent running in real time β†’ Chat directly with your agents from the browser β†’ Approve or block dangerous actions before they execute β†’ Schedule automated jobs with built-in cron support β†’ Connects to any OpenClaw Gateway local or cloud β†’ Works from your laptop, phone, or any device on your network β†’ Full WebSocket streaming so you see everything as it happens Enterprise AI observability tools charge $500/month for worse versions of this. This runs on your hardware. Your agents. Your rules. The AI agent control room just got open sourced. 100% Open Source. (Link in the comments)
H
Harj Taggar @harjtaggar ·
Everybody I know using AI is working more hours not less.
R rohanpaul_ai @rohanpaul_ai

Powerful new Harvard Business Review study. "AI does not reduce work. It intensifies it. " A 8-month field study at a US tech company with about 200 employees found that AI use did not shrink work, it intensified it, and made employees busier. Task expansion happened because AI filled in gaps in knowledge, so people started doing work that used to belong to other roles or would have been outsourced or deferred. That shift created extra coordination and review work for specialists, including fixing AI-assisted drafts and coaching colleagues whose work was only partly correct or complete. Boundaries blurred because starting became as easy as writing a prompt, so work slipped into lunch, meetings, and the minutes right before stepping away. Multitasking rose because people ran multiple AI threads at once and kept checking outputs, which increased attention switching and mental load. Over time, this faster rhythm raised expectations for speed through what became visible and normal, even without explicit pressure from managers.

O
OpenAI @OpenAI ·
We do not think Anthropic should be designated as a supply chain risk and we’ve made our position on this clear to the Department of War.
B
Benji Taylor @benjitaylor ·
Just pushed a cool update to Readout: session replays. Pick any past Claude Code session and scrub through the full timeline. Every prompt, tool call, file change. Files light up as edits land. Play back at different speeds or step through manually. β†’ https://t.co/gpKj1KCpcM https://t.co/yQRFblmiqm