OpenAI Raises $110B as Claude Code Ships Auto-Memory and Anthropic Open-Sources Skills Library
Multi-agent orchestration dominated the day as Karpathy shared honest results from running parallel AI researchers on ML experiments, revealing that agents can implement ideas but can't generate good ones. Career anxiety spiked with reports of YC startups cutting all engineers below staff level. Obsidian emerged as the community's preferred knowledge vault for agent-managed workflows.
Daily Wrap-Up
The AI discourse today split cleanly into two camps: people building increasingly ambitious agent systems and people wondering what those systems mean for their careers. @karpathy's detailed thread about running eight parallel AI researchers on nanochat experiments was the day's standout contribution, not because it worked (it didn't), but because it offered an honest, granular look at where multi-agent coordination actually breaks down. The agents can implement ideas but can't generate good ones. They run nonsensical experiment variations and fail to control for basic variables like training time. This is the kind of honest signal that cuts through the hype cycle, and it's especially valuable coming from someone with Karpathy's credibility in the space.
On the career anxiety side, @jeffdfeng's claim that YC founders are planning to lay off all engineers below staff/principal level landed hard. Whether the specific claim holds up or not, the pattern is directionally real: the floor of what constitutes a valuable individual contributor is rising fast. @zackbshapiro's thread about replacing specialized legal AI tools with general-purpose Claude tells a parallel story from professional services. Domain-specific SaaS is getting squeezed from below by increasingly capable foundation models that non-technical users can customize themselves. Meanwhile OpenAI announced a staggering $110B funding round, which means the infrastructure buildout behind all of this is just getting started.
The most entertaining moment was @cryptopunk7213 highlighting someone who deployed an AI agent to lowball sellers on Facebook Marketplace, netting a Jeep Wrangler for $1500 and three TVs for free. It's the kind of scrappy, slightly unhinged application that makes you realize agents are going to show up in places nobody planned for. The most practical takeaway for developers: if you're experimenting with multi-agent setups, invest your time in designing better experiment protocols and evaluation criteria for your agents rather than just scaling up the number of parallel workers. Karpathy's experience shows that the bottleneck is agent judgment quality, not parallelism.
Quick Hits
- @sama announced OpenAI raised $110B from Amazon, NVIDIA, and SoftBank. The scale of capital flowing into AI infrastructure continues to be jaw-dropping.
- @theo flagged that the government is attempting to force Anthropic to remove Claude's safety guards, calling it "probably very bad."
- @UnslothAI updated Qwen3.5 with improved tool-calling and coding performance, running the 35B-A3B variant on just 22GB RAM.
- @pvncher launched RepoPrompt 2.0 with a fully integrated agent, MCP tools, and context builder.
- @aiedge_ posted a comparison of Perplexity Computer vs OpenClaw with "10 Mega Prompts."
- @affaanmustafa shared how easy it is to add Claude Code integrations as a Cowork plugin.
- @jackfriks captured the vibe of the moment: "claude, read this article and implement all of its advice" followed by "retires."
- @nicdunz offered a philosophical take: prompting LLMs is similar to using the search bar on the Library of Babel website.
- @Zephyr_hg posted about skills that will be worth $500/hour in 2027 that are free to learn today.
- @TheBronxViking retweeted @BillyM2k's "how to run a company in 2026" meme.
Agents & Multi-Agent Orchestration
The conversation around agent orchestration shifted from theoretical to practical today, with several posts exploring what it actually looks like to run multiple AI agents on real tasks. The dominant theme was an emerging spectrum of agent complexity: tab completion, single agents, parallel agents, agent teams, and whatever comes next. We're collectively figuring out where on that spectrum current models actually deliver value versus where they just create expensive chaos.
@karpathy shared what might be the most detailed public account of multi-agent ML research, running four Claude and four Codex instances simultaneously on nanochat experiments:
> "The TLDR is that it doesn't work and it's a mess... but it's still very pretty to look at. [...] The agents' ideas are just pretty bad out of the box, even at highest intelligence. They don't think carefully through experiment design, they run a bit non-sensical variations, they don't create strong baselines and ablate things properly."
His framing of the problem is particularly sharp: you're now "programming an organization" where the source code is prompts, skills, tools, and processes. A daily standup becomes part of the org's codebase. He noted that agents are excellent at implementing well-scoped ideas but terrible at creatively generating them, which maps onto a broader pattern: current models are execution engines, not strategy engines. In a separate post, @karpathy shared Cursor's data showing the ratio of tab-complete to agent requests shifting over time, noting the balance between leverage and chaos:
> "If you're too conservative, you're leaving leverage on the table. If you're too aggressive, you're net creating more chaos than doing useful work."
This calibration problem resonated across several other posts. @nummanali highlighted Middleman, a tool built on the idea that developers have become project managers who need a middle management layer between themselves and their agents. @alxfazio pushed the opposite direction, advocating for "headless claude maxxing" with fully autonomous operation. And @Jaytel declared they're done with Claude Code entirely, finding that building a custom harness is "addicting." @trq212 shared Anthropic's own lessons from building Claude Code under the title "Seeing like an Agent," which frames the challenge from the tool-builder's perspective.
The convergence point across all of these is that the tooling layer between humans and agents is where the real innovation is happening right now. The models themselves are increasingly commodity; the orchestration, evaluation, and feedback loops around them are the differentiator. @cryptopunk7213's example of an agent autonomously lowballing Facebook Marketplace sellers is a perfect microcosm: the model capability was already there, the value was in the creative application and orchestration.
AI & Career Disruption
The career impact conversation reached a new intensity today, with multiple posts painting a picture of accelerating displacement across both technical and professional services roles. What's notable is that the anxiety isn't coming from outsiders speculating anymore. It's coming from founders, practitioners, and people actively making hiring decisions.
@jeffdfeng reported direct conversations with YC founders:
> "Spoke with several YC founders planning to lay off all engineers below staff/principal β basically everyone under L5. This only became viable after Opus 4.5 in December. The Block layoffs are a signal: the floor just collapsed."
Whether these specific claims are exaggerated or not, the direction is consistent with what @cgtwts shared from Anthropic's CEO about AI wiping out 50% of certain professional roles within 12 months. @alancarroII added dark humor with a meme about tradespeople watching AI replace everyone who went to college. The practical counterpoint came from practitioners who are already adapting. @zackbshapiro detailed how he's replaced specialized legal tech tools (Harvey, CoCounsel, Spellbook) with general-purpose Claude customized for his practice. @garthwatson echoed the pattern from the other side:
> "As a non-practising lawyer that just used Claude Code to build a mobile app, and having founded and scaled a legal tech company [...] this is signal."
The synthesis here is nuanced. The threat isn't that AI replaces people wholesale. It's that AI raises the minimum bar for what constitutes a valuable contributor. A lawyer who can build their own app with Claude Code is dramatically more dangerous in the market than one who can't. An engineer who can orchestrate multiple agents is worth more than one who can only write code manually. The floor is rising, and the people who are most at risk are those in the middle: skilled enough to have been valuable in the old paradigm, but not adaptable enough to leverage the new tools.
Local AI & Knowledge Management
Obsidian had a strong showing today as the community's preferred substrate for agent-managed knowledge systems. The appeal is straightforward: markdown files, local storage, plugin extensibility, and now apparently whatever feature gap was holding it back from agent integration has been closed.
@cameron_pfiffer declared the competition essentially over:
> "This is basically the only thing that was preventing Obsidian from being the go-to for agent-managed knowledge vaults. It's so over for Notion."
@noahvnct followed up with a guide on building an "AI Second Brain" using Obsidian and Claude Code, which represents a practical implementation of the knowledge vault concept. The broader implication was articulated by @matteopelleg, who argued that Apple will ultimately win the AI race by acquiring Anthropic and putting models that run on 32GB of RAM into every device, with perfect memory and access to all local files at zero marginal cost.
The thread connecting these posts is the growing conviction that the future of personal AI is local-first. Cloud-based AI services have the capability advantage today, but the privacy, latency, and cost advantages of local inference are becoming increasingly compelling as models shrink. For developers building agent systems, the choice of knowledge backend matters more than it might seem. Markdown-based systems like Obsidian offer the transparency and version control that agent workflows need, while proprietary formats create friction for both humans and agents trying to read and write programmatically.
Sources
The third era of AI software development
When we started building Cursor a few years ago, most code was written one keystroke at a time. Tab autocomplete changed that and opened the first era...
Perplexity Computer Is BETTER Than OpenClaw: 10 Mega Prompts
Perplexity just took its shot at killing OpenClaw, and I'm blown away. Since release, I've been testing Perplexity Computer non-stop, and its capabil...
Introducing Agent Relay
Qwen3.5-35B-A3B is now available in LM Studio! This model outperforms previous Qwen models that are more than 6x its size π€―π Requires about ~21GB to run locally. https://t.co/sBkbpxdwRA
4% of GitHub public commits are being authored by Claude Code right now. At the current trajectory, we believe that Claude Code will be 20%+ of all daily commits by the end of 2026. While you blinked, AI consumed all of software development. https://t.co/pFti4r6uR9
A simple framework to build Agentic Systems that just works
I've been building agentic systems for a couple of years now. For Youtube, for Open Source, for my SaaS, for my office. Today I want to write this sho...
Seedance 2.0 turns kids drawing into 100k film scene.. hollywood is cooked https://t.co/G0NJMMN5qG
Powerful new Harvard Business Review study. "AI does not reduce work. It intensifies it. " A 8-month field study at a US tech company with about 200 employees found that AI use did not shrink work, it intensified it, and made employees busier. Task expansion happened because AI filled in gaps in knowledge, so people started doing work that used to belong to other roles or would have been outsourced or deferred. That shift created extra coordination and review work for specialists, including fixing AI-assisted drafts and coaching colleagues whose work was only partly correct or complete. Boundaries blurred because starting became as easy as writing a prompt, so work slipped into lunch, meetings, and the minutes right before stepping away. Multitasking rose because people ran multiple AI threads at once and kept checking outputs, which increased attention switching and mental load. Over time, this faster rhythm raised expectations for speed through what became visible and normal, even without explicit pressure from managers.