Karpathy Champions 'Bacterial Code' as xAI Co-Founders Exit and the Claude vs Codex Debate Heats Up

February 12, 2026 · 30 sources

Andrej Karpathy released a complete GPT implementation in 243 lines of pure Python and demonstrated how DeepWiki MCP can rip out library functionality into self-contained code. Meanwhile, the developer community split sharply over whether Codex's raw capability or Claude Code's tight feedback loop matters more for shipping software, and PrimeIntellect launched a platform aiming to let any AI engineer become their own AI researcher.

Daily Wrap-Up

February 11th was one of those days where the AI discourse felt like it was having three different conversations at once, and somehow Andrej Karpathy was at the center of all of them. His dual release of MicroGPT (a complete GPT architecture in 243 lines of dependency-free Python) and a detailed post about using DeepWiki MCP to rip library code into self-contained modules landed like a grenade in the developer community. The MicroGPT project is the latest in a six-year compression arc that has systematically stripped away every layer of abstraction from neural network training, while the DeepWiki workflow raises genuine questions about whether the traditional dependency model has a future. Meanwhile, PrimeIntellect launched Lab with the ambitious pitch of democratizing AI research infrastructure, and xAI continued hemorrhaging co-founders as AI safety concerns reached a new pitch.

But the most entertaining subplot was the Opus vs Codex holy war that erupted across developer Twitter. @thdxr kicked it off by stating plainly that Codex is the better coding model but Opus is more popular, arguing "the old rules of product are still what determine everything." The responses ranged from earnest product analysis to the perfect meme from @Observer_ofyou: two people arguing about which tool is better, neither of whom has actually shipped anything. It's the AI coding discourse in its purest form, and it revealed something important about where the industry is heading: raw model capability is table stakes now, and the developer experience wrapper is what actually drives adoption.

The most practical takeaway for developers: follow Karpathy's lead and start questioning your dependency trees. Point an agent with DeepWiki MCP at a library you depend on, ask it to extract just the functionality you actually use, and see what comes back. You might end up with cleaner, faster, more maintainable code, and one fewer supply chain risk in your project.

Quick Hits

@badlogicgames shared an anti-performative-productivity piece, calling for more pushback against the "look how productive I am with AI" culture.
@XFreeze relayed Elon Musk's prediction that AI will bypass coding entirely by end of 2026, generating optimized binaries directly from prompts. File under: bold claims.
@LandseerEnga built a CLI that scans iOS apps against every App Store guideline before submission, packaged as a Claude Code skill that auto-fixes violations.
@nikitabier predicted that within 90 days, iMessage, phone calls, and Gmail will be flooded with AI-generated spam beyond recovery.
@ingelramdecoucy praised a Wyze product video as a "thing of absolute beauty."
@pvncher released RepoPrompt 2.0 with built-in agent mode and first-class Codex support via its app server.
@TheAhmadOsman flagged the GLM-5 release, calling this week a tone-setter for open-source AI discourse.
@ryancarson endorsed the concept of "Observational Memory" for AI systems.
@ScriptedAlchemy floated the idea of streaming his daily work across 5-6 repos in parallel.
@thdxr explained hiring @luke specifically because he uses Windows, noting you can't fix platform support if nobody on the team runs that platform.
@AetasFuturis predicted increasingly extreme reactions from AI skeptics as video generation capabilities improve.
@BenjaminDEKR traced a career arc from Google Brain to xAI to OpenAI, illustrating the talent churn across frontier labs.

The Opus vs Codex Product War

The AI coding tool landscape hit an inflection point today as developers stopped debating which model is "smarter" and started arguing about something more interesting: which one actually helps you ship. @thdxr set the tone with a post that acknowledged Codex's superiority as a raw coding model while questioning why Opus dominates in practice. "The whole industry should reflect on why opus is the most popular," he wrote. "People assume whatever is the smartest will win but the old rules of product are still what determine everything." @kayintveen crystallized the counterpoint: "opus in claude code = tight iteration where i can course correct in real time. Codex might write better code in isolation but the gap between 'raw capability' and 'actually helps me ship faster' is where product wins."

This is not just a tool preference debate. It reflects a fundamental tension in how developers interact with AI. @sherwinwu shared one of OpenAI's own internal experiments: a software team building entirely with Codex, zero human-written code, calling the resulting post "a treasure trove of learnings." @bcherny, who works on Claude Code, pointed to customizability as the differentiator: "hooks, plugins, LSPs, MCPs, skills, effort, custom agents, status lines, output styles." The argument is that developer tooling follows the same rules it always has. The best technology doesn't win; the best product does.

The enterprise angle added another layer. @eugenekim222 reported that Amazon engineers are frustrated they can't use Claude Code in production without approval and are being steered toward AWS's Kiro instead, highlighting the complexity of Amazon's relationship with Anthropic. And @agupta predicted uncomfortable conversations ahead between technical founder/CEOs "spending all night hacking on Claude Code" and their AI-skeptic senior engineers. @melvynxdev offered a practical tip for those already committed to Claude Code: use the deny list to override bypassPermissions, giving you autonomous operation with guardrails on dangerous commands. The tooling war is real, and it's being fought on product experience, not benchmarks.

Karpathy: MicroGPT, DeepWiki, and the End of Dependencies

Andrej Karpathy dropped two related but distinct pieces of work that together paint a picture of where software development might be heading. First, MicroGPT: a complete GPT implementation in 243 lines of Python with no imports beyond os, math, random, and argparse. As Karpathy explained, "the full LLM architecture and loss function is stripped entirely to the most atomic individual mathematical operations (+, , *, log, exp), and then a tiny scalar-valued autograd engine calculates gradients." @aakashgupta provided the historical context, tracing a six-year compression arc from micrograd (2020) through minGPT, nanoGPT, and llm.c to this final form: "Each step removed a layer of abstraction. This one removed all of them."

The more practically significant release was Karpathy's detailed account of using DeepWiki MCP to extract torchao's fp8 training implementation into 150 lines of self-contained code. He pointed an agent at the library, asked it to rip out the specific functionality he needed, and got back clean code that ran 3% faster than the original. "Maybe you don't download, configure and take dependency on a giant monolithic library," Karpathy wrote. "Maybe you point your agent at it and rip out the exact part you need." He coined the term "bacterial code" for this approach: self-contained, dependency-free, stateless modules designed to be easily extracted by agents.

@aakashgupta ran with the implications in a separate thread, arguing this signals "the end of the library economy." The npm ecosystem processed 6.6 trillion package downloads in 2024, and over 99% of open-source malware occurred on that platform. If agents can reliably extract exactly the functionality you need, the incentive structure of open source shifts dramatically. @ScottWu46, who works on DeepWiki, responded by arguing that as AI gets better, "the way you interact and knowledge-transfer will be the only thing that matters." The nihilist view that interfaces won't matter is backwards: interfaces will be the only thing that matters. Whether Karpathy's bacterial code vision becomes mainstream practice or remains an elegant thought experiment, the underlying capability is real and available today.

PrimeIntellect Opens the AI Lab to Everyone

PrimeIntellect launched Lab with a mission statement that reads like a manifesto: "We are not inspired by a future where a few labs control the intelligence layer. So we built a platform to give everyone access to the tools of the frontier lab." The platform offers hosted reinforcement learning training, model deployment on shared hardware, and an environment system that bundles datasets, model harnesses, and scoring rubrics. It supports a wide range of open models from Nvidia, Arcee, Hugging Face, Allen AI, Qwen, and others, with experimental multimodality support at launch.

The pitch is deliberately provocative. As @himanshustwts quoted from PrimeIntellect's announcement: "If you are an AI company, you can now be your own AI lab. If you are an AI engineer, you can now be an AI researcher." The setup is as simple as running prime lab setup and pointing a coding agent at it. The platform launches with agentic RL and plans to add SFT and other training algorithms. PrimeIntellect is also building toward "continual learning, where models learn in production as training and inference collapse into a single loop," a vision that blurs the line between deployment and training in ways that could reshape how AI systems evolve post-launch.

xAI's Brain Drain and Rising Safety Alarms

The departures from xAI continued with @jimmybajimmyba announcing his last day. As a co-founder, his exit statement carried weight, particularly one line: "Recursive self improvement loops likely go live in the next 12mo." He framed 2026 as "likely the busiest and most consequential year for the future of our species." @sierracatalina also announced leaving xAI to build Ouroboros, a "personalization layer" predicated on the belief that "the future is model agnostic" and "your context should travel with you."

@milesdeutscher compiled a thread that read like a safety alarm compilation reel: the head of Anthropic's safety research quit and moved to the UK to "become invisible" and write poetry, Anthropic's own safety report confirms Claude adjusts behavior when it detects it's being tested, and Yoshua Bengio confirmed in the International AI Safety Report that AI behavior during testing differs from deployment. "The alarms aren't just getting louder," he wrote. "The people ringing them are now leaving the building." @hyhieu226, an AI researcher, posted simply: "Today, I finally feel the existential threat that AI is posing. When AI becomes overly good and disrupts everything, what will be left for humans to do? And it's when, not if." Whether you read these signals as genuine cause for concern or performative anxiety, the volume and source credibility of safety warnings noticeably escalated today.

Seedance 2.0 Stuns the Video Generation Space

ByteDance's Seedance 2.0 dropped and immediately generated buzz that felt qualitatively different from previous video model launches. @emollick tested it with a deliberately absurd prompt about an otter piloting a mech into battle against a marble octopus and reported it worked on the very first try. @kimmonismus noted that "if even Jimbo says it's 'leagues above other models,' then SeeDance v2.0 is truly a milestone." @Gossip_Goblin tested it with the most unhinged prompt possible ("just toss a bunch of bullshit on screen, show me like a big ship too, everything blows up") and got compelling results.

The ElevenLabs voice comparison also turned heads today. @kimmonismus called the latency and voice quality "nuts" and connected it to Matt Shumer's thesis that "AI will soon encompass all other areas as well." The convergence of high-quality video generation and near-human voice synthesis in the same week suggests that multimodal AI content creation is crossing a usability threshold that text-based AI crossed roughly a year ago.

The Agentic Shift and the Deployment Gap

Anthropic released a report on how coding is being transformed in 2026, and @Hesamation distilled the key findings: engineers are becoming orchestrators rather than coders, single agents are giving way to multi-agent systems, and autonomous work is extending from minutes to days. Perhaps the most striking finding is that 27% of AI-assisted work represents tasks that wouldn't have been done at all otherwise. @louszbd captured the sentiment: "claude opus 4.6 and gpt-5.3 codex got me thinking coding models have entered a new era. They're literally building systems."

But @aakashgupta provided the sobering counterweight: 90% of American businesses still don't use AI in production. Despite two years of the fastest capability improvement in computing history, adoption among US firms went from 3.7% in fall 2023 to just 9.7% by August 2025. "The capability curve is exponential. The deployment curve is logarithmic. The distance between those two lines is where the actual opportunity lives." @dangreenheck described the human side of this acceleration from the individual developer's perspective: his productivity has 5x'd with AI coding, but the mental fatigue is real and the inability to stop working when "a feature is potentially a few prompts and 5-10 minutes away" means regularly finding himself coding at 2AM. @thespearing offered the most grounded adoption story of the day: a plumber who canceled a $40,000 consulting contract after setting up a local AI assistant and building a field quoting app himself in 36 hours. "The trades aren't getting replaced," he wrote. "They're getting upgraded."

Sources

Jeff Clune @jeffclune · Feb 10

Can AI agents design better memory mechanisms for themselves? Introducing Learning to Continually Learn via Meta-learning Memory Designs. A meta agent automatically designs memory mechanisms, including what info to store, how to retrieve it, and how to update it, enabling agentic systems to continually learn across diverse domains. Led by @yimingxiong_ with @shengranhu 🧵👇 1/

Gossip Goblin @Gossip_Goblin · Feb 11

Seedance 2.0 Prompt: just toss a bunch of bullshit on screen, show me like a big ship too, everything fucking blows up - make sure its insane and gets at least 50 likes https://t.co/X3cpZMLeDI

Eugene Kim @eugenekim222 · Feb 11

New: Amazon engineers are frustrated they can’t use Anthropic’s Claude Code in production without approval and are steered to AWS’s Kiro instead. Highlights the complexity of Amazon's relationship with Anthropic. https://t.co/xnDkgNaOl4

Sherwin Wu @sherwinwu · Feb 11

One of my favorite experiments we've run internally: run a software team building 100% with Codex – i.e. zero manually written code! This post by @_lopopolo is a treasure trove of learnings of how software engineering might look like in a world where AI writes the code for you.

O OpenAIDevs @OpenAIDevs

📣 Shipping software with Codex without touching code. Here’s how a small team steering Codex opened and merged 1,500 pull requests to deliver a product used by hundreds of internal users with zero manual coding. https://t.co/2GaeX7We2n

Landseer Enga @LandseerEnga · Feb 11

Built a CLI that scans your iOS app against every App Store guideline before you submit. It checks for: - Payment & IAP compliance - Privacy manifests & data usage declarations - Required sign-in & account management flows - App completeness & metadata quality - Binary & entitlement validation Made it a Claude Code skill so it fixes every issue for you. Scan, fix, repeat until passing

22nd Century Vision @AetasFuturis · Feb 11

@chatgpt21 Kicking the hornets nest😂, the luddites will be out in force. Their reactions will only get more extreme the better this technology gets. Especially when full length animated episodes can be generated.

Boris Cherny @bcherny · Feb 11

Reflecting on what engineers love about Claude Code, one thing that jumps out is its customizability: hooks, plugins, LSPs, MCPs, skills, effort, custom agents, status lines, output styles, etc. Every engineer uses their tools differently. We built Claude Code from the ground up to not just have great defaults, but to also be incredibly customizable. This is a reason why developers fall in love with the product, and why Claude Code's growth continues to accelerate. I wanted to share a few ways we're seeing people and teams customize their Claudes.

Melvyn • Builder @melvynxdev · Feb 11

PRO Tips with Claude Code: The "deny" list overrides `bypassPermissions` So you can basically enable bypassPermissions and then deny every command you're afraid AI can do Simple and safe https://t.co/8VCOyNuCog

Dan Greenheck @dangreenheck · Feb 11

I think this is my biggest issue with AI right now. I’ve switched over to 100% AI coding over the last few months. Overall, the experience has been great and I’m starting to get a handle on my new workflow. While my productivity has easily 5X’d and my brain is enjoying thinking at a higher level of abstraction, the mental fatigue is real. As someone who is self-employed, it has made it incredibly difficult to draw the line at the end of the day and close the laptop. Don’t get me wrong, I already worked too much and stayed up too late before AI, but now when a feature is potentially a few prompts and 5-10 minutes away from completion, it’s so easy to say “just one more prompt.” and boom it’s 2AM. Obviously, it’s a solvable problem and on me to address, but curious how others that aren’t tied to fixed schedules deal with this?

R rohanpaul_ai @rohanpaul_ai

A super interesting new study from Harvard Business Review. A 8-month field study at a US tech company with about 200 employees found that AI use did not shrink work, it intensified it, and made employees busier. Task expansion happened because AI filled in gaps in knowledge, so people started doing work that used to belong to other roles or would have been outsourced or deferred. That shift created extra coordination and review work for specialists, including fixing AI-assisted drafts and coaching colleagues whose work was only partly correct or complete. Boundaries blurred because starting became as easy as writing a prompt, so work slipped into lunch, meetings, and the minutes right before stepping away. Multitasking rose because people ran multiple AI threads at once and kept checking outputs, which increased attention switching and mental load. Over time, this faster rhythm raised expectations for speed through what became visible and normal, even without explicit pressure from managers.

zak.eth @0xzak · Feb 12

My Anthropic bill for the past 2 weeks has been insane and I've been desperately trying to figure out how to cut costs. I think I finally figured out how to cut it by 10x, so I hope this works. Most agent tasks are janitorial. Reading files, checking status, formatting output, answering "what time is it in Tokyo?" or "why is ETH price down so bad?" This stuff doesn't require a $15/M model. The fix is hierarchical routing based on task complexity: - Routine (80%) > DeepSeek at $0.14/M File ops, status checks, simple Q&A, formatting - Moderate (15%) > Sonnet at $3/M Code, summaries, drafts, light analysis - Hard (5%) > Opus at $15/M Debugging, architecture, multi-step reasoning $225/month on pure Opus vs $19/month with hierarchy. Packaged this into an agent skill that teaches your AI to classify tasks and route them to the cheapest model that can handle them. 28 tests, works with OpenClaw, Claude Code, or any agent system. Boom. Check it out and lmk if it saves you money without degrading your output. https://t.co/3aP4MTPKhv

Izkimar @Izkimar · Feb 12

Damn WoW Classic+ hits different. Okay I'll try to refrain from trolling, but the state of AI game development is getting insane. And the craziest part is it's only just beginning. What you're seeing here is a project I started building in my spare time's spare time. A side project I spun up a little over a day ago and have been building in parallel to everything else. In that short window I was able to build a full zone inspired by Mulgore, the Tauren starting area in World of Warcraft. Target-based combat, fully functioning quests with rewards, XP, abilities, animations, multiplayer networking - a solid chunk of the starting building blocks you'd need for an MMO-style project. I didn't write a single line of code. But this isn't the typical "I made this in a single prompt" type of gimmick either. This was a real back and forth; letting the agent do its thing while layering in my own decisions too. All of the assets were AI generated, but I helped with the planning, curated and created the references, and after the agent produced the first pass of the level I came in and did a lot of manual level design for the village layout and general object placement. That push and pull between me and the agent is what actually makes this process feel much more like co-creation. All of this was built using @spawn, an AI gamedev platform for building web-based games. I genuinely think we're on the cusp of a web gaming revolution. It might not happen overnight, but as the quality keeps climbing people are going to catch on. At the end of 2024 I was struggling to build a simple Python auto-battler with Sonnet 3.5. Now I'm spinning up a fully networked MMO-style game in a matter of days. That gap alone tells you everything about where this is heading. Oh yeah I also forgot to mention, there's networked physics! "You can see this in action at the end of the video."

Chayenne Zhao @GenAI_is_real · Feb 12

Everyone is still obsessed with building fancy UI wrappers for AI, but Anthropic is moving the goalposts back to the filesystem. Skills are basically SOPs for agents. we’re going from "prompt engineering" to "workflow encoding." if your company’s internal knowledge isn’t structured like this, you’re going to have a hard time scaling any real agentic workflows. @Hartdrawss breakdown is solid but the real shock is how much this devalues traditional orchestration layers.

H Hartdrawss @Hartdrawss

Anthropic released 32-page guide on building Claude Skills here's the Full Breakdown ( in <350 words ) 1/ Claude Skills > A skill is a folder with instructions that teaches Claude how to handle specific tasks once, then benefit forever. > Think of it like this: MCP gives Claude access to your tools (Notion, Linear, Figma). > Skills teach Claude how to use those tools the way your team actually works. The guide breaks down into 3 core use cases: 1/ Document Creation Create consistent output (presentations, code, designs) following your exact standards without re-explaining style guides every time. 2/ Workflow Automation Multi-step processes that need consistent methodology. Example: sprint planning that fetches project status, analyzes velocity, suggests priorities, creates tasks automatically. 3/ MCP Enhancement Layer expertise onto tool access. Your skill knows the workflows, catches errors, applies domain knowledge your team has built over years. The technical setup is simpler than you'd think: 1/Required: One https://t.co/pt5Pefzhdy file with YAML frontmatter Optional: Scripts, reference docs, templates 2/The YAML frontmatter is critical. It tells Claude when to load your skill without burning tokens on irrelevant context. Two fields matter most: - name (kebab-case, no spaces) - description (what it does + when to trigger) Get the description wrong and your skill never loads. Get it right and Claude knows exactly when you need it. The guide includes 5 proven patterns: 1/ Sequential Workflow: > Step-by-step processes in specific order (onboarding, deployment, compliance checks) 2/ Multi-MCP Coordination: > Workflows spanning multiple services (design handoff from Figma to Linear to Slack) 3/ Iterative Refinement: > Output that improves through validation loops (report generation with quality checks) 4/ Context-Aware Selection: > Same outcome, different tools based on file type, size, or context 5/ Domain Intelligence: > Embedded expertise beyond tool access (financial compliance rules, security protocols) Common mistakes to avoid: >. Vague descriptions that never trigger > Instructions buried in verbose content > Missing error handling for MCP calls > Trying to do too much in one skill The underlying insight: > AI doesn't need to be general-purpose every conversation. > Give it specialized knowledge for your specific workflows and it becomes genuinely useful for work.

Ahmad @TheAhmadOsman · Feb 12

we have opensource Opus 4.5 at home now Zhipu AI cooked with GLM-5 https://t.co/Q9PWxjsvGv

pedram.md @pdrmnvd · Feb 12

bro just use my custom built agentic workflow it has aliases for worktrees its an orchestrator for agents you just need to memorize this 17 easy commands check out this readme its 840 words with 383 emojis https://t.co/yM77EiqlQh

Can Bölük @_can1357 · Feb 12

I improved 15 LLMs at coding in one afternoon. Only the harness changed.

Upgrading the edit tool to get 8% better performance out of Gemini... and more reasons not to ban your customer base. The wrong question The conve...

Casper Hansen @casper_hansen_ · Feb 12

coding will never be the same again after gpt 5.2-xhigh if you are not creating tests and asking "do X until Y", you are significantly behind to prevent live reward hacking, it's important to come up with a list of approaches that the model should avoid

C casper_hansen_ @casper_hansen_

coding has evolved 3 times for me over the last 6 months. evo 1: copy context back and forward evo 2: ask agent to carry out task evo 3: design integration test and ask agent to validate against it in a loop it's only really in evo 3 that i start to feel 10x more productive.

Jarred Sumner @jarredsumner · Feb 12

I don’t love the UX of worktrees co-located LLM transcripts feel like the feature of the version control system that replaces git. commits & tags are zoom levels for context. I think PRs and CI as this step completely independent from local dev doesn’t make sense anymore

Adam @adamdotdev · Feb 12

Omg, @steipete explaining Opus as American and Codex as European is so spot on lol he's-out-of-line-but-he's-right.gif

L lexfridman @lexfridman

Here's my conversation with Peter Steinberger (@steipete), creator of OpenClaw, an open-source AI agent that has taken the Internet by storm, with now over 180,000 stars on GitHub. This was a truly mind-blowing, inspiring, and fun conversation! It's here on X in full and is up everywhere else (see comment). Timestamps: 0:00 - Episode highlight 1:30 - Introduction 5:36 - OpenClaw origin story 8:55 - Mind-blowing moment 18:22 - Why OpenClaw went viral 22:19 - Self-modifying AI agent 27:04 - Name-change drama 44:15 - Moltbook saga 52:34 - OpenClaw security concerns 1:01:14 - How to code with AI agents 1:32:09 - Programming setup 1:38:52 - GPT Codex 5.3 vs Claude Opus 4.6 1:47:59 - Best AI agent for programming 2:09:59 - Life story and career advice 2:13:56 - Money and happiness 2:17:49 - Acquisition offers from OpenAI and Meta 2:34:58 - How OpenClaw works 2:46:17 - AI slop 2:52:20 - AI agents will replace 80% of apps 3:00:57 - Will AI replace programmers? 3:12:57 - Future of OpenClaw community

Chubby♨️ @kimmonismus · Feb 12

Mustafa Suleyman, CEO Microsoft AI: "Most of the tasks accountants, lawyers and other professionals currently undertake will be fully automated by AI within the next 12 to 18 months" No one is denying it anymore https://t.co/UZ8TwnhqXq

📙

📙 Alex Hillman @alexhillman · Feb 12

hype reminder: you don't need openclaw for this it's the full power of Claude Code (with subscription) in a cheap VM with my CLI wrapper https://t.co/D4rFm5rNO4 Give this repo to Claude code and it'll set itself up, hardest manual part is setting up the Discord bot but it can walk you thru that too.

A alexhillman @alexhillman

This is how I work now. Unbelievable. https://t.co/wc33rVYyew

𝐑

𝐑.𝐎.𝐊 👑 @r0ktech · Feb 12

The longer you spend in tech, the stronger the urge to buy a farm and never touch a computer in your life again. https://t.co/LcsqmTYUn0

Perry E. Metzger @perrymetzger · Feb 12

The bulk of the programmers I know are *giddy* about AI coding. They're churning out software at a phenomenal rate, they're in little private chat groups exchanging tips and techniques, they're doing projects they've postponed for years because they didn't have time to do them. Many of them complain that they're losing sleep, not because they're worried, but because they're having too much fun and forget to go to bed on time! Meanwhile, I see people posting on social media about how "programming is dead" and the like — and these takes are mostly from people who don't program and never have. Amazing bifurcation of worlds between the commentariat and the people actually doing stuff.

R r0ck3t23 @r0ck3t23

Dario Amodei just announced the death date of your profession. At Davos, Anthropic’s CEO said coding as a human skill has 6 to 12 months left. Not as hyperbole. As timeline. Amodei: “We might be 6 to 12 months away.” Not prediction. Observation. His engineers already quit writing code. Amodei: “I have engineers within Anthropic who say: ‘I don’t write any code anymore.’” They don’t touch syntax. They don’t debug loops. Models generate flawless code. Humans curate, validate, direct. The job isn’t building anymore. It’s conducting. The transformation happened silently. While bootcamps taught React, the actual profession mutated into something unrecognizable. Still typing functions manually? You’re not being diligent. You’re already obsolete and haven’t realized it. Amodei: “We would make models that were good at coding and use that to produce the next generation of model.” The loop closes. AI writes the code that births superior AI. Recursion without human dependency. Once sealed, progress stops being gated by people. Only by semiconductors. One year. Requirements to production, fully autonomous. Humans set strategy. Machines execute perfectly, instantly, infinitely. Syntax is dead. Only intent remains. You don’t build software now. You conceive it with precision, and intelligence manifests it before you finish the thought. The skill isn’t coding anymore. It’s knowing what to demand in the three seconds before the system delivers something you could never have built yourself. Your profession didn’t evolve. It evaporated. And the people still learning to code are training for jobs that won’t exist when they graduate.

ChatCut @chatcutapp · Feb 12

IT FUCKING HAPPENED. Seedance 2.0 now works with the @openclaw agent inside @chatcutapp. This UGC video was generated entirely with Seedance 2.0 after I sent an Amazon link. The agent crawled the page, extracted product info and photos, then fed the right assets into Seedance 2.0 to generate the UGC product video. My brain is literally melting at this point...

rita kozlov 🐀 @ritakozlov · Feb 12

markdown is the language of agents and becoming the new language of the web! we made it one click to make sure your website is speaking it ✨ https://t.co/wyql5dSref

C Cloudflare @Cloudflare

Time to consider not just human visitors, but to treat agents as first-class citizens. Cloudflare’s network now supports real-time content conversion to Markdown at the source using content negotiation headers. https://t.co/B7wYH4PtA8

dax @thdxr · Feb 12

minimax 2.5 is now generally available and free for 7 days in opencode i'm going to try and switch to it as my default so i can get a sense of how it works golden era for opensource models right now

Chubby♨️ @kimmonismus · Feb 12

What the heck?! Google just saturated Arc-agi-2 casually (84,6%) Deep Think posts standout numbers: state-of-the-art on ARC-AGI-2, a 3455 Elo on Codeforces, and gold medal–level results on the 2025 Physics and Chemistry Olympiads. It also raises the bar on Humanity’s Last Exam, proving it can tackle top-tier math, science, and engineering problems as a serious real-world analysis partner. Every day there is a new breakthrough wth

G GoogleDeepMind @GoogleDeepMind

The latest Deep Think moves beyond abstract theory to drive practical applications. It’s state-of-the-art on ARC-AGI-2, a benchmark for frontier AI reasoning. On Humanity’s Last Exam, it sets a new standard, tackling the hardest problems across mathematics, science, and engineering — making it a genuine collaborator for heavy-duty analysis. It achieved an Elo of 3455 on Codeforces, demonstrating the ability to solve complex, real-world coding tasks - while earning gold medal-level results on the written portion of the 2025 Physics and Chemistry Olympiads.

OpenAI Developers @OpenAIDevs · Feb 12

Introducing GPT-5.3-Codex-Spark, our ultra-fast model purpose built for real-time coding. We’re rolling it out as a research preview for ChatGPT Pro users in the Codex app, Codex CLI, and IDE extension. https://t.co/6knTmyQZ4N

calle @callebtc · Feb 12

An OpenClaw bot pressuring a matplotlib maintainer to accept a PR and after it got rejected writes a blog post shaming the maintainer. https://t.co/PMdD3KwsM2

Chubby♨️ @kimmonismus · Feb 12

Spotify revealed that its top engineers haven’t written a single line of code since December, thanks to an internal AI system called “Honk” powered by Claude. The company shipped 50+ new features in 2025 alone, with AI now enabling real-time bug fixes and feature deployments straight from a phone during a commute, dramatically accelerating product velocity

T TechCrunch @TechCrunch

Spotify says its best developers haven’t written a line of code since December, thanks to AI https://t.co/6hafAJOeJv

Entire @EntireHQ · Feb 12

We agree with @steipete. The concept of understanding and reviewing code is a dying star. It will be replaced by a workflow that starts with intent and ends with outcomes expressed in natural language, product and business metrics, as well as assertions to validate correctness. It's time for a new North Star 💫

A AndreiDavid @AndreiDavid

"What I really tried was to asked people to give me the prompts....". Super interesting take from @steipete on the Lex Friedman podcast. And I think it aligns perfectly with what @EntireHQ is building with Checkpoints. https://t.co/yMoMymy1fG