AI Learnings - December 31, 2025

December 31, 2025 · 11 sources

Claude Code & Workflows, AI Agents & Orchestration, Vibe Coding

AI Learnings - December 31, 2025

Overview

Discussions spanning Claude Code & Workflows, AI Agents & Orchestration, Vibe Coding, and Models & Capabilities.

Claude Code & Workflows

@ciphergoth: "Liam, I have been a professional programmer for 36 years"
@moreconfetti: "One of my holiday goals was to get my sht together and build myself a proper AI assistant"

AI Agents & Orchestration

@iannuttall: "This is the definition of vibe coding to me"
@bcherny: "@zeroxBigBoss Yes, just ask claude to invoke skill 1, then skill 2, then skill 3, in natural language"
@aphysicist: "millennial gamers are the best prepared generation for agentic work, they've been training for 25 years https://t"
@ctatedev: "My 2025 year-end project: Solving Agents"

Vibe Coding

@matanSF: "This creates an interesting matrix"
@DannyLimanseta: "Vibe coding is probably one of the most satisfying feedback loops for people who enjoy making stuff"

Models & Capabilities

@IntuitMachine: "Google is making progress on their diffusion models"

Other Highlights

@d33v33d0: "Claude can code- but can claude grow"
@alex_prompter: "🚨 RAG is broken and nobody's talking about it"
@manthanguptaa: "How to Use LLM as a Judge (Without Getting Burned)"
@0xSero: "My beloved vllm studio is now open source"
@dorsa_rohani: "LLMs are AMAZING at writing songs in MIDI"

Key Takeaways

Claude Code continues to reshape how developers approach coding

Agent orchestration patterns are maturing with new tools and frameworks

Vibe coding is evolving from meme to legitimate methodology

--- Curated from 15 posts

Sources

Dorsa @dorsa_rohani · Dec 31

LLMs are AMAZING at writing songs in MIDI. Here's a simple chiptune song by Claude https://t.co/m0hfmtsrK8

Alex Prompter @alex_prompter · Dec 31

🚨 RAG is broken and nobody's talking about it. Stanford just exposed the fatal flaw killing every "AI that reads your docs" product. It's called "Semantic Collapse", and it happens the moment your knowledge base hits critical mass. Here's the brutal math (and why your RAG system is already dying):

Ian Nuttall @iannuttall · Dec 31

This is the definition of vibe coding to me Managing the context, looking at agent responses, taking the time to ask the right questions: - is this secure? - how does this work? - can I do {x}? why? why not There's no such thing as non-technical any more!

Aaron Slodov @aphysicist · Dec 31

millennial gamers are the best prepared generation for agentic work, they've been training for 25 years https://t.co/JHsbPQHupk

0xSero @0xSero · Dec 31

My beloved vllm studio is now open source. It's a mess it was built just for me, but I really believe it's super super useful once you have it setup properly. It's a great chat, really good for storing and saving recipes, etc.. if you set it up, when you ask for a model that's not loaded it'll automatically load it up, clear the vram, and use your recipes. Enjoy, please contribute back if you want this to be a world class experience. https://t.co/kdC4kaw4dr

Ian Curtis @XRarchitect · Dec 31

15mb gaussian splat generated from a single Midjourney image running extremely smooth on the web. 2026 will be the year for world models. https://t.co/Lie0VXf91F

Paul Crowley @ciphergoth · Dec 31

Liam, I have been a professional programmer for 36 years. I spent 11 years at Google, where I ended up as a Staff Software Engineer, and now work at Anthropic. I've worked with some incredible people - you might have heard of Jaegeuk Kim or Ted Ts'o - and some ridiculously productive programmers - Eric Biggers, Jeff Sharkey and @jackinwarsaw come to mind as people who seemed to solve problems with code at a truly unearthly rate. At work, I am currently hitting levels of productivity that would put all of them to shame. Not just a rate of making code, but a rate of actually solving problems, that would have been unthinkable two years ago. And it's possible because Claude Code with Opus 4.5 is doing all the heavy lifting; I'm not doing much more than setting direction and reviewing the output. I often have three different sessions going at once, attacking three different aspects of the work I'm doing. Over the holidays I took a break from work Clauding, to do some home Clauding, writing in a few days from scratch a complicated webapp using disparate technologies I had no background in, that would have taken weeks prior to Claude. When I hit problems I just told Claude to debug them and that almost always worked. It also looks great, which is pleasing since not only have I zero CSS skill, I have zero design skill. I'm not out of a job quite yet; there are still some areas where I have better taste than it does, or better instincts. But when you talk about "AI's inability to code", this seems to me to reveal a total disconnect from reality. And this is why I'm urging you to ACTUALLY TRY IT, find out for yourself, and join the rest of us on this Earth.

Matan Grinberg @matanSF · Dec 31

This creates an interesting matrix. • TECHNICAL + LOW-AGENCY: Pearl clutching. In denial of productivity from AI. AI code = slop. Threatened by the non-technical builders entering their domain. • TECHNICAL + HIGH-AGENCY: Benefit from the deep intuitions. Huge head start on efficiency and parallelization. Cavalier enough to consistently throw out priors and adapt. Will be the first to create wonders and move mountains. • NON-TECHNICAL + HIGH-AGENCY: Original "vibe-coder". Free from the chains of the past, but less intuition on underlying systems. Blessing and a curse. New perspectives such as these may be where unique/best innovation comes from. • NON-TECHNICAL + LOW-AGENCY: NGMI

steve caldwell 🎉 @moreconfetti · Dec 31

One of my holiday goals was to get my sh*t together and build myself a proper AI assistant. I spend my day bouncing between multiple Claude Code / Codex sessions and sometimes too many meetings. I've got three kids, two businesses, one wife, and virtually zero time to build systems that make my life better. The data is all there, I just need to bring it together and make sense of it. This week, I stumbled upon this awesome agent workflow writeup by @steipete where he mentions what he's building at https://t.co/7QDWnQGfpM , and I was intrigued. This is a very appealing AI assistant. Seems like exactly what I'm looking for. So I asked Claude Code to set up Clawdis on a headless Mac Mini on my LAN. After I pointed it to some credentials, my Clawdis ("Crawdad" 🦞) quite literally did the rest, setting itself up, all via WhatsApp. I gave it access to my 5 email accounts, calendars, iMessages (via BlueBubbles server), and Granola transcripts. This stuff rarely works the first time. It was awesome. Then, earlier today, we had a real Jarvis moment. I was in my car waiting on my wife to emerge from the store, so I sent Crawdad a voice message. I hadn't configured OpenAI API keys for transcribing my voice, so I asked it to try to install and use whisper.cpp (via WhatsApp), and it *just worked*, and will use this method going forward. Excited to build out more tooling here. I think this is a super important project to watch - seems like it could be the foundation of a unicorn tbh. Thanks for all of your work here Peter - feels like a weight has been lifted for real! 2026 is gonna be lit 🔥

Carlos E. Perez @IntuitMachine · Dec 31

Google is making progress on their diffusion models... It's now as good as a Gemini 2.0 Flash Lite. The writing is on the wall, a majority of language AI use in the future will be diffusion models. https://t.co/GKmBW0EgEe

Chris Tate @ctatedev · Dec 31

My 2025 year-end project: Solving Agents After weeks of stress testing agents, I landed on what a top-tier e2e agent actually looks like: Fast, Durable, Resumable, Secure Benchmarked. Battle-tested. Shipping the best patterns as reusable templates, packages and other OSS soon. https://t.co/2ZZt6uWXQT