AI Coding Tools Reach Production Grade as San Francisco Grapples With AI's $20M Wealth Divide

May 16, 2026 · 19 sources

The AI coding assistant ecosystem matured rapidly this week, with new templates, session managers, and diff viewers built entirely by AI. Meanwhile, a viral post from @deedydas laid bare the staggering wealth concentration in San Francisco's AI sector and the existential career anxiety rippling outward.

Daily Wrap-Up

Something shifted in the AI developer tooling space this week, and it's visible in the sheer density of posts about production-grade workflows. We saw templates for structured AI coding sessions, a unified vault for managing multiple AI coding assistants, and a diff viewer built by an AI in 16 minutes. These aren't demos or toy projects anymore. They're real tools, solving real friction points, and in several cases they were built by the very AI assistants they're designed to manage. The snake is eating its own tail, and the tail tastes great.

The undercurrent running through today's feed is a deep anxiety about who benefits from all of this. @deedydas painted a vivid picture of San Francisco's AI wealth divide, estimating that roughly 10,000 people at companies like Anthropic, OpenAI, xAI, and Nvidia have hit retirement-level wealth above $20M, while everyone else, even those making solid salaries, feels like they're running on a treadmill going nowhere. That post resonated because it named something many people are feeling but few articulate: the AI boom is creating winners and losers at a speed that makes previous tech cycles look glacial. @levie's post about forward-deployed engineering reinforced the point, arguing that AI is fundamentally different from traditional software because it constantly evolves, requiring vendors to stay embedded with customers rather than ship and walk away.

The most practical takeaway for developers: invest in learning how to structure AI coding workflows with clear goals, constraints, and verification steps, as demonstrated by @kloss_xyz's /goal templates. The developers who will thrive are not those who use AI assistants casually, but those who build systematic processes around them, treating prompt structure as seriously as code architecture.

Quick Hits

@ClaudeDevs announced that 5-hour and weekly rate limits have been reset for all users, a small but welcome Friday housekeeping update.
@gaganghotra_ flagged that Google published a comprehensive guide on optimizing websites for generative AI features in Search, a must-read for anyone in SEO navigating the shift from traditional rankings to AI-generated answers.
@mackenzieprice shared a curated list of ten third-party AI educational tools for kids, noting that many of the adaptive learning apps used at Alpha School are still internal and not publicly available.
@isnit0 shared photos of his brother-in-law's home GPU farm generating roughly $3,000/month in profit, currently 30% solar-powered with plans to reach 60% by summer through battery additions rather than more panels.
@davepl1968 offered a nostalgic trip back to MS-DOS 6.22 and the meditative ritual of watching Disk Defragmenter crawl across a 540MB hard drive, a reminder that tech nostalgia hits different when you actually worked on the OS.

AI Coding Tools Hit Production Grade

The AI coding assistant ecosystem has crossed an important threshold. We're no longer watching people experiment with prompting tricks. We're watching developers build infrastructure around AI-assisted development the same way they'd build infrastructure around any other critical workflow. The tools are maturing, the workflows are solidifying, and the people building them are treating AI coding not as a novelty but as a core part of their engineering stack.

@kloss_xyz released a set of seven production-grade templates for the /goal command in Codex, Claude Code, and Hermes. The templates cover the full software development lifecycle: ideation, planning, building, refactoring, consolidation, hardening, and migrations. The first three are meant to be used sequentially, while the remaining four are applied as needed. What makes this notable is the structure itself. Each template enforces a discipline of stating a clear goal, defining constraints, creating a verification plan, and specifying stop rules that prevent scope creep. This is the difference between chatting with an AI and engineering with one.

> "use 1-3 in order, 4-7 whenever" - @kloss_xyz

The original /goal framework that inspired these templates is worth understanding. It structures prompts around six elements: GOAL (single measurable outcome), CONTEXT (architecture and assumptions), CONSTRAINTS (what must not change), PLAN (understand before acting), DONE WHEN (verifiable completion state), and VERIFY (tests, builds, rollback plans). This is essentially a mini software development methodology compressed into a prompt format.

Meanwhile, @lawrencecchen introduced cmux Vault, a sidebar pane that unifies session management across Codex, Claude Code, OpenCode, and Pi. Full-text search across all sessions, drag-and-drop into workspaces. The fragmentation problem, where your AI coding history is scattered across multiple tools, just got a solution. And @cnakazawa built Codiff, a local diff viewer, entirely with Codex in 16 minutes. A tool for reviewing AI-generated code, built by AI-generated code.

@mattpocockuk pushed back against a common anti-pattern in AI agent design, arguing that long "skills" (essentially lengthy prompt instructions) are a red flag: hard to audit, hard to edit, and expensive to run. "The shorter the skill, the better IMO," he wrote. This aligns with the broader movement toward modular, composable AI workflows rather than monolithic prompt engineering.

What connects all of these is a shift from using AI coding assistants as conversational partners to treating them as components in a structured engineering pipeline. The developers getting the most value are not those who type the cleverest prompts. They are the ones building systems around the prompts, with clear inputs, outputs, and feedback loops.

San Francisco's $20M AI Wealth Divide

@deedydas wrote what might be the most candid assessment of San Francisco's current psychological state that you'll read this year. His thesis is straightforward: over the past five years, roughly 10,000 people at AI companies have accumulated wealth exceeding $20M, while everyone else, including well-paid software engineers making under $500K, feels like they're falling irreversibly behind.

> "Everyone outside that group feels like they can work their well-paying (but <$500k) job for their whole life and never get there." - @deedydas

The post identifies four distinct groups caught in this distortion field. First, career strategists desperately trying to pick the right path: founder, early employee at the right AI lab, or something else entirely. Second, workers paralyzed by the fear that their skills are being deprecated in real time. Third, middle managers with families who lack the flexibility to pivot and see their roles being hollowed out. And fourth, the winners themselves, who @deedydas notes are often struggling with a profound lack of purpose after going from middle-class salaries to tens of millions almost overnight.

@levie's post on forward-deployed engineering added an important dimension to this conversation. He argued that AI is fundamentally unlike traditional software because it constantly evolves. Models change, capabilities shift, and best practices emerge at a pace that makes it impossible for any single company to keep up on its own.

> "In AI, you're delivering something that is constantly evolving both due to the nature of the new capabilities and best practices that emerge, but also because the underlying models change so much that they can meaningfully change the workflow." - @levie

This creates a compelling argument for vendor-driven AI deployment, where one company learns from thousands of customers and feeds those learnings back into the product. It also implicitly acknowledges that the skills gap is real and widening. If deploying AI effectively requires constant proximity to evolving best practices, then the companies and individuals closest to the source of those practices have an insurmountable advantage.

AI-Powered Trading and Prediction Markets

Two posts today highlighted how AI is reshaping financial markets, one from a deeply technical angle and one from a practical workflow perspective.

@AlterEgo_eth shared details about synthdataco/synth-subnet, a system where AI models compete to generate complete probability distributions of future price movements rather than simple directional predictions. Instead of guessing whether Bitcoin goes up or down, each model produces 1,000 full Monte Carlo simulated price paths for a given time horizon. Quality is evaluated using Continuous Ranked Probability Score, which penalizes both poorly calibrated predictions and distributions that are too vague or overconfident.

> "Instead of trying to guess a single price, AI models generate 1000 full Monte-Carlo simulated price paths. This gives a much more realistic probability distribution for trading on Polymarket." - @AlterEgo_eth

This is a meaningful evolution beyond the typical crypto prediction model. Most market prediction systems output a single number or a binary up/down signal. Outputting a full probability distribution is far more useful for platforms like Polymarket where you're pricing binary outcomes against a market. The approach acknowledges uncertainty rather than pretending it doesn't exist.

@milesdeutscher took a more accessible angle, sharing an AI-powered trading workflow that he described with characteristic understatement as "fucking sauce" and claimed would have let him "print money" if he'd had it years ago. The post links to a full guide on building AI workflows for trading analysis.

Local AI and Model Optimization

Running large language models locally on consumer hardware remains a craft that blends equal parts expertise and obsession. @nicekate8888 documented a 20-day journey to optimize Qwen3.6-27B on a Mac, moving from Unsloth Q5 at 18 tokens per second to MLX 6-bit with DFlash at 22 tok/s, and finally landing on MTPLX 4-bit at 43 tok/s with acceptable quality. That's a 2.4x speedup through quantization alone, a reminder that how you compress a model matters as much as the model itself.

The post also highlights an interesting workflow: using Grok to help configure the optimization process step by step. Using one AI to help you run another AI locally is becoming a standard pattern.

@antirez, the creator of Redis, is working on making LLM evaluation less painful. His observation that "evals take time and are boring" but are "a fundamental validation step of sane LLM inference" captures a real tension in the field. Everyone knows evaluations matter. Almost nobody wants to do them. Building tools that make evals as easy and engaging as possible could remove one of the biggest bottlenecks in responsible AI development.

Agent Observability and Realtime Voice AI

As AI agents become more autonomous, the ability to inspect and understand their behavior becomes critical. @marcklingen noted a neat symmetry: "2025 just look at agent traces. 2026 agents look at agent traces." The observation, referencing a primer on agent tracing by @lotte_verheyden, highlights how quickly the field is moving from humans debugging agent behavior to agents debugging other agents. This recursive pattern, tools building tools, agents monitoring agents, is becoming a defining feature of the current AI landscape.

On the voice AI front, @badlogicgames shared a demonstration of OpenAI's Realtime-2 API running fully in real-time, with the experimenter moving beyond rigid "turn-for-turn" conversations toward more natural, overlapping dialogue. The fact that this is described as a breakthrough in naturalness rather than a technical demo says a lot about how far voice AI has come. The bar is no longer "it works." It's "it feels like talking to someone."

Image-to-3D in Five Minutes

@CopyRebeldia highlighted a GitHub repository called image-blaster that converts any photograph into an explorable 3D world in approximately five minutes. The output includes physical meshes, background splats, and ambient audio. The post, written in Spanish, noted with a mix of awe and melancholy that people who spent a decade learning Blender were watching in silence.

> "Una imagen entra. Un mundo sale. Cinco minutos." (An image enters. A world comes out. Five minutes.) - @CopyRebeldia

This is the kind of tool that compresses years of specialized skill into a single command. Whether that's liberating or devastating depends entirely on which side of the compression you're standing. It's also a sign that 3D generation is following the same trajectory as image generation: from research curiosity to open-source tool to "why would you do this manually?" in what feels like months.

Sources

CopyRebeldia @CopyRebeldia · May 15

Hoy una industria entera dejó de tener sentido. Un tío publicó en GitHub un repo que convierte cualquier foto en un mundo 3D explorable: meshes con físicas, splat del fondo, audio ambiente. Todo. Una imagen entra. Un mundo sale. Cinco minutos. La gente que se pasó diez años aprendiendo Blender lleva todo el día mirando esto en silencio. Se llama image-blaster.

Rahul @sairahul1 · May 15

Karpathy just described what hiring looks like in 2026: "Build a large project with Claude Code — like a Twitter clone. Make it secure. Have real agents using the platform doing stuff. The interviewer uses parallel agents trying to break in to verify security." One person. Multiple agents. Shipping and defending production code simultaneously. This is not a future job description. This is happening right now. The founders who get there first are not the smartest ones in the room. They are the ones who stopped doing everything themselves and built agents to do it for them. Here is the complete playbook — 13 agents, exact prompts, 90-day build plan ↓ Read this before your competition does.

S sairahul1 @sairahul1

How to Build a Team of AI Agents That Run Your Business While You Sleep — The Complete Playbook

Joe Reeve - 🇬🇧/acc @isnit0 · May 15

My brother-in-law runs his own data center/GPU farm at home. Currently makes him ~$3k/month profit. 30% solar powered. Aiming to get to 60% by end of this summer which will increase profits significantly (mostly down to adding batteries, not panels), then will buy more GPUs. He sells the compute via some 3rd party service that takes a cut. Apparently a lot of ML/rendering workloads. You can just do stuff!

Lawrence Chen @lawrencecchen · May 15

Introducing cmux Vault: cmux now has a right sidebar with a vault pane where you can see every single Codex, Claude Code, OpenCode, and Pi session You can search through all sessions with full-text search, and drag them directly into your workspace Out in v0.64+ https://t.co/PwkqlmbcRu

antirez @antirez · May 15

Iterating... Evals take time and are boring: but are a fundamental validation step of sane LLM inference. Let's try to make them as easy and fun to run as possible. https://t.co/CFCXANDdtz

Alter Ego @AlterEgo_eth · May 15

I found a GitHub repository with the most comprehensive Crypto UP DOWN market prediction system on Polymarket Instead of trying to guess a single price, AI models generate 1000 full Monte-Carlo simulated price paths This gives a much more realistic probability distribution for trading on Polymarket synthdataco/synth-subnet is a subnet where AI models don’t make simple directional predictions. Instead, they compete to create complete probability distributions of future price movement Repo: https://t.co/s1J9eWcxFx > How it works Validators send a task to the miner with the following parameters: • asset (BTC, ETH, SOL, etc.) start_time • time_increment (usually 5 minutes) • time_horizon (1 hour or 24 hours) • num_simulations (usually 1000) The miner has until the start_time to return 1000 complete price paths - not a single number, but a full trajectory of price movement > Quality Evaluation They use CRPS (Continuous Ranked Probability Score) - one of the best metrics for evaluating probabilistic forecasts. It penalizes both poor calibration and distributions that are too vague or too overconfident If you’re building a trading bot, probability-based strategies, or AI agents for Polymarket, then @SynthdataCo have created one of the most interesting and technically solid repositories in this space right now

Christoph Nakazawa @cnakazawa · May 15

Codiff: A beautiful, extremely fast local diff viewer I review SO MUCH code locally these days. I asked Codex to build it using https://t.co/4P6iuEJ4sq and https://t.co/FhMJ3wfsa6. Thanks @amadeus and @fat. Amazing software. It took 16 minutes to build this. It's amazing. https://t.co/ofxTYTM9OG

Matt Pocock @mattpocockuk · May 15

Long skills are such a red flag to me - Hard to audit (and therefore, trust) - Hard to edit (more text, harder to maintain) - Expensive to run (more text, more tokens) The shorter the skill, the better IMO

nicekate @nicekate8888 · May 15

最近二十天我都在折腾一件事——怎么让 Qwen3.6-27B 在我的 Mac 上跑得又快又好。一开始我用 Unsloth Q5，18 tok/s，风扇呼啦呼啦响。后来换成 MLX 6bit + DFlash，提到 22 tok/s，还是不够快。直到我遇到了 MTPLX 4bit，43 tok/s，质量不错。完整视频：🧵 视频里有完整对比、编码任务实测、写作推理测试，还分享了我和 Grok 一步步搞配置的全过程

Marc Klingen @marcklingen · May 15

2025 just look at agent traces 2026 agents look at agent traces great intro by @lotte_verheyden as part of the new https://t.co/ffHZbLdSSo on agent tracing, why it matters, and how to get it right

L lotte_verheyden @lotte_verheyden

A primer on tracing for LLM applications

Gagan Ghotra @gaganghotra_ · May 15

🚨 JUST IN - Google published a long piece about "Optimizing your website for generative AI features on Google Search" 👀 A lot in it https://t.co/22t75EtwUH 🧵 https://t.co/dZm8qbyEjO

ClaudeDevs @ClaudeDevs · May 15

Happy Friday! We've reset everyone's 5-hour and weekly rate limits.

MacKenzie Price @mackenzieprice · May 15

Two questions I get all the time: "What educational AI tools would you recommend for my kid?" "What adaptive apps does Alpha use?" Many of the apps we've built ourselves aren't publicly accessible yet. Here are ten third-party ones I do recommend.

Miles Deutscher @milesdeutscher · May 15

This is f*cking sauce. This AI workflow will completely revolutionize how you trade. If I had this years ago, I literally would've printed money. In the right hands, this article is dangerous: https://t.co/pXJxZt0bKW

T TraderMorin @TraderMorin

The First AI Workflow Every Trader Should Build (FULL GUIDE)

Mario Zechner @badlogicgames · May 15

RT @Jaytel: This is fully realtime — not sped up. I’ve been experimenting with Realtime-2 and trying to move beyond “turn-for-turn” conver…

klöss @kloss_xyz · May 16

people loved my Codex /goal share so I built 7 production grade templates covering use cases 1. Ideation/Interrogation 2. Planning & Documentation 3. Build & Implementation 4. Refactoring/Restructuring 5. Consolidation 6. Hardening 7. Migrations use 1-3 in order, 4-7 whenever https://t.co/tl4ypi74rz

K kloss_xyz @kloss_xyz

/goal is the best command in Codex, Claude Code, and Hermes right now. And most are using it wrong. They write "make no mistakes". And pray. Here's how to structure yours for a mission, to rank your uncertainties before acting, to kill scope creep, and to close every loop other prompts leave open. /goal prompt [structure below] GOAL: <single clear measurable outcome; one mission only> CONTEXT: <repo/files/architecture/current state> <known assumptions, dependencies, and relevant prior decisions> CONSTRAINTS: <what must not change> <required standards/patterns> <forbidden files/actions if any> PRIORITY: (optional) 1. <highest priority> 2. <secondary priority> 3. <tertiary priority> PLAN: <understand first, then act> <restate understanding before executing non-trivial changes> <prefer minimal sufficient changes over broad rewrites> DONE WHEN: <verifiable completion state> <expected behavior preserved or improved> VERIFY: <tests/build/lint/typecheck/manual validation> <state what could not be verified and why> <include rollback/containment plan for destructive or high-risk changes> OUTPUT: <concise summary/docs/audit/results> <changed files, key decisions, risks, and follow-ups> STOP RULES: <halt on high-impact ambiguity or risk; do not invent architecture, behavior, or requirements> <surface uncertainties together with ranked highest-confidence proposals before acting; not open-ended clarification questions> <do not continue expanding scope after the goal is satisfied>

Deedy @deedydas · May 16

The vibes in SF feel pretty frenetic right now. The divide in outcomes is the worst I've ever seen. Over the last 5yrs, a group of ~10k people - employees at Anthropic, OpenAI, xAI, Nvidia, Meta TBD, founders - have hit retirement wealth of well above $20M (back of the envelope AI estimation). Everyone outside that group feels like they can work their well-paying (but <$500k) job for their whole life and never get there. Worse yet, layoffs are in full swing. Many software engineers feel like their life's skill is no longer useful. The day to day role of most jobs has changed overnight with AI. As a result, 1. The corporate ladder looks like the wrong building to climb. Everyone's trying to align with a new set of career "paths": should I be a founder? Is it too late to join Anthropic / OpenAI? should I get into AI? what company stock will 10x next? People are demanding higher salaries and switching jobs more and more. 2. There’s a deep malaise about work (and its future). Why even work at all for “peanuts”? Will my job even exist in a few years? Many feel helpless. You hear the “permanent underclass” conversation a lot, esp from young people. It's hard to focus on doing good work when you think "man, if I joined Anthropic 2yrs ago, I could retire" 3. The mid to late middle managers feel paralyzed. Many have families and don't feel like they have the energy or network to just "start a company". They don't particularly have any AI skills. They see the writing on the wall: middle management is being hollowed out in many companies. 4. The rich aren’t particularly happy either. No one is shedding tears for them (and rightfully so). But those who have "made it" experience a profound lack of purpose too. Some have gone from <$150k to >$50M in a few years with no ramp. It flips your life plans upside down. For some, comparison is the thief of joy. For some, they escape to NYC to "live life". For others still, they start companies "just cuz", often to win status points. They never imagined that by age 30, they'd be set. I once asked a post-economic founder friend why they didn't just sell the co and they said "and do what? right now, everyone wants to talk to me. if i sell, I will only have money." I understand that many reading this scoff at the champagne problems of the valley. Society is warped in this tech bubble. What is often well-off anywhere else in the world is bang average here. Unlike many other places, tenure, intelligence and hard work can be loosely correlated with outcomes in the Bay. Living through a societally transformative gold rush in that environment can be paralyzing. "Am I in the right place? Should I move? Is there time still left? Am I gonna make it?" It psychologically torments many who have moved here in search of "success". Ironically, a frequent side effect of this torment is to spin up the very products making everyone rich in hopes that you too can vibecode your path to economic enlightenment.

Dave W Plummer @davepl1968 · May 16

Back in the early 90s, before the Internet, we had "Defrag and Chill". You'd start Disk Defragmenter on your 540MB hard drive, dim the lights, crack open a Surge, and just vibe while the little blue bars crawled across the screen like they were solving world peace. Forty-five minutes of pure, unfiltered anticipation. No notifications. No algorithms. Just the two of you, the gentle grinding of the hard drive, and the sacred promise that your Solitaire games were about to feel 3% snappier. This is MS_DOS 6.22, which I worked on, but I honestly have no idea who wrote defrag. Iconic utility though!

Aaron Levie @levie · May 16

I’m fully forward deployed engineering pilled specifically because AI simply is not the same as software. In software, you deliver a stable piece of technology to a customer and they adopt it and that’s that (extreme over simplification). In AI, you’re delivering something that is constantly evolving both due to the nature of the new capabilities and best practices that emerge, but also because the underlying models change so much that they can meaningfully change the workflow as a result of their upgrades. For this reason it’s far more logical that one vendor can share best practices across thousands of companies more efficiently than every single company can learn and manage these best practices themselves. Further, the learnings from those customers should go right back into the core product as a result. As we go from chat systems to anyone can relatively easily adopt to agentic systems that require more meaningful efforts to manage and update, the FDE model (or equivalent) essentially becomes a core competency for anyone deploying AI at scale.

Y ypatil125 @ypatil125

The real power of forward deployed engineering has always been putting strong technical people directly alongside the operators who own the outcome. That proximity forces the work to solve the actual problem instead of some sanitized version of it. In the AI era this principle has become even more valuable. Agents can now sit inside real workflows and improve from actual decisions, which means the highest-leverage work is extracting the tacit knowledge that lives with subject matter experts, building evaluations that reflect how things actually break, and closing the production feedback loop so agents get better from real outcomes.