AI Digest.

Uber Burns Through Annual AI Budget in One Quarter as DeepSeek v4 Undercuts Anthropic on Price

Enterprise AI hit a cost reckoning with Uber's CEO admitting they blew through their 2026 AI budget in a single quarter, while Lindy switched 100% of traffic from Anthropic to DeepSeek v4 and actually saw performance gains. Meanwhile, Meta launched Business Agent for small businesses, Thrive Capital put $1 billion into AI-powered accounting roll-ups, and Google released Gemma 4 12B as an open-source multimodal model that runs on a single consumer GPU.

Daily Wrap-Up

The enterprise AI cost narrative reached an inflection point today, and it wasn't the kind the big model providers wanted to see. Uber CEO Dara Khosrowshahi went on record saying the company burned through its entire 2026 AI budget in one quarter, forcing a rethink on headcount and a pivot toward cheaper models for production workloads. This is a $150 billion company with world-class engineering discovering that AI adoption costs scale far faster than anyone modeled. Then Flo Crivello at Lindy dropped the news that they've switched all traffic from Anthropic to DeepSeek v4, saving millions while seeing performance improvements. When your customers can switch to a competitor and get better results for less money, the pricing power narrative starts looking wobbly.

The product launches were equally telling about where AI is actually landing. Meta introduced Business Agent to bring conversational AI to small businesses out of the box. Thrive Capital announced a $1 billion bet on buying accounting firms and rebuilding them around AI. Eric Glyman launched Stack, an AI operating system for accounting firms. Three independent signals pointing at the same conclusion: the real money in AI isn't in selling to developers who read benchmark papers. It's in replacing the back-office drudgery that keeps small businesses small.

The most practical takeaway for developers: start building for cost efficiency now, not later. Uber's approach of using expensive models for exploration and cheaper ones for production is becoming the standard pattern. If your architecture assumes a single premium provider, you're building on sand. Design your pipelines to be model-agnostic from day one, because your finance team will force you to switch anyway.

Quick Hits

  • @badlogicgames is eyeing Plannotator 0.19.27 for a major refactoring project, which now supports Kiro and integrates with Glimpse for a semi-standalone coding experience
  • @T_Zahil raises the question many developers are quietly asking: why use Hermes when you already have Codex and Claude in your workflow?
  • @CostHawkAI piggybacks on Uber's budget overshoot to pitch their AI cost tracking tool, proving every bubble spawns a compliance startup
  • @DarioCpx backs @edzitron's takedown of OpenAI's cost commentary, calling it brave given the industry's groupthink problem
  • @elonmusk responds to Fei-Fei Li's world models thesis with a note about Hadamard transforms and image-space thinking, a rare semi-technical comment from the X owner
  • @demian_ai publishes a deep analysis on AI's photonics bottleneck, arguing that data centers should be understood as communication systems first and compute systems second

Enterprise AI Hits the Cost Wall

Dara Khosrowshahi's candid remarks about Uber's AI spending should be required reading for anyone building or buying AI products. "We blew through our AI budget in a quarter, for the whole year. It is forcing us to adjust," he told @patrick_oshag. The adjustment involves metering headcount increases because engineers are getting more efficient, but that efficiency carries a significant cost. Khosrowshahi also revealed a clear two-phase strategy: use expensive frontier models for exploration, then switch to cheaper or open source models once experiences scale.

Brandon Carl provided the enterprise reality check that explains why costs spiral even when the AI technically works. He laid out the "Seven Gates of Software Hell" that every enterprise AI deployment must pass through: data controls, data quality, security and controls, SLAs, vendor risk, legal and procurement, and model governance. Each gate adds months to deployment timelines. His sharpest observation was about the human tension underneath it all: "While you've been working through the 7 Gates of Hell you've had to manage a team of workers you know you're going to fire to justify the AI spend." Ed Zitron was characteristically blunt about what this means for the providers, calling OpenAI "absolutely cooked" for acknowledging customer cost concerns four years and $122 billion into the bubble.

AI Agents Get Serious About Context and Control

The agent ecosystem matured noticeably this week, with multiple teams shipping infrastructure that goes beyond chat-and-response patterns. @tylbar and the Mastra team released Agent Signals, a new context engineering primitive that enables multiplayer steering, dynamic cacheable system prompts, and automatic behavior guidance. Cacheable system prompts alone address one of the biggest cost and latency pain points teams face when running agents at scale.

@thorstenball observed the trend with characteristic brevity: "Agents, everywhere." He was commenting on Amp's rebuilt UI that lets users watch and drive agents across web, mobile, and CLI simultaneously. The emphasis on observability and control is telling. Building agents is no longer the hard part. Knowing what they're doing, correcting them mid-task, and managing multiple agents in parallel is where the engineering challenge has moved.

@alexhillman shared an open-source learning skill that discovers its own context from session files, with progress indicators and visual recaps in solo or co-learner modes. "It rules so hard," he wrote. @HuggingModels highlighted Mercury Agent, which keeps all memory local in SQLite, runs as a daemon, and asks permission before write operations. The community is converging on agents that are stateful, observable, and safe by default.

The Model Price War Heats Up

Flo Crivello's announcement that Lindy switched entirely to DeepSeek v4 from Anthropic models is the kind of customer testimonial that should keep premium providers up at night. "Saves us millions of $ and we're actually seeing an increase in performance on many core use cases," @Altimor wrote. This isn't a side experiment. It's 100% of production traffic for a significant AI product. If DeepSeek delivers comparable results at a fraction of the cost, the premium model market has a serious structural problem.

The open source ecosystem keeps strengthening the price pressure from below. Google released Gemma 4 12B, a unified multimodal model that processes text, image, video, and audio natively under an Apache 2.0 license with 256k context. @sudoingX noted it runs on a single 3090 at bf16, roughly 24GB, and is benchmarking it against Qwen 3.6 27B dense for the consumer GPU crown. On the Nvidia side, @fujikanaeda revealed that his acquired team contributed to Nemotron models and released tools including NeMo Data Designer, NeMo Anonymizer, and OpenShell. Every new open release narrows the gap between premium APIs and self-hosted alternatives.

AI Roll-Ups Target Professional Services

Joshua Kushner's Thrive Capital is placing a $1 billion bet that AI plus permanent capital can run professional services firms better than the people who built them. The vehicle is a company called Current, which acquires majority stakes in established CPA firms and re-engineers their back offices with AI. @NikMilanovic reported that in-house models are hitting 98% accuracy on data entry at the first test firm, Larson Gross, a regional practice Current acquired in 2025. He notes the important caveat: data entry is the high-volume floor of accounting, not the judgment work clients actually pay for. The hold strategy is modeled after Berkshire Hathaway: buy, hold, and let local partners keep meaningful equity.

Eric Glyman's Stack launch maps directly onto the same thesis. Stack is an AI operating system for accounting firms that learns a firm's process, runs the close, and posts journals, all fully auditable. @eglyman called it "the biggest shift in accounting since the spreadsheet." Between Thrive's acquisition engine and products like Stack, accounting is becoming the first real test case for whether AI can transform an entire professional services vertical end to end. If the template works, law firms, insurance agencies, and consulting shops are next in line.

The Engineering Workflow is Evolving Fast

Hiten Shah captured a shift that many developers are feeling but few have articulated this cleanly. The tools are useful, but the workflow is the part worth studying. "Ideas become plans. Plans become durable context. Agents run in parallel. Voice replaces typing. Notes become memory. Skills turn repeated work into leverage," @hnshah wrote. The key insight is about where the human role is migrating: closer to judgment. You steer, react, redirect, and decide what is good enough to keep. Peter Steinberger's MS Build talk, titled "Build the thing that builds the thing," pushes this further up the chain. When your tools can build other tools, the job becomes less about implementation and more about specification.

Claude's Internal Analytics and Anthropic's Security Research

Anthropic's data team published details on how they automated 95% of business analytics queries using Claude. @_catwu shared that the blog post covers their approach to evals, ablations, and online validation when building agents for data analysis. It's a notable signal because it's Anthropic eating its own cooking at scale. If nearly all of your analytics queries can be automated, the implications for business intelligence teams are hard to ignore. In a separate thread, @AnthropicAI shared research examining 832 malicious accounts and mapping their activity onto established security frameworks, testing how well traditional cyberdefense techniques hold up against AI-enabled attacks.

Meta Brings AI to Main Street

@fivepointscap made a bold call that Meta Business Agent is the company's ChatGPT moment. The product gives small businesses an always-on AI that handles customer questions, product recommendations, booking, and sales. What makes this different from the thousands of AI chatbot products already flooding the market is distribution and simplicity. Mark Zuckerberg framed it: "A clothing shop in Birmingham or a bakery in Sao Paulo can offer the same always-on, highly personalized experience as a major brand." A restaurant owner doesn't want to learn prompt engineering or hire a consultant. Meta Business Agents just work. If the execution matches the promise, having billions of users on WhatsApp, Instagram, and Messenger makes Meta the default AI interface for small business globally.

Sources

C
CostHawkAi @CostHawkAI ·
Uber burned through their entire 2026 AI budget in four months. Their board is not happy. Half the engineering teams in this country are one quarter away from the same problem. CostHawk is the answer. Real usage. Real output. Receipts your board can't argue with.
H
Hiten Shah @hnshah ·
This is one of the clearest windows into how engineering is changing right now. The tools are useful. The workflow is the part worth studying. Ideas become plans. Plans become durable context. Agents run in parallel. Voice replaces typing. Notes become memory. Skills turn repeated work into leverage. The human job moves closer to judgment. You steer, react, redirect, and decide what is good enough to keep.
M mvanhorn @mvanhorn

https://t.co/95lFnAyw0e

E
Eric Glyman @eglyman ·
Introducing Stack. The AI operating system that lets accounting firms take on more clients without hiring. Learns your firm's process, runs the close, posts the journals. Fully auditable. We’re living through the biggest shift in accounting since the spreadsheet. https://t.co/L94QkFoNEW
T
Thomas Sanlis 🥐 @T_Zahil ·
Please someone explain to me why should I use Hermes if I already use Codex, Claude etc What could I do with it?
F
Five Points Capital @fivepointscap ·
$META Business Agent is their ChatGPT moment. I don’t think even bulls understand how big this is. “Now, a clothing shop in Birmingham or a bakery in Sao Paulo can offer the same always-on, highly personalized experience as a major brand.” - Zuck The biggest problem with AI right now is usability. If you’re a restaurant owner, you’re too busy to learn how to setup AI agents. And you don’t want some “AI consultant” coming in to charge you $20k for something you’re not sure will even work. Meta Business Agents will just work. Like an iPhone. That convenience and simplicity is what small business owners desperately want.
M MetaNewsroom @MetaNewsroom

Introducing Meta Business Agent: AI that lets businesses show up for their customers as if they had an infinite team behind them answering questions, making product recommendations, booking appointments, closing sales, and more. https://t.co/wCFU7OWXQv

N
Nik @NikMilanovic ·
🚨 BREAKING: Joshua Kushner's @ThriveCapital is putting $1 billion into buying local accounting firms and rebuilding them around AI. The acquisition arm is a company called Current. The pitch to a decades-old CPA firm: sell us a majority stake, keep a meaningful piece for yourselves, and we'll re-engineer the back office with AI. The ownership model is the part worth a look: → Traditional PE buys to sell inside a fixed window → Thrive plans to hold for the long run, the way Berkshire Hathaway does → Local partners keep real, meaningful stakes Patient capital, pointed at a fragmented, unglamorous industry. The proof point so far is Larson Gross – one accountant, one office in Bellingham, WA in 1949, grown into a regional firm with five offices and 200 employees. In 2025 its partners sold control to Current. @Forbes reports the in-house models are hitting up to 98% accuracy on data entry. Worth being precise, though – data entry is the high-volume floor of accounting, not the judgment work clients actually pay for. The skepticism is fair, too. AI roll-ups have been hyped for years and mostly underdelivered. The gap between the pitch and the operating reality is still wide. The bet underneath all of this: that permanent capital plus AI can run a professional-services firm better than the people who spent decades building it. If it works in accounting, the same template is waiting for law, insurance, and consulting. A billion-dollar wager on the back office of American business.
P
Patrick OShaughnessy @patrick_oshag ·
Dara (CEO of Uber) on their AI spend: "We blew through our AI budget in a quarter, for the whole year. It is forcing us to adjust. We are going to meter headcount increases because to the extent that my engineers are getting much more efficient, their throughput is increasing. There's a cost to that, and it's a significant cost. AI adoption has been occurring in all parts of the business –– whether it's engineers and how they scope projects, how they build, debugging, platform migrations. I'm pushing the teams to fundamentally use the power of AI to rebuild systems and processes from the bottoms up. I do think it's a combination for us right now of encouraging adoption, but then driving efficiency. We're using the more expensive models to explore. Once we scale some of these experiences, we'll look to bring in more efficient models that are more efficient on a token basis or are open source."
P patrick_oshag @patrick_oshag

My conversation with @dkhos, CEO of Uber. Dara took over in 2017, when Uber was losing roughly $4.5B a year. Today the company generates $10B in free cash flow and is worth about $150B. We discuss: - How Daniel Ek convinced him to take the job - How Uber spent a full year of its AI budget in a single quarter - Uber's approach to autonomous vehicles - Drones, hotels, and building a superapp - Lessons from Allen & Co, Barry Diller, and Reed Hastings Enjoy! Timestamps 0:00 Intro 3:44 Bringing Order to Uber’s Chaos 7:22 Managing Stress and Going All In 14:28 Why Uber Is at the Center of AI and Physical 22:39 How to Win in Autonomous Vehicles 32:25 The Trillion-Dollar AV Opportunity 37:05 Drones, Robotaxis, and Global Adoption 38:20 Uber Eats, Uber One, and Aggregating Supply 47:00 The Future of the Uber App 55:55 Lessons from Barry Diller

A
Anthropic @AnthropicAI ·
How well do the security community's techniques hold up against AI-enabled cyberattacks? We examined 832 malicious accounts and mapped their activity onto a longstanding database of tactics and techniques used by threat actors. Here's what we learned:https://t.co/fgOqJRh2rx
S
Sudo su @sudoingX ·
this is the part of the open-source push i actually love. google dropped a 12b clanker that eats text, image, video and audio natively, no separate encoders, apache 2.0, 256k context. at bf16 it's ~24gb, lands on a single 3090. they say it nears their own 26b moe at under half the memory. bold claim. the real question is whether a 12b can take qwen 3.6 27b dense, the current king on my bench on single 24gb vram tier. so i'm running it. receipts incoming soon anon.
G googlegemma @googlegemma

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

T
Tyler Barnes @tylbar ·
🚨We shipped a new context engineering primitive @mastra today, Agent Signals! (alpha) 🔓Unlocks: multiplayer steering, dynamic cacheable system prompts, agent notifications (gh/email/etc), and automatic behavior guidance 🧵
H
Hugging Models @HuggingModels ·
People after realising hermes agent and open claw are just 2% of mercury agent’s capabilities. https://t.co/X3BV4gX0dd
E EveryDevAi @EveryDevAi

Running an AI agent that forgets everything between sessions and phones home to the cloud? Mercury Agent keeps all memory local in SQLite, runs 24/7 as a daemon, and asks permission before any write or shell command. 2,500+ GitHub stars and counting. Full specs next. #DevTools https://t.co/6MnNXqGJUz

M
Mario Zechner @badlogicgames ·
imma go use this for the big pi refactor in anger. it looks really helpful.
P plannotator @plannotator

Plannotator 0.19.27 is out Plannotator will use @DanielGri's Glimpse instead of a browser if you have that installed. This creates a semi standalone experience. (See glimpse in the video along with annotating all agent messages) @kirodotdev is now supported Plan/Annotate: - You can now annotate multiple messages. - Mermaid rendering optimizations - Pi approval flow optimizations (plan mode) App wide: - A bit more of an obvious update notification that doesn't stick around.

E
Eric W. Tramel @fujikanaeda ·
we got bought by Nvidia and made a bunch of contributions to Nemotron models and released a lot of successful open source data and software like: - OpenShell - NeMo Data Designer - NeMo Anonymizer - NeMo Safe Synthesizer - Nvidia PII detector …
S soatto @soatto

What happened to all the synthetic data startups?

E
Ed Zitron @edzitron ·
OpenAI is absolutely cooked. This is loser language. You can’t be four years into the bubble saying “yeah our customers have a huge issue with how expensive our business is.” You just raised $122 billion! You can’t say shit like this! https://t.co/1uKEEpSS03 https://t.co/HWMuy3TQX8
D
dylan ツ @demian_ai ·
The next AI factory may run out of ways to talk to itself before it runs out of GPUs. This app is full of « photonics this, CPO that, $POET, $SIVE and the gang ». So I wrote a piece that actually maps why this bottleneck feels different, where it bites first, and how the thesis shifts from chips/HBM/power to the back-end fabric. You should stop picturing a data center as a room full of chips and start picturing it as a communication system. Every GPU in a training cluster is constantly asking other GPUs what they know. Gradients, activations, experts, cache state, routing decisions, all of it has to move. At small scale, copper can carry the conversation. At AI factory scale, copper starts becoming a wall. The signal has to cross board traces, connectors, cables, heat, loss, reflection, crosstalk, and distance. The fix is always more power, more retimers, more cooling, more cable bulk. At some point the system is spending too much energy keeping electricity alive long enough to become useful. That is why photonics matters. Most coverage still frames this as “optics TAM growing.” It misses the supply chain relocation and the new constraints on light sources, integration, and test that actually determine who scales. The names positioned at the light source + photonic integration layer for next-gen ELS/CPO? $POET’s Optical Interposer and $SIVE’s high-power DFB laser arrays are right in the critical path, but there are many many others. AI’s photonics bottleneck: why moving data is now harder than making chips: https://t.co/OguHp3QzFt
B
Brandon Carl @brandonjcarl ·
Let me give some "behind the scenes" as to why AI ROI is so elusive. Even if the AI works, you have to navigate the "Seven Gates of Software Hell". I ran AI for a company that managed a huge portion of the world's communications data for financial services companies. This is an excruciating read but the realities are tough. Let's get started. Suppose you want to scan all of your communications for customer complaints and respond quickly. Here's your journey: Gate 1: Data Controls Various geographies require the data to be stored in-region, and in some cases, only accessed in-region. You may need separate AI deployments for each one. The data may need to be scrubbed for PHI/PII and will need to be scrubbed for Material Non-Public Information. If it leaves the System of Record you'll need to ensure there is a way to selectively delete data so that you can adhere to GDPR, CCPA or PIPL. Gate 2: Data Quality Even if you get controls in place, you discover that your data is coming from 8 different vendors. Some are real-time, others T+1 and they all have different APIs. To boot, your corporate directory has 4 identities for Brandon Carl that have never been merged so you can't properly query even a single person's data. Gate 3: Security + Controls Given the sensitivity of the data you're sending, you'll need to go through an extensive security audit. Since this is an LLM you'll need to look beyond SOC2 and into OWASP Top 10 LLM risks and Gen AI risks too. Gate 4: SLAs Your AI Agent calls are taxing your system with bursty volumes and risking your mission-critical production workloads. You may need to set up read-only replicas, throttling and overage billing. Gate 5: Vendor Risk Your vendor will be assessed for their financial viability as well as the controls they put in place. This may go as far as analyzing the vendor's software development processes. Gate 6: Legal + Procurement You've almost made it, but procurement needs to demonstrate that they are saving the firm money. Negotiations come down to the end of the quarter. Redlines are flying everywhere to meet your firm's AI policies and to ensure there's no training on your data. Gate 7: Model Governance The AI/ML models need to be assessed versus your firm's Responsible AI Policy. And if you're going to automate things get really tricky. The model needs to be assessed for Materiality, Autonomy and Complexity. You may need documented evaluations, extensive model documentation and champion challenger comparisons performed by your own internal AI teams. ––– You've made it this far, congratulations! While you've been working through the "7 Gates of Hell" you've had to manage a team of workers you know you're going to fire to justify the AI spend. This requires coordination with HR, one-time separation costs and managing team morale for those employees that stay. Thanks @emollick for posing the question. Also see https://t.co/ISNlNgDfvH
📙
📙 Alex Hillman @alexhillman ·
Just open sourced my variant of this amazing skill that I'm now using a LOT https://t.co/gp3fdiCZcp - discovers its own context for a topic from session files - progress indicators and visual recaps - solo or co-learner modes It rules so hard
T trq212 @trq212

been asking others at Anthropic how they stay in the loop with Claude and fully understand the work being done this is one of my favorites from Suzanne: https://t.co/nqIMcGXiKI

F
Flo Crivello @Altimor ·
Pulled the trigger today and switched 100% of Lindy traffic to DeepSeek v4, churning from Anthropic models. Saves us millions of $ and we're actually seeing an *increase* in performance on many core use cases. Transformative for the business.
P
Peter Steinberger 🦞 @steipete ·
Here’s the video of my talk at MS Build: Build the thing that builds the thing. https://t.co/lJuv2twhFe
T
Thorsten Ball @thorstenball ·
Agents, everywhere.
S sqs @sqs

We rebuilt Amp's UI so you can watch and drive all your Amp agents on web, mobile, and CLI. https://t.co/nlNhe1hAA3 https://t.co/R2MwXsgDcb

C
cat @_catwu ·
Excited to share how Anthropic's data team has automated 95% of business analytics queries with Claude. Blog post covers how we approach evals, ablations, and online validation!
C ClaudeDevs @ClaudeDevs

How do we automate business analytics with Claude? New blog post covering our best practices for skills, data foundations, and evaluations when building agents to perform data analysis: https://t.co/mfEJMAQFBU

E
Elon Musk @elonmusk ·
Hadamard thought in image space
A a16z @a16z

World Labs CEO Dr. Fei-Fei Li: "The world is not made of words." "Language models have given machines an extraordinary command of concepts, vocabulary, and reasoning, but the physical world, virtual or real, runs on a different substrate." "Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics." "Language gave machines a way to talk about that world. World models are how machines will finally come to understand, imagine, reason and interact with it." Full piece: https://t.co/C9qOJg5wuc

J
JustDario @DarioCpx ·
To go out publicly and say what he is saying, @edzitron has balls the size of the Las Vegas sphere https://t.co/oHjv3LCvXh