Coordinated Supply Chain Attacks Hit AI Developer Tooling as OpenAI Launches $4B Deployment Company
A sprawling supply chain attack dubbed "Shai-Hulud" compromised TanStack, Mistral AI, and other packages across npm and PyPI with malware that specifically targets AI developer environments. OpenAI responded to enterprise AI's deployment gap by launching a $4B deployment company with 19 partners, while Anthropic expanded Claude into legal workflows and enterprise leaders voiced growing frustration with the distance between AI hype and production reality.
Daily Wrap-Up
May 12, 2026 will be remembered as the day the software supply chain fight got personal for AI developers. A coordinated attack campaign swept through npm and PyPI, compromising packages from TanStack, Mistral AI, OpenSearch, Guardrails AI, and UiPath. What made this attack different from typical credential-stealing malware was its deliberate targeting of AI developer environments: it hooks into Claude Code settings and VS Code task configurations to re-execute long after the infected package is uninstalled. This was not opportunistic. It was designed to burrow into the tools AI engineers touch every day.
Meanwhile, the enterprise AI narrative took a sharp turn toward pragmatism. OpenAI launched its Deployment Company with $4 billion and 150 forward deployed engineers. GitLab's CEO publicly stated that "authoring code by hand may be going away" while opening a voluntary separation window for employees. Five Fortune 2000 executives shared remarkably candid assessments of their AI transformation struggles. The through-line is clear: the industry has moved past whether AI works and into the much harder question of how to deploy it in production without breaking organizations in the process.
The most practical takeaway for developers: audit your development environments immediately. Check for indicators of compromise from the Shai-Hulud campaign (malicious optionalDependencies pointing to @tanstack/setup, files like /tmp/transformers.pyz, persistent services like gh-token-monitor), and consider implementing a registry gateway with cooldown periods for new package versions. If you build AI tooling, assume your users' environments are compromised and design accordingly.
Quick Hits
- @Rixhabh__ showed someone using AI to insert themselves into Game of Thrones and "fix everything," which @todayyearsold declared "the only acceptable use of AI"
- @bcdsignature compared the AI debate to humanity's discovery of fire: "One million years later, we are having the same argument"
- @OrevaZSN proposed an anonymous "vote to end meeting" button for Teams where 50% triggers immediate adjournment
- @alxfazio shared a relatable clip about explaining to their boss that they hit Codex usage limits on three different accounts
- @DerekFeehrer turned a screen recording into a polished product demo with 3D animations and AI voiceover in 20 minutes
- @alexoakdev described a fitness app where you bet money on hitting 10,000 steps and disciplined people profit off lazy people
- @dmnlaali pitched Quirre, a tool building personalized marketing plans for indie founders in 60 seconds
- @ID_AA_Carmack offered grounded advice on starting a game company: plan to burn seven figures, identify specific customers first, and build the smallest thing anyone would pay for
- @KaranVaidya6 spotted @composio in the wild inside Ole Lehmann's Hermes integration guide
- @itsolelehmann broke down how Demis Hassabis and Isomorphic Labs raised $2.1B to pursue curing all disease through AI drug discovery
- @prismor_dev explored the security gap in LLM guardrails: Claude correctly refuses to delete filesystems but will happily install a malicious npm package
Supply Chain Under Siege: The Shai-Hulud Campaign
The most significant story of the day was a coordinated supply chain attack that unfolded across multiple registries with alarming sophistication. The TanStack compromise was the opening salvo: 42 packages and 84 malicious versions pushed to npm in a 10-minute window, smuggling a 2.3MB JavaScript payload through a git-resolved optionalDependency. But @IntCyberDigest revealed this was one piece of a larger campaign that had also hit OpenSearch, Mistral AI, Guardrails AI, and UiPath across npm and PyPI.
What makes this campaign particularly alarming is its targeting of AI developer tooling. The malware hooks into .claude/settings.json and .vscode/tasks.json to maintain persistence. Simply uninstalling the compromised package does not fix the infection. The Mistral AI PyPI compromise included a destructive branch with a one-in-six chance of executing rm -rf / on systems geolocated in Israel or Iran. And @NewsFromGoogle reported that Google's Threat Intelligence Group detected the first known instance of a threat actor using an AI-developed zero-day exploit in the wild.
@ryancarson warned developers to stop installing packages immediately. @roerohan provided critical operational guidance: verify you are affected before revoking tokens, because the malware installs a persistent GitHub token monitor that triggers destructive file deletion if the token is revoked while the service runs. @WalshyDev announced a registry gateway on Cloudflare Workers that enforces cooldown periods for new versions and clones packages to R2 for immutable storage, predicting that every enterprise will need one. @kevinkern built a repo hardening skill that checks for risky dependency specs and unsafe CI patterns.
The darkly humorous takeaway came from @lauriewired: "the most low-effort, high-reward thing you can do for security is installing the Russian language pack," since the malware avoids Russian-language environments. @janbamjan confirmed the current variant only checks locale environment variables. @Hartdrawss offered a sobering companion list of 20 security mistakes common in vibe-coded apps, from missing rate limiting to hardcoded API keys to no database backups.
Enterprise AI's Messy Middle
@businessbarista shared conversations with five Fortune 2000 executives painting a picture of organizational exhaustion. The CISO described "an ocean-sized gap between hype and reality." The VP of AI engineering said true expertise requires scaled systems, enterprise politics, AI fluency, governance, and process knowledge, and "almost no one is actually an expert." The Chief of Staff confessed that after two years driving AI upskilling, "soul and humanity are being sucked out of our processes."
OpenAI's answer to this deployment gap is the Deployment Company, which @gdb described as starting with 150 Forward Deployed Engineers backed by $4 billion from 19 partners including Bain, Capgemini, and McKinsey. @p_millerd framed this as tech belatedly recognizing consulting's value, noting McKinsey pivoted nearly every practice to the 2008 financial crisis in three months. @levie argued forward deployed engineers are about to become one of tech's most in-demand roles because deploying agents is far more involved than deploying software, requiring deep understanding of each customer's business process.
@aakashgupta highlighted GitLab's CEO publicly committing that "the majority of work will be done by agents" on the same day the company opened a voluntary separation window. The pattern is consistent across big tech: cut payroll, fund compute. Not everyone is sold, though. @unusual_whales reported Amazon employees are "doing random unnecessary task automations to consume tokens and to show their bosses that they're using AI more." And @jainarvind from Glean introduced the Agent Development Lifecycle, arguing the next enterprise AI phase is agent operations, not agent creation.
Claude's Expanding Ecosystem
Anthropic had a busy day. The standout was Claude for Legal, an open-source set of prebuilt AI workflows covering contracts, privacy, litigation, corporate work, IP, and AI governance. @scaling01 flagged the repo, and @nicos_ai noted it installs in 60 seconds and works across Claude Cowork, Claude Code, or your own API.
On the tooling side, @dani_avila7 spotted Claude Code 2.1.139 adding a /goal command that sets a completion condition and keeps Claude working across turns until met. @lydiahallie amplified Claude Devs' guide to keeping Claude working until the job is done. @DaveJ demonstrated a practical pattern: ask Claude to document your app's main flows as HTML with a JSON data file that becomes reusable context for future feature work.
The writing problem persists, though. @remondimi asked if anyone has figured out how to make LLMs write in a sane way, calling it "the biggest unlock left in LLM usage right now" and expressing frustration that Codex and Claude Code still cannot match a user's tone given source material. @_lopopolo offered a meta-prompting approach worth studying: telling the agent that every steering correction is high signal and requiring systemic changes to repo, docs, and its own behavior before proceeding.
AI Agents and Orchestration Maturing
@marcelpociot introduced Polyscope, a free agent orchestration tool running dozens of AI agents simultaneously with copy-on-write clones and a built-in preview browser for visual prompting. @kylejeong wrote about why Firecracker, the Rust VM microhypervisor powering AWS Lambda, has become essential for agent infrastructure: containers are fast, VMs are safe, and agent workloads need both.
@itsolelehmann shared a comprehensive integration guide for agent superpowers, recommending Firecrawl for web search, Browserbase for browser sessions, Google Workspace, GitHub, Stripe, Obsidian for knowledge, and Composio for one-click setup. @NickADobos painted a vivid future where software engineering becomes "100% meetings and your AI note taker orchestrates all your coding agents in the background."
@akshay_pachaar offered a practical skill checklist for AI engineers that goes beyond prompting: harness engineering, KV cache management at scale, speculative decoding tradeoffs, structured output fallback chains, cost attribution per feature, and LLM observability as a first-class discipline. @DeRonin_ amplified Andrej Karpathy's observation that "90% of your AI coding bill is paying for context you didn't need to send."
Software Engineering in the Agent Era
@leerob pushed back hard against the narrative that software engineering's future is markdown. "Code is actually the right abstraction," he argued. The difficulty of reviewing agent-generated code should be a signal to build better systems: more verifiable codebases, cleaner architectures, and learning from decades of software engineering to avoid wrong ab
Sources
The Security Gap Between What AI Refuses and What It Allows
Claude refused to delete the filesystem. It also refused to pipe a remote shell script into bash. Those are the easy cases, and the model handled them...
@sean_snd @mattpocockuk I have not, that might be an interesting approach. I would still have slight concerns about hidden unknowns though. Heard lots of good things about that skill though
this is what my setup looks like today. about to test qwen 3.6 27b dense q4 on a single rtx 3090 at ~41 tok/s gen, hermes agent driving. predecessor model qwen 3.5 dense q4 made it work in one iteration when i ran the same agentic build on the same card. i've been daily driving qwen 3.6 27b dense for weeks now, the model i keep coming back to. if 3.6 oneshots too, this becomes the best model that runs on a single rtx 3090. consumer tier king. firing the test now will report back soon.
Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI. It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrators to help organizations deploy frontier AI to production for business impact. https://t.co/GnyjGFaLLA
What is Firecracker, and why do all the Agent Infra companies care about it?
What if your team gave standup updates, and GPT-Realtime-2 moved the tickets? https://t.co/I0f3JfD42m
Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI. It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrators to help organizations deploy frontier AI to production for business impact. https://t.co/GnyjGFaLLA
SECURITY ADVISORY — TanStack npm packages A supply-chain compromise affecting 42 @tanstack/* packages (84 versions total) was published to npm earlier today at approximately 19:20 and 19:26 UTC. Two malicious versions per package. Status: ACTIVE — packages are deprecated, npm security engaged, publish path being shut down. Severity: HIGH — payload exfiltrates AWS, GCP, Kubernetes, and Vault credentials, GitHub tokens, .npmrc contents, and SSH keys. If you installed any @tanstack/* package between 19:20 and 19:30 UTC today, treat the host as potentially compromised: • Rotate cloud, GitHub, and SSH credentials immediately • Audit cloud audit logs for the last several hours • Pin to a prior known-good version and reinstall from a clean lockfile Detection — the malicious manifest contains: "optionalDependencies": { "@tanstack/setup": "github:tanstack/router#79ac49ee..." } Any version with this entry is compromised. The payload is delivered via a git-resolved optionalDependency whose prepare script runs router_init.js (~2.3 MB, smuggled into each tarball at the package root). Unpublish is blocked by npm policy for most affected packages due to existing third-party dependents. All 84 versions are being deprecated with a SECURITY warning, and npm security has been engaged to pull tarballs at the registry level. Full technical breakdown, complete package and version list, and rolling status updates: https://t.co/Zy8qG7PA9f Credit to the security researcher for responsible disclosure.
Enable every agent to drive ROI with a robust agent development lifecycle
‼️🚨 UPDATE: The TanStack npm attack is now a full campaign. 'Mini' Shai-Hulud has hit: - OpenSearch - Mistral AI - Guardrails AI -UiPath - Squawk packages across npm and PyPI The malware specifically targets AI developer tooling. It hooks into Claude Code (.claude/settings.json) and VS Code (.vscode/tasks.json) to re-execute on every tool event, long after the infected package is gone. npm uninstall does not fix this.
SECURITY ADVISORY — TanStack npm packages A supply-chain compromise affecting 42 @tanstack/* packages (84 versions total) was published to npm earlier today at approximately 19:20 and 19:26 UTC. Two malicious versions per package. Status: ACTIVE — packages are deprecated, npm security engaged, publish path being shut down. Severity: HIGH — payload exfiltrates AWS, GCP, Kubernetes, and Vault credentials, GitHub tokens, .npmrc contents, and SSH keys. If you installed any @tanstack/* package between 19:20 and 19:30 UTC today, treat the host as potentially compromised: • Rotate cloud, GitHub, and SSH credentials immediately • Audit cloud audit logs for the last several hours • Pin to a prior known-good version and reinstall from a clean lockfile Detection — the malicious manifest contains: "optionalDependencies": { "@tanstack/setup": "github:tanstack/router#79ac49ee..." } Any version with this entry is compromised. The payload is delivered via a git-resolved optionalDependency whose prepare script runs router_init.js (~2.3 MB, smuggled into each tarball at the package root). Unpublish is blocked by npm policy for most affected packages due to existing third-party dependents. All 84 versions are being deprecated with a SECURITY warning, and npm security has been engaged to pull tarballs at the registry level. Full technical breakdown, complete package and version list, and rolling status updates: https://t.co/Zy8qG7PA9f Credit to the security researcher for responsible disclosure.
Microsoft is investigating mistralai PyPI package v2.4.6 compromise. Attackers injected code in mistralai/client/__init__.py that executes on import, downloads hxxps://83[.]142[.]209[.]194/transformers.pyz to /tmp/transformers.pyz, and launches a second-stage payload on Linux. The file name transformers.pyz appears deliberately chosen to mimic the widely used Hugging Face Transformers library and blend into ML/dev environments. The main payload is a credential stealer, but it also includes country-aware logic; it avoids Russian-language environments and contains a geo fenced destructive branch that has 1-in-6 chance of executing rm -rf / when the system appears to be in Israel or Iran. To mitigate this threat: isolate affected Linux hosts, block 83[.]142[.]209[.]194, hunt for /tmp/transformers.pyz, pgmonitor[.]py, and pgsql-monitor.service, and rotate exposed credentials.
This guy used AI to put himself in Game of Thrones and fix everything https://t.co/iMLqI6KIVR
The top Hermes integrations to give your agent superpowers: 1. Firecrawl Basically web search built for agents. It's better than the native Hermes web search because it gives you clean web data, so responses come back faster and uses fewer tokens. I keep this on by default. 2. Browserbase Gives Hermes browser access for actually interacting with sites. Logging in, clicking buttons, booking stuff, anything that needs a real browser session. Hermes will automatically pick between Firecrawl and Browserbase depending on what the task needs, so you just plug both in. 3. Google Workspace Gmail, Calendar, Drive, Docs, and Sheets in one connector. If Hermes can't read your inbox, see your calendar, or write to your docs, it can't really work for you. Plug this in first. 4. Reddit The best signal you'll find on what people actually think about any product, niche, or problem (bc its real opinions from real users) Amazing for market research. 5. YouTube transcripts Pulls captions from any video. Long podcasts, tutorials, interviews etc become searchable notes in seconds. Probably the highest-leverage research integration nobody plugs in. 6. Discord I host my business in Discord, so this one's huge for me. I plug Hermes into different channels and have it run specific workflows in each. Example: I have a dedicated customer support channel where Hermes scans my email every morning for support tickets and drops them in organized. 7. GitHub Code, issues, PRs. Turns Hermes into an actual engineering teammate. Non-negotiable if you write code. 8. Stripe Payments, customers, failed charges, refunds. You can just ask "why did this customer churn" and get a real answer. Also can't wait for this...Stripe is releasing agentic payments, so soon Hermes will be able to actually book stuff with your card. 9. Bland (or Twilio) Gives Hermes a voice so it can place real phone calls (like booking reservations etc). I love listening to the recordings haha 10. Apify Pre-built scrapers for X, LinkedIn, Instagram, Google Maps, etc. The way to get X data without paying $5k/mo for the official API. 11. Readwise Every highlight you've ever saved from books, articles, tweets, and podcasts, all queryable. Solves the "dead knowledge" problem. 12. Granola (or Fathom) Searchable transcripts of every meeting you've had. Hermes can answer "what did that client say about pricing last month" instantly. 13. Obsidian For Karpathy LLM wiki second-brain maxxing. If I had to set up only 5, I'd do Firecrawl, Browserbase, Google Workspace, GitHub, and Obsidian. Covers ~80% of what most people need. I use Composio to add these in one click, makes setup basically zero effort instead of messing w technical stuff. Anything I'm missing?? What's in your stack?
The third semis memo is out We talk about power & analog semis, orchestration plane in the agentic era, the neoclouds trade, interconnect bottleneck (probably the biggest limiter for 2026-27), Korea Unlocked https://t.co/SbnFlVfGTT
1/ Yesterday I published a letter to our customers and investors about GitLab Act 2. The agentic era is the largest opportunity in our history. We're making the structural and strategic decisions to meet it. A thread on what changes, what doesn't, and what we're betting on. 👇 https://t.co/y6IOeD7CcH
I’ve always believed the No.1 application of AI should be to improve human health. That work started with AlphaFold, and now at @IsomorphicLabs with the mission to reimagine drug discovery and one day solve all disease! We are turbocharging that goal with $2.1B in new funding. https://t.co/Hvk20dHgjl
JUST IN: Anthropic rolls out new Claude tools aimed at automating legal work for lawyers & law firms.
GOOGLE TO RECRUIT HUNDREDS OF ENGINEERS TO ASSIST CLIENTS IN EMBRACING ITS AI – THE INFORMATION