Skip to content
News

Microsoft’s Custom AI Chip Program Is Dead — Nvidia Wins Again

Microsoft shut down its custom AI chip division after 18 months and ~$800M spent, confirming that dethroning Nvidia in AI hardware is harder than anyone’s balance sheet can fix.

4 min read
Microsoft's Custom AI Chip Program Is Dead — Nvidia Wins Again

Reuters reported on February 20, 2026 that Microsoft has shut down its internal custom AI chip division after roughly 18 months of trying — and failing — to build silicon that could meaningfully challenge Nvidia’s H100s and B200s in large-scale AI workloads. The project reportedly burned through approximately $800 million before leadership decided the math simply didn’t work.

This isn’t a story about one bad product decision. It’s a story about just how deep Nvidia’s moat actually runs — and why even one of the richest companies on the planet couldn’t dig around it.

What Microsoft Was Trying to Do

The ambition was straightforward enough on paper: reduce dependency on Nvidia by designing custom accelerators tailored specifically for Microsoft’s AI workloads — everything from Azure inference to the backend muscle powering Copilot. Apple did it with the M-series. Google did it with TPUs. Amazon did it with Trainium and Inferentia. Microsoft wanted its seat at that table.

The problem is that wanting a seat and earning one are different things. Microsoft reportedly struggled with two compounding issues: raw chip performance that couldn’t match Nvidia’s latest generation, and a software ecosystem — CUDA — that nobody has successfully replicated at scale. Engineers building AI models don’t just want fast chips; they want the tools, libraries, and workflows they already know. Nvidia spent a decade building that lock-in, and it shows.

Nvidia's architecture: still the benchmark.
Nvidia's architecture: still the benchmark.

$800 Million and 18 Months Later

Reuters cited insiders putting the total spend at around $800 million over the program’s lifespan. That’s a large number by most measures, but in the context of AI infrastructure investment in 2025-2026, it’s not shocking — it’s actually modest for a project of this ambition. What’s notable is that Microsoft walked away rather than doubling down. That suggests internal benchmarks came back ugly enough that no amount of additional capital looked like it would close the gap in any reasonable timeframe.

The timing is pointed. Nvidia’s Blackwell architecture has been shipping at scale, and the performance delta between Nvidia’s best silicon and everyone else’s best attempt has not been shrinking. If anything, with each generation Nvidia widens it slightly before competitors catch up — at which point the next generation drops.

The Vertical Integration Wall

Google’s TPU success and Apple’s M-series dominance led a lot of people to conclude that vertical integration in AI hardware was the obvious play. Control your silicon, control your costs, control your destiny. Microsoft bet on that thesis. The problem is that Google spent years and multiple generations getting TPUs to actually work well, and Apple’s use case — consumer devices — is structurally simpler than training and serving massive frontier models across a hyperscale cloud.

Microsoft’s Azure business runs on diversity. It serves customers running every kind of workload on every kind of framework. Building a proprietary accelerator that works brilliantly for a narrow set of internal use cases is one thing; making it general enough to matter across Azure’s full customer base is another problem entirely. That tension may have been as damaging as any pure engineering shortfall.

Vertical integration, meet the wall.
Vertical integration, meet the wall.

What Nvidia Actually Sells

It’s easy to look at Nvidia’s margins — which have been extraordinary by any reasonable standard — and conclude the company is simply extracting rent. That’s partially true. But the harder truth is that CUDA, NVLink, and the software stack Nvidia has assembled represent a decade of accumulated lock-in that money alone cannot quickly replicate. Microsoft had the money. It didn’t have the decade.

Every major cloud provider has tried to chip away at this: Amazon with Trainium, Google with TPUs, now Microsoft’s failed attempt. Intel’s Gaudi series exists. Qualcomm has AI silicon. None of them have meaningfully dented Nvidia’s share of frontier model training. That’s not a coincidence — it’s a structural reality that the industry keeps relearning at significant expense.

What Comes Next for Microsoft

Microsoft’s AI infrastructure spending isn’t going anywhere — the company committed to $80 billion in data center investment for 2025 alone. But the strategy apparently shifts back toward being Nvidia’s best customer rather than its competitor. Expect Microsoft to lock in substantial Blackwell and next-generation GPU supply agreements and to refocus any silicon effort on narrower, more defensible targets like networking or inference optimization rather than training accelerators.

For Nvidia, this is validation wearing a business suit. Every time a hyperscaler announces a custom chip program, Jensen Huang’s company watches quietly. Every time one of those programs quietly dies, Nvidia’s stock moves. The throne, for now, isn’t going anywhere.

promptyze

ADMINISTRATOR