Discussion about this post

User's avatar
Gregg McKnight's avatar

Disaggregated memory was a theme we explored 15 years ago when I ran a cloud infrastructure dev team at Microsoft Research. We even had a private meeting with Jenson to explore several opportunities including a disaggregated memory pool solution for GPU, CPU and FPGA hybrid designs. In the end, the opportunity never panned out for several reasons. Clearly the additional latency, especially with copper became a showstopper for latency sensitive workloads. And multi-master implications to queuing latency compounded the already problematic limitations of copper. Optical may help in this regard. But your article got me thinking tactically about MU as an investment.

On the most recent Micron earnings call, CEO Sanjay Mehrotra's confirmed that non-HBM margins currently exceed HBM margins. UBS quantified this, projecting traditional DRAM gross margins reaching 67% in Q2 2026 versus HBM's 62%, widening to 75% by Q4 2026. This inverts the popular narrative that HBM is where all pricing power lives. The reason is structural, not cyclical: every HBM4 wafer displaces approximately three conventional DRAM wafers while achieving lower yields (~50% HBM vs. ~70% for DDR5). By starving the conventional DRAM market to feed HBM, the memory industry has inadvertently made the product it's diverting away from more profitable than the premium product itself. Now lets layer on the implications of agentic AI...

I have yet to see a comprehensive analysis for how agentic CPU demand affects DRAM demand. With current GPU to CPU ratios of 4:1 to 8:1 evolving to 1:1 as agentic workloads flood into datacenters. For DRAM the implication appears structurally MORE positive — if HBM fades, the replacement architecture runs on commodity DRAM/LPDDR, which Micron produces at higher yields and currently at higher margins compared to complex HBM stacks. This is a net positive for MU provided agentic demand exceeds the marginal reduction of HBM disaggregation. But I suspect a massive shift away from HBM will reduce the DRAM premium as greater wafer supply floods into the DRAM market. This will be one to watch.

Thanks for another thought provoking article!

Steve Romero's avatar

This is the picks-and-shovels thesis stated in a single layer, and it's the cleanest version I've read. Bandwidth was never a memory problem, it's a real-estate problem at the chip edge, which is exactly why I stopped trying to pick the winning accelerator and started owning the layers that get paid no matter which one wins. The link is the control point precisely because the market keeps pricing it as a cost.

The shoreline can resolve four ways: optical disaggregation, hybrid-bonding/3D-DRAM widening HBM's own doorway, wafer-scale integration that erases the edge (Cerebras), or compute-in-memory that never crosses it (d-Matrix). The architecture is the contested layer. But across all four branches the same three slices get paid: the interconnect that crosses the shoreline, the bulk DRAM bits in whatever form they take, and the power to run the hotter stove your last image promised. I'd rather own the layers than the kitchen.

Long the networking/optical complex and power, plus CBRS as the erase-the-edge bet. Also long d-Matrix. #NotAdvice

No posts

Ready for more?