The CoWoS bottleneck

AIAcademy · AIAcademy · 2026-05-16

Read Digitimes on TSMC packaging capacity

The 2026 AI supply story is not the 3nm or 2nm fab race. It is packaging. Specifically, it is TSMC's chip-on-wafer-on-substrate process, known as CoWoS, which is what stitches a GPU die together with its stacks of High-Bandwidth Memory on a single silicon interposer. Every Hopper, every Blackwell, every Rubin shipped to date has passed through a CoWoS line. There are not enough CoWoS lines.

What CoWoS does. Modern accelerators are not single chips. A B200 is two GPU dies and eight HBM stacks sitting on a passive silicon interposer roughly the size of a large postage stamp, all bonded into one package. CoWoS is the assembly step that builds that package — etching the interposer, placing the dies, routing tens of thousands of micro-bumps between them. Without it, you have a GPU die and a pile of memory and no way to connect them at the bandwidth a frontier model needs.

Why it gates supply. Leading-edge wafer capacity at TSMC's N3 and N2 nodes is large and growing. CoWoS capacity is small and growing slowly — clean room build-out, equipment lead times, and yield learning on each generation all stretch into years. Digitimes reports TSMC is doubling CoWoS capacity through 2026 and still cannot meet Nvidia's demand; CNBC's reporting confirms the queue. A 3nm die you can re-spin in a quarter. A CoWoS line takes 18-24 months to bring online.