SemiAnalysis • 15961 implied HN points • 25 Feb 26
- NVIDIA built Rubin as an "extreme co-design" where the rack is treated as one integrated compute unit, combining Rubin GPUs, Vera CPUs, NVLink‑6 switches, ConnectX‑9 NICs, BlueField‑4 DPUs and Spectrum switches to push performance and tight system control.
- Rubin GPUs prioritize low‑precision scaling (big FP4/FP8 gains), much higher HBM bandwidth and an adaptive compression engine for sparsity, but they also bring very large power envelopes (up to 2300W), driving big thermal and cost impacts.
- The NVL72 rack is redesigned for manufacturing and reliability: cableless modular trays with board‑to‑board connectors, upgraded high‑end PCBs, 100% liquid cooling and 50V power delivery, which shifts component, cooling and assembly supply chains and raises TCO considerations.