| SXM (socket) | |
|---|---|
| Name | SXM |
| Type | Land Grid Array |
| Contacts | Varies by generation |
| Protocol | NVLink, PCI Express |
| Processor | NVIDIA Tesla, Ampere, Hopper GPUs |
| Predecessor | PCI Express |
| Variants | SXM2, SXM3, SXM4, SXM5 |
The SXM socket is a specialized processor interface developed by NVIDIA Corporation for high-performance computing and artificial intelligence accelerators. It is designed exclusively for the company's flagship data center GPUs, providing a high-bandwidth, high-power connection that surpasses standard expansion slots. The socket enables direct integration of accelerators such as the Tesla P100, A100, and H100 into dense server systems from partners such as Hewlett Packard Enterprise and Super Micro Computer.
Introduced to address the limitations of PCI Express in data center environments, the SXM form factor provides significantly higher power delivery and thermal headroom. This design is central to NVIDIA's strategy for its DGX systems and is a key component in large-scale AI supercomputer installations like the Cambridge-1 and Leonardo (supercomputer). The socket's architecture facilitates the use of NVLink for high-speed GPU-to-GPU communication, which is critical for workloads in computational science and machine learning. Its adoption is widespread in facilities operated by Microsoft Azure, Amazon Web Services, and the Texas Advanced Computing Center.
The SXM specification defines a land grid array socket with a contact count that has increased across generations to support more PCI Express lanes and enhanced NVLink capabilities. SXM modules, such as those for the NVIDIA A100, integrate High Bandwidth Memory and support advanced features like Multi-Instance GPU technology. Thermal design power for processors in this socket can exceed 500 watts, reaching 700 watts for the H100 in its SXM5 form, necessitating sophisticated cooling solutions often involving vapor chambers and liquid cooling plates. Electrical specifications are tightly controlled to maintain signal integrity for the high-speed SerDes links used by the NVLink and PCI Express protocols.
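The bandwidth gap between NVLink and a host PCI Express slot can be estimated from public spec-sheet figures. The sketch below is a back-of-the-envelope comparison, assuming the published aggregate NVLink rates per module and the standard PCIe 128b/130b line coding; it is illustrative, not an official calculation.

```python
# Rough comparison of interconnect bandwidth (public spec figures).
# PCIe rates are per direction; NVLink figures are NVIDIA's aggregate
# bidirectional per-module numbers.

def pcie_x16_gb_s(gt_per_s: float, encoding: float = 128 / 130) -> float:
    """Per-direction bandwidth of a x16 link in GB/s.

    gt_per_s: per-lane transfer rate in GT/s (16 for PCIe 4.0, 32 for 5.0).
    encoding: 128b/130b line-code efficiency used by PCIe 3.0 and later.
    """
    return 16 * gt_per_s * encoding / 8  # 16 lanes, 8 bits per byte

# Aggregate GPU-to-GPU NVLink bandwidth per module (NVIDIA spec sheets).
nvlink_total_gb_s = {
    "V100 (SXM2)": 300,   # 6 x NVLink 2 links
    "A100 (SXM4)": 600,   # 12 x NVLink 3 links
    "H100 (SXM5)": 900,   # 18 x NVLink 4 links
}

pcie4 = pcie_x16_gb_s(16.0)   # about 31.5 GB/s per direction
pcie5 = pcie_x16_gb_s(32.0)   # about 63.0 GB/s per direction

for gpu, bw in nvlink_total_gb_s.items():
    print(f"{gpu}: {bw} GB/s NVLink vs {pcie5:.1f} GB/s PCIe 5.0 x16")
```

Even against a PCIe 5.0 x16 slot, an SXM5 module's aggregate NVLink bandwidth is roughly an order of magnitude higher, which is why multi-GPU training workloads favor the socketed form factor.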
The primary variants include SXM, SXM2, SXM3, SXM4, and SXM5, each corresponding to successive GPU architectures from NVIDIA. SXM2 was introduced with the Tesla V100 based on the Volta (microarchitecture), with a higher-power SXM3 variant following for later V100 modules. SXM4 debuted with the NVIDIA A100 utilizing the Ampere (microarchitecture), and the NVIDIA H100, built on the Hopper (microarchitecture), uses the SXM5 interface. These sockets are generally not cross-compatible due to differences in pinout, keying, and power requirements, though they maintain a consistent mechanical and electrical philosophy. System integrators like Lenovo and Dell Technologies design specific server boards, such as the NVIDIA HGX platform, for each generation.
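The generation-to-GPU pairing above can be captured as a simple lookup. This is a hypothetical sketch for illustration only; the mapping follows NVIDIA's product naming (e.g. "A100-SXM4" module part numbers), and the `compatible` helper is an assumption, not a real API.

```python
# Illustrative mapping of GPU models to the SXM generation they require,
# per NVIDIA product documentation. Names and helper are hypothetical.

SXM_GENERATION = {
    "Tesla P100": "SXM",    # Pascal
    "Tesla V100": "SXM2",   # Volta (a higher-power SXM3 variant also exists)
    "A100": "SXM4",         # Ampere
    "H100": "SXM5",         # Hopper
}

def compatible(gpu: str, board_socket: str) -> bool:
    """SXM generations differ in pinout and keying, so only exact
    generation matches fit a given HGX baseboard."""
    return SXM_GENERATION.get(gpu) == board_socket
```

For example, `compatible("A100", "SXM4")` holds, while an A100 module will not seat in an SXM2 or SXM5 board.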
SXM-based accelerators are deployed in some of the world's most powerful supercomputers, including NVIDIA's Selene and the Perlmutter (supercomputer) system, for tasks ranging from climate modeling to drug discovery. They form the computational backbone of NVIDIA's own DGX Station and DGX A100 appliances, which are used by research institutions like the Massachusetts Institute of Technology and corporations such as Pfizer. In cloud infrastructure, these GPUs power instances on Google Cloud Platform and Oracle Cloud for large-scale language model training and recommender systems. Their use is also pivotal in autonomous vehicle development, both on NVIDIA's own Drive platform and at companies such as Zoox.
Unlike consumer-oriented sockets such as LGA 1700 for Intel Core processors or Socket AM5 for AMD Ryzen CPUs, the SXM interface is designed solely for data center accelerators and has no consumer motherboard support. It offers substantially higher power delivery and interconnect bandwidth than a PCI Express x16 slot, enabling features like NVLink that are absent on standard add-in cards. Its closest conceptual relative is the OCP Accelerator Module (OAM) form factor used by AMD Instinct and other vendors' accelerators; OAM, however, is an open standard promoted by the Open Compute Project consortium, whose members include Google and Microsoft, whereas SXM is a proprietary, vendor-locked solution optimized for NVIDIA's full stack of CUDA software.
Category:Computer hardware Category:Computer buses Category:NVIDIA