July 2nd, 2024

Nvidia NVLink Switch Chips Change to the HGX B200

NVIDIA introduced the HGX B200 board at Computex 2024, featuring two NVLink Switch chips instead of four, aiming to enhance performance and efficiency in high-performance computing applications by optimizing GPU configurations.

Read original articleLink Icon
Nvidia NVLink Switch Chips Change to the HGX B200

NVIDIA showcased a new development at Computex 2024 with the introduction of the HGX B200 board, replacing the NVLink Switch chips with a reduced quantity of two on the board. This change marks a departure from the previous four NVLink Switch chips setup. The transition to the HGX B200 board signifies a shift in design strategy compared to previous generations like the P100/V100 and A100 eras. The new board layout places the NVLink Switches in the middle of the board instead of at the edge, potentially improving high-speed signaling by reducing trace lengths. The adjustment in chip positioning and quantity aims to enhance performance and efficiency in high-performance computing applications. NVIDIA's move towards the HGX B200 board represents a step forward in optimizing GPU configurations for improved functionality and signal integrity.

Link Icon 1 comments
By @throwoutway - 5 months
> That DeepLearning12 build cost a lot. For example, we had to get a $350 torque driver that could hit the tolerances needed not to crack the NVIDIA P100 GPUs. I was in the data center starting to install these, and one of our kind readers who worked on the Baidu AI research cluster in that same data center stopped me, letting me know I might break the GPUs if the torque was not perfect. Indeed, the worst story I heard about this was that HPE broke several V100 generation GPUs due to torque/ thermal paste thickness.

This is scary to me!