Media Summary: Want to scale beyond the limits of a single GPU? Learn how to use CUDA-aware MPI, NVSHMEM, and Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey (NVIDIA), Cedell Alexander, Eric Spada (Broadcom), ... In this AI Research Roundup episode, Alex discusses the paper: 'Collective Communication for 100k+ GPUs(2510.20171v1)' This ...

Multigpu Nccl From The Authors - Detailed Analysis & Overview

Want to scale beyond the limits of a single GPU? Learn how to use CUDA-aware MPI, NVSHMEM, and Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey (NVIDIA), Cedell Alexander, Eric Spada (Broadcom), ... In this AI Research Roundup episode, Alex discusses the paper: 'Collective Communication for 100k+ GPUs(2510.20171v1)' This ... Welcome to this deep dive into GPU-GPU communication for high-performance computing and machine learning with me, Dr. This video was recorded at Lambda Days 2022 - Using smoke and mirrors to ... NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

Presenter(s): James Hongyi Zeng, Senior Engineering Manager, Meta As Meta's AI infrastructure scales to massive- ... Scaling beyond a single GPU doesn't have to be hard. In this NVIDIA GTC 2025 session, explore how distributed Materials and Molecular Modelling Hub GPU Training Day:

Photo Gallery

MultiGPU + NCCL from the authors
NCCL Explained: How NVIDIA's GPU Communication Library Powers Distributed Deep Learning
Lecture 17: NCCL
Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025
Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu
NCCLX: Collective Comms for 100k+ GPUs
GPU-GPU Communication: Boosting HPC with Peer-to-Peer Access & RCCL/NCCL
Using smoke & mirrors to compile a (...) to efficient GPU code | Troels Henriksen | Lambda Days 2022
NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA
Lecture 67: NCCL and NVSHMEM
GPU Communication Library in Meta-Scale AI Clusters
Getting Started with Distributed Multi-GPU Libraries for Scalable AI and HPC | NVIDIA GTC 2025
Sponsored
View Detailed Profile
MultiGPU + NCCL from the authors

MultiGPU + NCCL from the authors

Speaker: Jeff Hammond.

NCCL Explained: How NVIDIA's GPU Communication Library Powers Distributed Deep Learning

NCCL Explained: How NVIDIA's GPU Communication Library Powers Distributed Deep Learning

In this video, we break down

Lecture 17: NCCL

Lecture 17: NCCL

Code and Slides: https://github.com/cuda-mode/lectures/tree/main/lecture_017.

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025

Want to scale beyond the limits of a single GPU? Learn how to use CUDA-aware MPI, NVSHMEM, and

Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey (NVIDIA), Cedell Alexander, Eric Spada (Broadcom), ...

Sponsored
NCCLX: Collective Comms for 100k+ GPUs

NCCLX: Collective Comms for 100k+ GPUs

In this AI Research Roundup episode, Alex discusses the paper: 'Collective Communication for 100k+ GPUs(2510.20171v1)' This ...

GPU-GPU Communication: Boosting HPC with Peer-to-Peer Access & RCCL/NCCL

GPU-GPU Communication: Boosting HPC with Peer-to-Peer Access & RCCL/NCCL

Welcome to this deep dive into GPU-GPU communication for high-performance computing and machine learning with me, Dr.

Using smoke & mirrors to compile a (...) to efficient GPU code | Troels Henriksen | Lambda Days 2022

Using smoke & mirrors to compile a (...) to efficient GPU code | Troels Henriksen | Lambda Days 2022

This video was recorded at Lambda Days 2022 -https://www.lambdadays.org/lambdadays2022 Using smoke and mirrors to ...

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

Lecture 67: NCCL and NVSHMEM

Lecture 67: NCCL and NVSHMEM

Speaker: Jeff Hammond.

GPU Communication Library in Meta-Scale AI Clusters

GPU Communication Library in Meta-Scale AI Clusters

Presenter(s): James Hongyi Zeng, Senior Engineering Manager, Meta As Meta's AI infrastructure scales to massive- ...

Getting Started with Distributed Multi-GPU Libraries for Scalable AI and HPC | NVIDIA GTC 2025

Getting Started with Distributed Multi-GPU Libraries for Scalable AI and HPC | NVIDIA GTC 2025

Scaling beyond a single GPU doesn't have to be hard. In this NVIDIA GTC 2025 session, explore how distributed

5 MMM Hub GPU Training Day: Multi GPU Programming with MPI & NCCL Jiri Kraus, 31 March 22

5 MMM Hub GPU Training Day: Multi GPU Programming with MPI & NCCL Jiri Kraus, 31 March 22

Materials and Molecular Modelling Hub GPU Training Day: