DCAIPFAug 24, 2025

Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality

arXiv:2508.17311v24 citationsh-index: 9SC
Originality Incremental advance
AI Analysis

This work addresses performance bottlenecks in high-performance computing for users of large-scale systems with oversubscribed networks, representing an incremental improvement over existing methods like binomial trees and butterflies.

The paper tackled the problem of communication locality in collective operations on large HPC systems by introducing Bine trees, a family of algorithms that reduce global-link traffic by up to 33% and achieve up to 5x speedups across various supercomputer topologies.

Communication locality plays a key role in the performance of collective operations on large HPC systems, especially on oversubscribed networks where groups of nodes are fully connected internally but sparsely linked through global connections. We present Bine (binomial negabinary) trees, a family of collective algorithms that improve communication locality. Bine trees maintain the generality of binomial trees and butterflies while cutting global-link traffic by up to 33%. We implement eight Bine-based collectives and evaluate them on four large-scale supercomputers with Dragonfly, Dragonfly+, oversubscribed fat-tree, and torus topologies, achieving up to 5x speedups and consistent reductions in global-link traffic across different vector sizes and node counts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes