Optimizing Scalable Multi-Cluster Architectures for Next-Generation Wireless Sensing and Communication
METADATA ONLY
Loading...
Author / Producer
Date
2025
Publication Type
Conference Paper, INPROCEEDINGS
ETH Bibliography
yes
Citations
Altmetric
METADATA ONLY
Data
Rights / License
Abstract
Next-generation wireless technologies (for immersive-massive communication, joint communication and sensing) demand highly parallel architectures for massive data processing. A common architectural template scales up by grouping tens to hundreds of cores into shared-memory clusters, which are then scaled out as multi-cluster manycore systems. This hierarchical design, used in GPUs and accelerators, requires a balancing act between fewer large clusters and more smaller clusters, affecting design complexity, synchronization, communication efficiency, and programmability. While all multi-cluster architectures must balance these trade-offs, there is limited insight into optimal cluster sizes. This paper analyzes various cluster configurations, focusing on synchronization, data movement overhead, and programmability for typical wireless sensing and communication workloads. We extend the open-source shared-memory cluster MemPool into a multi-cluster architecture and propose a novel double-buffering barrier that decouples processor and DMA. Our results show a single 256-core cluster can be twice as fast as 16 16-core clusters for memory-bound kernels and up to 24% faster for compute-bound kernels due to reduced synchronization and communication overheads.
Permanent link
Publication status
published
Editor
Book title
2025 10th International Workshop on Advances in Sensors and Interfaces (IWASI)
Journal / series
Volume
Pages / Article No.
11122009
Publisher
IEEE
Event
10th International Workshop on Advances in Sensors and Interfaces (IWASI 2025)
Edition / version
Methods
Software
Geographic location
Date collected
Date created
Subject
Manycore; RISC-V; Synchronization
Organisational unit
03996 - Benini, Luca / Benini, Luca