Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors

Shared memory is the most popular parallel programming model for multi-core processors, while message passing is generally used for large distributed machines. However, as the number of cores on a chip increases, the relative merits of shared memory versus message passing change, and we argue that m...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Joonmoo Huh, Deokwoo Lee
Formato: article
Lenguaje:EN
Publicado: MDPI AG 2021
Materias:
Acceso en línea:https://doaj.org/article/ed3de22360b9427484326ada995d0fc3
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:ed3de22360b9427484326ada995d0fc3
record_format dspace
spelling oai:doaj.org-article:ed3de22360b9427484326ada995d0fc32021-11-11T15:40:19ZEffective On-Chip Communication for Message Passing Programs on Multi-Core Processors10.3390/electronics102126812079-9292https://doaj.org/article/ed3de22360b9427484326ada995d0fc32021-11-01T00:00:00Zhttps://www.mdpi.com/2079-9292/10/21/2681https://doaj.org/toc/2079-9292Shared memory is the most popular parallel programming model for multi-core processors, while message passing is generally used for large distributed machines. However, as the number of cores on a chip increases, the relative merits of shared memory versus message passing change, and we argue that message passing becomes a viable, high performing, and parallel programming model. To demonstrate this hypothesis, we compare a shared memory architecture with a new message passing architecture on a suite of applications tuned for each system independently. Perhaps surprisingly, the fundamental behaviors of the applications studied in this work, when optimized for both models, are very similar to each other, and both could execute efficiently on multicore architectures despite many implementations being different from each other. Furthermore, if hardware is tuned to support message passing by supporting bulk message transfer and the elimination of unnecessary coherence overheads, and if effective support is available for global operations, then some applications would perform much better on a message passing architecture. Leveraging our insights, we design a message passing architecture that supports both memory-to-memory and cache-to-cache messaging in hardware. With the new architecture, message passing is able to outperform its shared memory counterparts on many of the applications due to the unique advantages of the message passing hardware as compared to cache coherence. In the best case, message passing achieves up to a 34% increase in speed over its shared memory counterpart, and it achieves an average 10% increase in speed. In the worst case, message passing is slowed down in two applications—CG (conjugate gradient) and FT (Fourier transform)—because it could not perform well on the unique data sharing patterns as its counterpart of shared memory. Overall, our analysis demonstrates the importance of considering message passing as a high performing and hardware-supported programming model on future multicore architectures.Joonmoo HuhDeokwoo LeeMDPI AGarticlemessage passingscalableon-chip communicationhardware designElectronicsTK7800-8360ENElectronics, Vol 10, Iss 2681, p 2681 (2021)
institution DOAJ
collection DOAJ
language EN
topic message passing
scalable
on-chip communication
hardware design
Electronics
TK7800-8360
spellingShingle message passing
scalable
on-chip communication
hardware design
Electronics
TK7800-8360
Joonmoo Huh
Deokwoo Lee
Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
description Shared memory is the most popular parallel programming model for multi-core processors, while message passing is generally used for large distributed machines. However, as the number of cores on a chip increases, the relative merits of shared memory versus message passing change, and we argue that message passing becomes a viable, high performing, and parallel programming model. To demonstrate this hypothesis, we compare a shared memory architecture with a new message passing architecture on a suite of applications tuned for each system independently. Perhaps surprisingly, the fundamental behaviors of the applications studied in this work, when optimized for both models, are very similar to each other, and both could execute efficiently on multicore architectures despite many implementations being different from each other. Furthermore, if hardware is tuned to support message passing by supporting bulk message transfer and the elimination of unnecessary coherence overheads, and if effective support is available for global operations, then some applications would perform much better on a message passing architecture. Leveraging our insights, we design a message passing architecture that supports both memory-to-memory and cache-to-cache messaging in hardware. With the new architecture, message passing is able to outperform its shared memory counterparts on many of the applications due to the unique advantages of the message passing hardware as compared to cache coherence. In the best case, message passing achieves up to a 34% increase in speed over its shared memory counterpart, and it achieves an average 10% increase in speed. In the worst case, message passing is slowed down in two applications—CG (conjugate gradient) and FT (Fourier transform)—because it could not perform well on the unique data sharing patterns as its counterpart of shared memory. Overall, our analysis demonstrates the importance of considering message passing as a high performing and hardware-supported programming model on future multicore architectures.
format article
author Joonmoo Huh
Deokwoo Lee
author_facet Joonmoo Huh
Deokwoo Lee
author_sort Joonmoo Huh
title Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
title_short Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
title_full Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
title_fullStr Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
title_full_unstemmed Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors
title_sort effective on-chip communication for message passing programs on multi-core processors
publisher MDPI AG
publishDate 2021
url https://doaj.org/article/ed3de22360b9427484326ada995d0fc3
work_keys_str_mv AT joonmoohuh effectiveonchipcommunicationformessagepassingprogramsonmulticoreprocessors
AT deokwoolee effectiveonchipcommunicationformessagepassingprogramsonmulticoreprocessors
_version_ 1718434574390788096