Automatic parallelization with pMapper

Nadya Bliss, Henry Hoffmann, Robert Bond, Hector Chan, Jeremy Kepner, Edmund Wong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Algorithm implementation efficiency is key to delivering high-performance computing capabilities to demanding, high throughput signal and image processing applications and simulations. Significant progress has been made in optimization of serial programs, but many applications require parallel processing, which brings with it the difficult task of determining efficient mappings of algorithms. The pMapper infrastructure addresses the problem of performance optimization of multistage MATLAB® applications on parallel architectures. pMapper is an automatic performance tuning library written as a layer on top of pMatlab: Parallel Matlab Toolbox. While pMatlab abstracts the message-passing interface, the responsibility of mapping numerical arrays falls on the user. Choosing the best mapping for a set of numerical arrays is a nontrivial task that requires significant knowledge of programming languages, parallel computing, and processor architecture. pMapper automates the task of map generation. This abstract addresses the design details of pMapper and presents preliminary results.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Conference on Cluster Computing, ICCC
DOIs
StatePublished - 2005
Externally publishedYes
Event2005 IEEE International Conference on Cluster Computing, CLUSTER - Burlington, MA, United States
Duration: Sep 27 2005Sep 30 2005

Other

Other2005 IEEE International Conference on Cluster Computing, CLUSTER
CountryUnited States
CityBurlington, MA
Period9/27/059/30/05

Fingerprint

Parallel architectures
Message passing
Parallel processing systems
Computer programming languages
MATLAB
Program processors
Signal processing
Image processing
Tuning
Throughput
Processing

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Bliss, N., Hoffmann, H., Bond, R., Chan, H., Kepner, J., & Wong, E. (2005). Automatic parallelization with pMapper. In Proceedings - IEEE International Conference on Cluster Computing, ICCC [4154145] https://doi.org/10.1109/CLUSTR.2005.347017

Automatic parallelization with pMapper. / Bliss, Nadya; Hoffmann, Henry; Bond, Robert; Chan, Hector; Kepner, Jeremy; Wong, Edmund.

Proceedings - IEEE International Conference on Cluster Computing, ICCC. 2005. 4154145.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bliss, N, Hoffmann, H, Bond, R, Chan, H, Kepner, J & Wong, E 2005, Automatic parallelization with pMapper. in Proceedings - IEEE International Conference on Cluster Computing, ICCC., 4154145, 2005 IEEE International Conference on Cluster Computing, CLUSTER, Burlington, MA, United States, 9/27/05. https://doi.org/10.1109/CLUSTR.2005.347017
Bliss N, Hoffmann H, Bond R, Chan H, Kepner J, Wong E. Automatic parallelization with pMapper. In Proceedings - IEEE International Conference on Cluster Computing, ICCC. 2005. 4154145 https://doi.org/10.1109/CLUSTR.2005.347017
Bliss, Nadya ; Hoffmann, Henry ; Bond, Robert ; Chan, Hector ; Kepner, Jeremy ; Wong, Edmund. / Automatic parallelization with pMapper. Proceedings - IEEE International Conference on Cluster Computing, ICCC. 2005.
@inproceedings{2d9fc4b1db7245ba8cf82069f4e58b44,
title = "Automatic parallelization with pMapper",
abstract = "Algorithm implementation efficiency is key to delivering high-performance computing capabilities to demanding, high throughput signal and image processing applications and simulations. Significant progress has been made in optimization of serial programs, but many applications require parallel processing, which brings with it the difficult task of determining efficient mappings of algorithms. The pMapper infrastructure addresses the problem of performance optimization of multistage MATLAB{\circledR} applications on parallel architectures. pMapper is an automatic performance tuning library written as a layer on top of pMatlab: Parallel Matlab Toolbox. While pMatlab abstracts the message-passing interface, the responsibility of mapping numerical arrays falls on the user. Choosing the best mapping for a set of numerical arrays is a nontrivial task that requires significant knowledge of programming languages, parallel computing, and processor architecture. pMapper automates the task of map generation. This abstract addresses the design details of pMapper and presents preliminary results.",
author = "Nadya Bliss and Henry Hoffmann and Robert Bond and Hector Chan and Jeremy Kepner and Edmund Wong",
year = "2005",
doi = "10.1109/CLUSTR.2005.347017",
language = "English (US)",
isbn = "0780394852",
booktitle = "Proceedings - IEEE International Conference on Cluster Computing, ICCC",

}

TY - GEN

T1 - Automatic parallelization with pMapper

AU - Bliss, Nadya

AU - Hoffmann, Henry

AU - Bond, Robert

AU - Chan, Hector

AU - Kepner, Jeremy

AU - Wong, Edmund

PY - 2005

Y1 - 2005

N2 - Algorithm implementation efficiency is key to delivering high-performance computing capabilities to demanding, high throughput signal and image processing applications and simulations. Significant progress has been made in optimization of serial programs, but many applications require parallel processing, which brings with it the difficult task of determining efficient mappings of algorithms. The pMapper infrastructure addresses the problem of performance optimization of multistage MATLAB® applications on parallel architectures. pMapper is an automatic performance tuning library written as a layer on top of pMatlab: Parallel Matlab Toolbox. While pMatlab abstracts the message-passing interface, the responsibility of mapping numerical arrays falls on the user. Choosing the best mapping for a set of numerical arrays is a nontrivial task that requires significant knowledge of programming languages, parallel computing, and processor architecture. pMapper automates the task of map generation. This abstract addresses the design details of pMapper and presents preliminary results.

AB - Algorithm implementation efficiency is key to delivering high-performance computing capabilities to demanding, high throughput signal and image processing applications and simulations. Significant progress has been made in optimization of serial programs, but many applications require parallel processing, which brings with it the difficult task of determining efficient mappings of algorithms. The pMapper infrastructure addresses the problem of performance optimization of multistage MATLAB® applications on parallel architectures. pMapper is an automatic performance tuning library written as a layer on top of pMatlab: Parallel Matlab Toolbox. While pMatlab abstracts the message-passing interface, the responsibility of mapping numerical arrays falls on the user. Choosing the best mapping for a set of numerical arrays is a nontrivial task that requires significant knowledge of programming languages, parallel computing, and processor architecture. pMapper automates the task of map generation. This abstract addresses the design details of pMapper and presents preliminary results.

UR - http://www.scopus.com/inward/record.url?scp=50149095634&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=50149095634&partnerID=8YFLogxK

U2 - 10.1109/CLUSTR.2005.347017

DO - 10.1109/CLUSTR.2005.347017

M3 - Conference contribution

SN - 0780394852

SN - 9780780394858

BT - Proceedings - IEEE International Conference on Cluster Computing, ICCC

ER -