Muutke küpsiste eelistusi

Applications, Tools and Techniques on the Road to Exascale Computing [Kõva köide]

Edited by , Edited by , Edited by , Edited by , Edited by
  • Formaat: Hardback, 688 pages
  • Sari: Advances in Parallel Computing 22
  • Ilmumisaeg: 01-May-2012
  • Kirjastus: IOS Press,US
  • ISBN-10: 1614990409
  • ISBN-13: 9781614990406
Teised raamatud teemal:
  • Formaat: Hardback, 688 pages
  • Sari: Advances in Parallel Computing 22
  • Ilmumisaeg: 01-May-2012
  • Kirjastus: IOS Press,US
  • ISBN-10: 1614990409
  • ISBN-13: 9781614990406
Teised raamatud teemal:
Single processing units have now reached a point where further major improvements in their performance are restricted by their physical limitations. This is causing a slowing down in advances at the same time as new scientific challenges are demanding exascale speed. This has meant that parallel processing has become key to High Performance Computing (HPC). This book contains the proceedings of the 14th biennial ParCo conference, ParCo2011, held in Ghent, Belgium. The ParCo conferences have traditionally concentrated on three main themes: Algorithms, Architectures and Applications. Nowadays though, the focus has shifted from traditional multiprocessor topologies to heterogeneous and manycores, incorporating standard CPUs, GPUs (Graphics Processing Units) and FPGAs (Field Programmable Gate Arrays). These platforms are, at a higher abstraction level, integrated in clusters, grids and clouds. The papers presented here reflect this change of focus. New architectures, programming tools and techniques are also explored, and the need for exascale hardware and software was also discussed in the industrial session of the conference. This book will be of interest to all those interested in parallel computing today, and progress towards the exascale computing of tomorrow.
Preface v
Erik D'Hollander
Koen De Bosschere
Gerhard R. Joubert
David Padua
Frans Peters
Conference Organisation vii
Keynotes
The Future of High Performance Computing in Europe
3(4)
Bernhard Fabianek
Christian Cuciniello
PRACE: Europe's Supercomputing Research Infrastructure
7(12)
Thomas Lippert
Thomas Eickermann
Dietmar Erwin
Comparison of Admission Control Policies for Service Provision in Public Clouds
19(10)
J.L. Vazquez Poletti
R. Moreno-Vozmediano
I.M. Llorente
Program Execution Models for Massively Parallel Computing
29(12)
Jack B. Dennis
Advances in Physarum Machines Gates, Hulls, Mazes and Routing with Slime Mould
41(16)
Andrew Adamatzky
Algorithms
Parallel Remeshing in Tree Codes for Vortex Particle Methods
57(8)
Robert Speck
Rolf Krause
Paul Gibbon
A Case Study of the Task-Based Parallel Wavefront Pattern
65(8)
Antonio J. Dios
Angeles Navarro
Rafael Asenjo
Francisco Corbera
Emilio L. Zapata
Design and Evaluation of a Parallel Execution Framework for the CLEVER Clustering Algorithm
73(8)
Chung Sheng Chen
Nauful Shaikh
Panitee Charoenrattanaruk
Christoph F. Eick
Nouhad Rizk
Edgar Gabriel
The BL-Octree: An Efficient Data Structure for Discretized Block-Based Adaptive Mesh Refinement
81(10)
Ashley Zebrowski
Frank Loffler
Erik Schnetter
Automatic Parallelisation
Towards Parallelizing Object-Oriented Programs Automatically
91(8)
Welf Lowe
Jonas Lundberg
Heap Dependence Analysis for Sequential Programs
99(10)
Barnali Basak
Sandeep Dasgupta
Amey Karkare
Cloud Computing
Energy Aware Consolidation Policies
109(8)
Mehdi Sheikhalishahi
Ignacio Martin Llorente
Lucio Grandinetti
MapReduce for Scientific Computing - Viability for Non-Embarrassingly Parallel Algorithms
117(8)
Pelle Jakovits
Satish Narayan Srirama
Eero Vainikko
An Autonomic Management System for Choreography-Based Workflows on Grids and Clouds
125(8)
Giuseppe Papuzzo
Giandomenico Spezzano
Remote Utilization of OpenCL for Flexible Computation Offloading Using Embedded ECUs, CE Devices and Cloud Servers
133(10)
Holger Endt
Kay Weckemann
GPU Applications
Monte Carlo Option Pricing with Graphics Processing Units
143(8)
Fredrik Nord
Erwin Laure
Speeding-Up the Discrete Wavelet Transform Computation with Multicore and GPU-Based Algorithms
151(8)
V. Galiano
O. Lopez
M.P. Malumbres
H. Migallon
Flexible Runtime Support for Efficient Skeleton Programming on Heterogeneous GPU-Based Systems
159(8)
Usman Dastgeer
Christoph Kessler
Samuel Thibault
Lattice Boltzmann for Large-Scale GPU Systems
167(8)
Alan Gray
Alistair Hart
Alan Richardson
Kevin Stratford
High-Fidelity Real-Time Antiship Cruise Missile Modeling on the GPU
175(8)
Christopher Scannell
Jonathan Decker
Joseph Collins
William Smith
Egomotion Compensation and Moving Objects Detection Algorithm on GPU
183(8)
Juan Gomez-Luna
Holger Endt
Walter Stechele
Jose Maria Gonzalez-Linares
Jose Ignacio Benavides
Nicolas Guil
Performance Model for a Cellular Automata Implementation on a GPU Cluster
191(8)
Paul Albuquerque
Pierre Kunzli
Xavier Meyer
(iPU-Based Image Processing Use Cases: A High-Level Approach
199(10)
Volkmar Wieser
Clemens Grelck
Holger Schoner
Peter Haslinger
Karoly Bosa
Bernhard Moser
Heterogeneous Computing
Parallel Likelihood Function Evaluation on Heterogeneous Many-Core Systems
209(8)
Sverre Jarp
Alfio Lazzaro
Julien Leduc
Andrzej Nowak
Yngve Sneen Lindal
A Model-Based Software Generation Approach Qualified for Heterogeneous GPGPU-Enabled Platforms
217(10)
Holger Endt
Lothar Stolz
Martin Wechs
Walter Stechele
High Performance Applications
Trajectory-Search on ScaleMP's vSMP Architecture
227(8)
Nicolas Berr
Dirk Schmidl
Jens Henrik Gobbert
Stefan Lankes
Dieteran Mey
Thomas Bemmerl
Christian Bischof
Towards an Application of High-Performance Computer Systems to 3D Simulations of High Energy Density Plasmas in Z-Pinches
235(8)
Vladimir Gasilov
Alexey Boldarev
Sergey Dyachenko
Olga Olkhovskaya
Elena Kartasheva
Gennadiy Bagdasarov
Sergey Boldyrev
Irina Gasilova
Valeriy Shmyrov
Svetlana Tkachenko
Julien Grunenwald
Thierry Maillard
On-the-Fly Singular Value Decomposition for Aitken's Acceleration of the Schwarz Domain Decomposition Method
243(8)
Laurent Berenguer
Thomas Dufaud
Toan Pham
Damien Tromeur-Dervout
A Software Concept for Cache-Efficient Simulation on Dynamically Adaptive Structured Triangular Grids
251(10)
Oliver Meister
Kaveh Rahnema
Michael Bader
Performance Artalysis of an Ultrasound Reconstruction Algorithm for Non Destructive Testing
261(10)
Antoine Pedron
Lionel Lacassagne
Victor Barbillon
Franck Bimbard
Gilles Rougeron
Stephane Le Berre
Languages
Corento - SIMD Parallelism from Portable High-Level Code
271(10)
Juhana Helovuo
Jarkko Niittylahti
Heikki Berg
A Parallel Benchmark Suite for Fortran Coarrays
281(8)
David Henty
SAC on a Niagara T3-4 Server: Lessons and Experiences
289(8)
Clemens Grelck
Roeland Douma
Declarative Parallel Programming for GPUs
297(10)
Eric Holk
William Byrd
Nilesh Mahajan
Jeremiah Willcock
Arun Chauhan
Andrew Lumsdaine
Load Balancing
Balancing CPU Load for Irregular MPI Applications
307(10)
Jorg Keller
Mudassar Majeed
Christoph W. Kessler
Reactive Rebalancing for Scientific Simulations Running on ExaScale High Performance Computers
317(10)
Roel Wuyts
Karl Meerbergen
Pascal Costanza
Massive Parallelism
Processing with a Million Cores
327(8)
Andrew Brown
Jeffrey Reeve
Stephen Furber
David Lester
The Fresh Breeze Program Execution Model
335(8)
Jack B. Dennis
Guang R. Gao
Xiao X. Meng
Brian Lucas
Joshua Slocum
Using Fast and Accurate Simulation to Explore Hardware/Software Trade-Offs in the Multi-Core Era
343(8)
Wim Heirman
Trevor E. Carlson
Souradip Sarkar
Pieter Ghysels
Wim Vanroose
Lieven Eeckhout
A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems
351(10)
Marek Blazewicz
Steven R. Brandt
Peter Diener
David M. Koppelman
Krzysztof Kurowski
Frank Loffler
Erik Schnetter
Jian Tao
Multicores
The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous Many-Core Architectures
361(8)
Siegfried Benkner
Sabri Pllana
Jesper Larsson Traff
Philippas Tsigas
Andrew Richards
Raymond Namyst
Beverly Bachmayer
Christoph Kessler
David Moloney
Peter Sanders
An Efficient Parallel Set Container for Multicore Architectures
369(8)
Alvaro de Vega
Diego Andrade
Basilio B. Fraguela
Use of High Accuracy and Interval Arithmetic on Multicore Processors
377(8)
Carlos Amaral Holbig
Andriele Busatto do Carmo
Viviane Linck Lara
Luis Paulo Arendt
Engineering Concurrent Software Guided by Statistical Performance Analysis
385(12)
Clemens Grelck
Kevin Hammond
Heinz Hertlein
Philip Holzenspies
Chris Jesshope
Raimund Kirner
Bernd Scheuermann
Alex Shafarenko
Iraneus te Boekhorst
Volkmar Wieser
Numerical Algorithms
Solving the Generalized Symmetric Eigenvalue Problem Using Tile Algorithms on Multicore Architectures
397(8)
Hatem Ltaief
Piotr Luszczek
Azzam Haidar
Jack Dongarra
Improving Performance of Triangular Matrix-Vector BLAS Routines on GPUs
405(8)
Marek Karwacki
Przemyslaw Stpiczynski
Accelerating Grid Kernels for Virtual Screening on Graphics Processing Units
413(8)
Irene Sanchez-Linares
Horacio Perez-Sanchez
Jose Manuel Garcia
Parallelism on the Nonnegative Matrix Factorization
421(8)
Edgardo Mejia-Roa
Carlos Garcia
Jose-Ignacio Gomez
Manuel Prieto
Christian Tenllado
Alberto Pascual-Montano
Francisco Tirado
Exploiting Fine-Grain Parallelism in Recursive LU Factorization
429(8)
Jack Dongarra
Mathieu Faverge
Hatem Ltaief
Piotr Luszczek
Parareal Acceleration of Matrix Multiplication
437(10)
Toshiya Takami
Akira Nishida
Parallel I/O
A First Implementation of Parallel IO in Chapel for Block Data Distribution
447(8)
Rafael Larrosa
Rafael Asenjo
Angeles Navarro
Bradford L. Chamberlain
Optimizations for Two-Phase Collective I/O
455(10)
Michael Kuhn
Julian Kunkel
Yuichi Tsujita
Hidetaka Muguruma
Thomas Ludwig
Performance Modelling and Analysis
JuBE-Based Automatic Testing and Performance Measurement System for Fusion Codes
465(8)
A. Galonska
W. Frings
P. Gibbon
D. Borodin
A. Kirschner
Visualization of MPI(-IO) Datatypes
473(8)
Julian Kunkel
Thomas Ludwig
Open Trace Format 2: The Next Generation of Scalable Trace Formats and Support Libraries
481(10)
Dominic Eschweiler
Michael Wagner
Markus Geimer
Andreas Knupfer
Wolfgang E. Nagel
Felix Wolf
Tools for Analyzing the Behavior and Performance of Parallel Applications
491(8)
Frederik Vandeputte
Benchmarks Based on Anti-Parallel Patterns for the Evaluation of GPUs
499(10)
Jan G. Cornelis
Jan Lemeire
Skeleton Programming
Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems
509(10)
Steffen Ernsting
Herbert Kuchen
Network Monitoring on Multicores with Algorithmic Skeletons
519(10)
M. Danelutto
L. Deri
D. De Sensi
Thread Management
Experience Using Lazy Task Creation in OpenMP Task for the UTS Benchmark
529(8)
Adnan
Mitsuhisa Sato
Folding Applications into High Dimensional Torus Networks
537(8)
Lukas Arnold
Composable Parallelism Foundations in the Intel® Threading Building Blocks Task Scheduler
545(12)
Andrey Marochko
Alexey Kukanov
Industrial Papers
Cray's Approach to Heterogeneous Computing
557(8)
Roberto Ansaloni
Alistair Hart
Integrated Simulation Workflows in Computer Aided Engineering on HPC Resources
565(10)
Florian Niebling
Andreas Kopecki
Martin Aumuller
Mini-Symposium "ParaFPGA"
ParaFPGA 2011 - High Performance Computing with Multiple FPGAs: Design, Methodology and Applications
575(4)
Erik H. D'Hollander
Dirk Stroobandt
Abdellah Touhafi
A Framework for Self-Adaptive Collaborative Computing on Reconfigurable Platforms
579(8)
Michiel W. van Tol
Zdenek Pohl
Milan Tichy
Accelerating HMMER Search Using FPGA Grid
587(8)
Toyokazu Takagi
Tsutomu Maruyama
Reconfigurable Computing Cluster - A Five-Year Perspective of the Project
595(8)
Ron Sass
Andrew G. Schmidt
Scott Buscemi
From Mono-FPGA to Multi-FPGA Emulation Platform for NoC Performance Evaluations
603(8)
Junyan Tan
Virginie Fresse
Frederic Rousseau
A Dynamically Reconfigurable Pattern Matcher for Regular Expressionson FPGA
611(10)
Tom Davidson
Mattias Merlier
Karel Bruneel
Dirk Stroobandt
Mini-Symposium "Exascale"
Hybrid Parallel Programming with MPI/StarSs
621(8)
Jesus Labarta
Vladimir Marjanovic
Eduard Ayguade
Rosa M. Badia
Mateo Valero
GPI - Global Address Space Programming Interface - Experiences on Scalability
629(10)
Mirko Rahn
TEMANEJO - A Debugger for Task Based Parallel Programming Models
639(8)
Steffen Brinkmann
Jose Gracia
Christoph Niethammer
Rainer Keller
Characterizing I/O Performance Using the TAU Performance System
647(10)
Sameer Shende
Allen D. Malony
Wyatt Spear
Karen Schuchardt
Symmetric Rank-k Update on Clusters of Multicore Processors with SMPSs
657(8)
Rosa M. Badia
Jesus Labarta
Vladimir Marjanovic
Alberto F. Martin
Rafael Mayo
Enrique S. Quintana-Orti
Ruyman Reyes
Author Index 665