Parallel Computing: On the Road to Exascale [Hardback]

  • Format: Hardback, 872 pages
  • Series: Advances in Parallel Computing
  • Publication date: 29-Apr-2016
  • Publisher: IOS Press, US
  • ISBN-10: 1614996202
  • ISBN-13: 9781614996200
As predicted by Gordon E. Moore in 1965, the performance of computer processors increased at an exponential rate. Nevertheless, the increases in computing speeds of single processor machines were eventually curtailed by physical constraints. This led to the development of parallel computing, and whilst progress has been made in this field, the complexities of parallel algorithm design, the deficiencies of the available software development tools and the complexity of scheduling tasks over thousands and even millions of processing nodes represent a major challenge to the construction and use of more powerful parallel systems. This book presents the proceedings of the biennial International Conference on Parallel Computing (ParCo2015), held in Edinburgh, Scotland, in September 2015. Topics covered include computer architecture and performance, programming models and methods, as well as applications. The book also includes two invited talks and a number of mini-symposia. Exascale computing holds enormous promise in terms of increasing scientific knowledge acquisition and thus contributing to the future well-being and prosperity of mankind. A number of innovative approaches to the development and use of future high-performance and high-throughput systems are to be found in this book, which will be of interest to all those whose work involves the handling and processing of large amounts of data.
Preface v
Gerhard Joubert
Hugh Leather
Mark Parsons
Frans Peters
Mark Sawyer
Conference Organisation vii
Invited Talks
Bio-Inspired Massively-Parallel Computation
3(8)
Steve Furber
Automatic Tuning of Task Scheduling Policies on Multicore Architectures
11(14)
Akshatha Bhat
Andrew Lenharth
Donald Nguyen
Qing Yi
Keshav Pingali
Architectures and Performance
Algorithms
Algorithmic Scheme for Hybrid Computing with CPU, Xeon-Phi/MIC and GPU Devices on a Single Machine
25(10)
Sylvain Contassot-Vivier
Stephan Vialle
A Many-Core Machine Model for Designing Algorithms with Minimum Parallelism Overheads
35(10)
Sardar Anisul Haque
Marc Moreno Maza
Ning Xie
Performance
CPU Performance Analysis Using Score-P on PRIMEHPC FX100 Supercomputer
45(8)
Tomotake Nakamura
Performance Improvements of Polydisperse DEM Simulations Using a Loose Octree Approach
53(10)
G. Stein
S. Wirtz
V. Scherer
Execution Performance Analysis of the ABySS Genome Sequence Assembler Using Scalasca on the K Computer
63(10)
Itaru Kitayama
Brian J.N. Wylie
Toshiyuki Maeda
Performance Model Based on Memory Footprint for OpenMP Memory Bound Applications
73(10)
Cesar Allande
Josep Jorba
Anna Sikora
Eduardo Cesar
Evaluating OpenMP Performance on Thousands of Cores on the Numascale Architecture
83(10)
Dirk Schmidl
Atle Vesterkjoer
Matthias S. Muller
Acceleration of Large Scale OpenFOAM Simulations on Distributed Systems with Multicore CPUs and GPUs
93(10)
Boris Krasnopolsky
Alexey Medvedev
Optimized Variant-Selection Code Generation for Loops on Heterogeneous Multicore Systems
103(10)
Erik Hansson
Christoph Kessler
MPI Communication on MPPA Many-Core NoC: Design, Modeling and Performance Issues
113(10)
Minh Quan Ho
Bernard Tourancheau
Christian Obrecht
Benoit Dupont de Dinechin
Jerome Reybert
Drivers for Device to Device Streaming
123(12)
Dominic Eschweiler
Volker Lindenstruth
Programming Models and Methods
Skeletons
Portable Parallelization of the EDGE CFD Application for GPU-Based Systems Using the SkePU Skeleton Programming Library
135(10)
Oskar Sjostrom
Soon-Heum Ko
Usman Dastgeer
Lu Li
Christoph W. Kessler
Structured Parallel Implementation of Tree Echo State Network Model Selection
145(10)
Marco Danelutto
Claudio Gallicchio
Alessio Micheli
Massimo Torquati
Daniele Virgilio
Java Implementation of Data Parallel Skeletons on GPUs
155(10)
Steffen Ernsting
Herbert Kuchen
Data Parallel Patterns in Erlang/OpenCL
165(10)
Ugo Albanese
Marco Danelutto
Hybrid Coarrays: A PGAS Feature for Many-Core Architectures
175(10)
Valeria Cardellini
Alessandro Fanfarillo
Salvatore Filippone
Damian Rouson
Lapedo: Hybrid Skeletons for Programming Heterogeneous Multicore Machines in Erlang
185(12)
Vladimir Janjic
Christopher Brown
Kevin Hammond
Accelerators
Evaluation of 3-D Stencil Codes on the Intel Xeon Phi Coprocessor
197(10)
Mario Hernandez
Juan M. Cebrian
Jose M. Cecilia
Jose M. Garcia
Hierarchical Parallelism in a Physical Modelling Synthesis Code
207(10)
James Perry
Stefan Bilbao
Alberto Torin
Harnessing CUDA Dynamic Parallelism for the Solution of Sparse Linear Systems
217(10)
Jose Aliaga
Davor Davidovic
Joaquin Perez
Enrique S. Quintana-Orti
Model-Driven Development of GPU Applications
227(10)
Christoph Winter
Jan Dunnweber
Exploring the Offload Execution Model in the Intel Xeon Phi via Matrix Inversion
237(10)
Peter Benner
Pablo Ezzatti
Enrique S. Quintana-Orti
Alfredo Remon
Programming GPUs with C++14 and Just-In-Time Compilation
247(10)
Michael Haidl
Bastian Hagedorn
Sergei Gorlatch
Resource Management
Active Packet Pacing as a Congestion Avoidance Technique in Interconnection Network
257(8)
Hidetomo Shibamura
Hybrid Parallelization of Hyper-Dimensional Vlasov Code with OpenMP Loop Collapse Directive
265(10)
Takayuki Umeda
Keiichiro Fukazawa
Active Resource Management for Multi-Core Runtime Systems Serving Malleable Applications
275(10)
Clemens Grelck
Improving Energy-Efficiency of Static Schedules by Core Consolidation and Switching Off Unused Cores
285(10)
Nicolas Melot
Christoph Kessler
Jorg Keller
Efficient Parallel Linked List Processing
295(10)
Ashkan Tousimojarad
Wim Vanderbauwhede
Streams
Streams as an Alternative to Halo Exchange
305(12)
Daniel J. Holmes
Caoimhin Laoide-Kemp
An Embedded C++ Domain-Specific Language for Stream Parallelism
317(10)
Dalvan Griebler
Marco Danelutto
Massimo Torquati
Luiz Gustavo Fernandes
Pipeline Template for Streaming Applications on Heterogeneous Chips
327(12)
Andres Rodriguez
Angeles Navarro
Rafael Asenjo
Francisco Corbera
Antonio Vilches
Maria Garzaran
Applications
Numerical Methods
Efficient and Scalable Distributed-Memory Hierarchization Algorithms for the Sparse Grid Combination Technique
339(10)
Mario Heene
Dirk Pfluger
Adapting a Finite-Element Type Solver for Bioelectromagnetics to the DEEP-ER Platform
349(12)
Raphael Leger
Damian A. Mallon
Alejandro Duran
Stephane Lanteri
High Performance Eigenvalue Solver in Exact-Diagonalization Method for Hubbard Model on CUDA GPU
361(10)
Susumu Yamada
Toshiyuki Imamura
Masahiko Machida
A General Tridiagonal Solver for Coprocessors: Adapting g-Spike for the Intel Xeon Phi
371(10)
Ioannis E. Venetis
Alexandros Sobczyk
Alexandros Kouris
Alexandros Nakos
Nikolaos Nikoloutsakos
Efstratios Gallopoulos
CAHTR: Communication-Avoiding Householder TRidiagonalization
381(10)
Toshiyuki Imamura
Takeshi Fukaya
Yusuke Hirota
Susumu Yamada
Masahiko Machida
Simulation of External Aerodynamics of the DrivAer Model with the LBM on GPGPUs
391(10)
Andrea Pasquali
Martin Schonherr
Martin Geier
Manfred Krafczyk
Algorithms
A Parallel Algorithm for Decomposition of Finite Languages
401(10)
Tomasz Jastrząb
Zbigniew J. Czech
Wojciech Wieczorek
Exploiting the Space Filling Curve Ordering of Particles in the Neighbour Search of Gadget3
411(10)
Antonio Ragagnin
Nikola Tchipev
Michael Bader
Klaus Dolag
Nicolay Hammer
On-the-Fly Memory Compression for Multibody Algorithms
421(10)
Wolfgang Eckhardt
Robert Glas
Denys Korzh
Stefan Wallner
Tobias Weinzierl
Flexible and Generic Workflow Management
431(8)
Sebastian Luhrs
Daniel Rohe
Alexander Schnurpfeil
Kay Thust
Wolfgang Frings
A Massively Parallel Barnes-Hut Tree Code with Dual Tree Traversal
439(10)
Benedikt Steinbusch
Marvin-Lucas Henkel
Mathias Winkel
Paul Gibbon
Flow Problems
Performance Modeling of a Compressible Hydrodynamics Solver on Multicore CPUs
449(10)
Raphael Poncet
Mathieu Peybernes
Thibault Gasc
Florian De Vuyst
Developing a Scalable and Flexible High-Resolution DNS Code for Two-Phase Flows
459(10)
Iain Bethune
Antonia B.K. Collis
Lennon Ó Náraigh
David Scott
Prashant Valluri
FPGA Port of a Large Scientific Model from Legacy Code: The Emanuel Convection Scheme
469(10)
Kristian Thorin Hentschel
Wim Vanderbauwhede
Syed Waqar Nabi
Data Science
How to Keep a Geographic Map Up-To-Date
479(10)
Marco Grebe
Tilman Lacko
Rita Loogen
Static and Dynamic Big Data Partitioning on Apache Spark
489(12)
Massimiliano Bertolucci
Emanuele Carlini
Patrizio Dazzi
Alessandro Lulli
Laura Ricci
Mini-Symposium: ParaFPGA-2015: Parallel Computing with FPGAs
ParaFPGA15: Exploring Threads and Trends in Programmable Hardware
501(4)
Erik H. D'Hollander
Dirk Stroobandt
Abdellah Touhafi
FPGAs as Components in Heterogeneous High-Performance Computing Systems: Raising the Abstraction Level
505(10)
Wim Vanderbauwhede
Syed Waqar Nabi
FPGA Acceleration of SAT Preprocessor
515(10)
Masayuki Suzuki
Tsutomu Maruyama
Leveraging FPGA Clusters for SAT Computations
525(8)
Michal Kouril
High-Speed Calculation of Convex Hull in 2D Images Using FPGA
533(10)
Kenji Kanazawa
Kahori Kemmotsu
Yamato Mori
Noriyuki Aibe
Moritoshi Yasunaga
Workload Distribution and Balancing in FPGAs and CPUs with OpenCL and TBB
543(10)
Rafael Asenjo
Angeles Navarro
Andres Rodriguez
Jose Nunez-Yanez
A Run-Time System for Partially Reconfigurable FPGAs: The Case of STMicroelectronics SPEAr Board
553(10)
George Charitopoulos
Dionisios Pnevmatikatos
Marco D. Santambrogio
Kyprianos Papadimitriou
Danilo Pau
Exploring Automatically Generated Platforms in High Performance FPGAs
563(10)
Panagiotis Skrimponis
Georgios Zindros
Ioannis Parnassos
Muhsen Owaida
Nikolaos Bellas
Paolo Ienne
Mini-Symposium: Experiences of Porting and Optimising Code for Xeon Phi Processors
Symposium on Experiences of Porting and Optimising Code for Xeon Phi Processors
573(2)
Adrian Jackson
Michele Weiland
Mark Parsons
Simon McIntosh-Smith
Experiences Porting Production Codes to Xeon Phi Processors
575(10)
Emmanouil Farsarakis
Adrian Jackson
Fiona Reid
David Scott
Michele Weiland
Preparing a Seismic Imaging Code for the Intel Knights Landing Xeon Phi Processor
585(6)
Gilles Civario
Sean Delaney
Michael Lysaght
LU Factorisation on Xeon and Xeon Phi Processors
591(12)
Adrian Jackson
Mateusz Iwo Dubaniowski
Mini-Symposium: Coordination Programming
Mini-Symposium on Coordination Programming - Preface
603(2)
Clemens Grelck
Alex Shafarenko
Claud: Coordination, Locality and Universal Distribution
605(10)
Jossekin Beilharz
Frank Feinbube
Felix Eberhardt
Max Plauth
Andreas Polze
Coordination with Structured Composition for Cyber-Physical Systems
615(12)
Simon Maurer
Raimund Kirner
Mini-Symposium: Symposium on Parallel Solvers for Very Large PDE Based Systems in the Earth- and Atmospheric Sciences
On Efficient Time Stepping Using the Discontinuous Galerkin Method for Numerical Weather Prediction
627(10)
Andreas Dedner
Robert Klofkorn
Porting the COSMO Dynamical Core to Heterogeneous Platforms Using STELLA Library
637(10)
Carlos Osuna
Oliver Fuhrer
Tobias Gysi
Thomas C. Schulthess
Towards Compiler-Agnostic Performance in Finite-Difference Codes
647(14)
A.R. Porter
R.W. Ford
M. Ashworth
G.D. Riley
M. Modani
Mini-Symposium: Is the Programming Environment Ready for Hybrid Supercomputers?
Is the Programming Environment Ready for Hybrid Supercomputers?
661(2)
Alistair Hart
Harvey Richardson
Utilizing Hybrid Programming Environments: CSCS Case Studies
663(10)
William Sawyer
Anton Kozhevnikov
Raffaele Solca
SYCL: Single-Source C++ Accelerator Programming
673(10)
Ruyman Reyes
Victor Lomuller
Using Task-Based Parallelism Directly on the GPU for Automated Asynchronous Data Transfer
683(14)
Aidan B.G. Chalk
Pedro Gonnet
Matthieu Schaller
A Strategy for Developing a Performance Portable Highly Scalable Application
697(12)
Michael Neff
Stefan Andersson
Aaron Vose
John Levesque
Mini-Symposium: Symposium on Energy and Resilience in Parallel Programming
Mini-Symposium on Energy and Resilience in Parallel Programming
709(2)
Dimitrios S. Nikolopoulos
Christos D. Antonopoulos
Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures
711(10)
Jose I. Aliaga
Sandra Catalan
Charalampos Chalios
Dimitrios S. Nikolopoulos
Enrique S. Quintana-Orti
Compiling for Resilience: The Performance Gap
721(10)
Norman A. Rink
Dmitrii Kuvaiskii
Jeronimo Castrillon
Christof Fetzer
Automation of Significance Analyses with Interval Splitting
731(10)
J. Deussen
J. Riehme
U. Naumann
Energy Minimization on Heterogeneous Systems Through Approximate Computing
741(12)
Michalis Spyrou
Christos Kalogirou
Christos Konstantas
Panos Koutsovasilis
Manolis Maroudas
Christos D. Antonopoulos
Nikolaos Bellas
Landing Containment Domains on SWARM: Toward a Robust Resiliency Solution on a Dynamic Adaptive Runtime Machine
753(12)
Sam Kaplan
Sergio Pino
Aaron M. Landwehr
Guang R. Gao
Mini-Symposium: Symposium on Multi-System Application Extreme-Scaling Imperative
MAXI - Multi-System Application Extreme-Scaling Imperative
765(2)
Dirk Brommel
Wolfgang Frings
Brian J.N. Wylie
High Throughput Simulations of Two-Phase Flows on Blue Gene/Q
767(10)
Panagiotis Hadjidoukas
Diego Rossinelli
Fabian Wermelinger
Jonas Sukys
Ursula Rasthofer
Christian Conti
Babak Hejazialhosseini
Petros Koumoutsakos
Direct Numerical Simulation of Fluid Turbulence at Extreme Scale with psOpen
777(10)
Jens Henrik Goebbert
Michael Gauding
Cedrick Ansorge
Bernd Hentschel
Torsten Kuhlen
Heinz Pitsch
Simulating Morphologically Detailed Neuronal Networks at Extreme Scale
787(10)
Aleksandr Ovcharenko
Pramod Kumbhar
Michael Hines
Francesco Cremonesi
Timothee Ewart
Stuart Yates
Felix Schumann
Fabien Delalondre
FE2TI: Computational Scale Bridging for Dual-Phase Steels
797(10)
Axel Klawonn
Martin Lanser
Oliver Rheinbach
Performance Evaluation of the LBM Solver Musubi on Various HPC Architectures
807(10)
Jiaxing Qi
Kartik Jain
Harald Klimach
Sabine Roller
Extreme-Scaling Applications 24/7 on JUQUEEN Blue Gene/Q
817(10)
Dirk Brommel
Wolfgang Frings
Brian J.N. Wylie
Extreme Scale-Out SuperMUC Phase 2 - Lessons Learned
827(10)
Nicolay Hammer
Ferdinand Jamitzky
Helmut Satzger
Momme Allalen
Alexander Block
Anupam Karmakar
Matthias Brehm
Reinhold Bader
Luigi Iapichino
Antonio Ragagnin
Vasilios Karakasis
Dieter Kranzmuller
Arndt Bode
Herbert Huber
Martin Kuhn
Rui Machado
Daniel Grunwald
Philipp V.F. Edelmann
Friedrich K. Ropke
Markus Wittmann
Thomas Zeiser
Gerhard Wellein
Gerald Mathias
Magnus Schworer
Konstantin Lorenzen
Christoph Federrath
Ralf Klessen
Karl-Ulrich Bamberg
Hartmut Ruhl
Florian Schornbaum
Martin Bauer
Anand Nikhil
Jiaxing Qi
Harald Klimach
Hinnerk Stuben
Abhishek Deshmukh
Tobias Falkenstein
Klaus Dolag
Margarita Petkova
"K-scale" Applications on the K Computer and Co-Design Effort for the Development of "post-K"
837(10)
Miwako Tsuji
Author Index 847