Hands on with OpenMP4.5 and Unified Memory: Developing Applications for IBM's Hybrid CPU + GPU Systems (Part I) | 3 | (14)
Hands on with OpenMP4.5 and Unified Memory: Developing Applications for IBM's Hybrid CPU + GPU Systems (Part II) | 17 | (16)

Advanced Implementations and Extensions
Leveraging OpenMP 4.5 Support in CLANG for Fortran | 33 | (15)
Compiling and Optimizing OpenMP 4.X Programs to OpenCL and SPIR | 48 | (14)
Extending OpenMP SIMD Support for Target Specific Code and Application to ARM SVE | 62 | (15)

OpenMP Application Studies
OpenMP Tasking and MPI in a Lattice QCD Benchmark | 77 | (15)
On the Performance of Parallel Tasking Runtimes for an Irregular Fast Multipole Method Application | 92 | (15)
Porting VASP from MPI to MPI+OpenMP (SIMD): Optimization Strategies, Insights and Feature Proposals | 107 | (16)
OpenMP* SIMD Vectorization and Threading of the Elmer Finite Element Software | 123 | (18)

Analyzing and Extending Tasking
Extending OMPT to Support Grain Graphs | 141 | (15)
Patterns for OpenMP Task Data Dependency Overhead Measurements | 156 | (13)
Adaptive and Architecture-Independent Task Granularity for Recursive Applications | 169 | (16)

OpenMP 4 Application Evaluation
The Productivity, Portability and Performance of OpenMP 4.5 for Scientific Applications Targeting Intel CPUs, IBM CPUs, and NVIDIA GPUs | 185 | (18)

Extended Parallelism Models
User Co-scheduling for MPI+OpenMP Applications Using OpenMP Semantics | 203 | (14)
Asynchronous OpenMP Tasking with Easy Thread Context Switching and Pool Control | 217 | (14)
A Functional Safety OpenMP* for Critical Real-Time Embedded Systems | 231 | (18)

Performance Analysis and Tools
OpenMP Tools Interface: Synchronization Information for Data Race Detection | 249 | (17)
Accurate and Complete Hardware Profiling for OpenMP: Multiplexing Hardware Events Across Executions | 266 | (15)
OpenMP® Runtime Instrumentation for Optimization | 281 | (15)
Assessing the Performance of OpenMP Programs on the Knights Landing Architecture | 296 | (15)

Advanced Data Management with OpenMP
Double Buffering for MCDRAM on Second Generation Intel® Xeon Phi™ Processors with OpenMP | 311 | (14)
A Pattern for Overlapping Communication and Computation with OpenMP* Target Directives | 325 | (13)
Custom Data Mapping for Composable Data Management | 338 | (11)

Author Index | 349