Abstract
Overheads due to synchronisation and loop scheduling are an important factor in determining the performance of shared memory parallel programs. We present set of benchmarks to measure these classes of overhead for language constructs in OpenMP. Results are presented for three different hardware platforms, each with its own implementation of OpenMP. Significant differences are observed, which suggest possible means of improving performance. Keywords--- Benchmarking, OpenMP, synchronisation, scheduling, performance. I. Introduction OpenMP is a relatively new industry standard for shared memory parallel programming. The standard defines a set of directives and library routines for both Fortran 6 and C/C++ 7. For the first time, shared memory parallel programs can be made portable across a wide range of platforms. Synchronisation and loop scheduling can both be significant sources of overhead in shared memory parallel programs. In OpenMP, the cost of these operations is dependent on...
Users
Please
log in to take part in the discussion (add own reviews or comments).