A fast parallel graph partitioner for shared-memory inspector/executor strategies
CD Krieger, MM Strout - Languages and Compilers for Parallel Computing …, 2013 - Springer
Languages and Compilers for Parallel Computing: 25th International Workshop …, 2013•Springer
Graph partitioners play an important role in many parallel work distribution and locality
optimization approaches. Surprisingly, however, to our knowledge there is no freely
available parallel graph partitioner designed for execution on a shared memory multicore
system. This paper presents a shared memory parallel graph partitioner, ParCubed, for use
in the context of sparse tiling run-time data and computation reordering. Sparse tiling is a run-
time scheduling technique that schedules groups of iterations across loops together when …
optimization approaches. Surprisingly, however, to our knowledge there is no freely
available parallel graph partitioner designed for execution on a shared memory multicore
system. This paper presents a shared memory parallel graph partitioner, ParCubed, for use
in the context of sparse tiling run-time data and computation reordering. Sparse tiling is a run-
time scheduling technique that schedules groups of iterations across loops together when …
Abstract
Graph partitioners play an important role in many parallel work distribution and locality optimization approaches. Surprisingly, however, to our knowledge there is no freely available parallel graph partitioner designed for execution on a shared memory multicore system. This paper presents a shared memory parallel graph partitioner, ParCubed, for use in the context of sparse tiling run-time data and computation reordering. Sparse tiling is a run-time scheduling technique that schedules groups of iterations across loops together when they access the same data and one or more of the loops contains indirect array accesses. For sparse tiling, which is implemented with an inspector/executor strategy, the inspector needs to find an initial seed partitioning of adequate quality very quickly. We compare our presented hierarchical clustering partitioner, ParCubed, with GPart and METIS in terms of partitioning speed, partitioning quality, and the effect the generated seed partitions have on executor speed. We find that the presented partitioner is 25 to 100 times faster than METIS on a 16 core machine. The total edge cut of the partitioning generated by ParCubed was found not to exceed 1.27x that of the partitioning found by METIS.
Springer
Showing the best result for this search. See all results