Abstract
The main goal of this paper is to demonstrate that the newest generation of NEC SX-Aurora TSUBASA architecture can perform large-scale graph processing extremely efficiently. This paper proposes approaches, which can be used for the development of high-performance vector-oriented implementations of page rank and shortest paths algorithms, including vectorised graph storage format, efficient vector-friendly graph traversals, optimised cache-aware memory accesses and efficient load-balancing. The developed implementations are optimised according to the most important features and properties of SX-Aurora architecture, which allows them achieve up to 15 times better performance compared to the optimised Intel Skylake parallel implementations and up to 5 times better performance compared to NVGRAPH library implementations for Pascal GPU architecture.
Original language | English |
---|---|
Pages (from-to) | 1753-1762 |
Number of pages | 10 |
Journal | Lobachevskii Journal of Mathematics |
Volume | 40 |
Issue number | 11 |
DOIs | |
Publication status | Published - 2019 Nov 1 |
Keywords
- graph algorithms
- NEC SX-Aurora TSUBASA
- Page Rank
- SSSP