AlDhubhani, Raed and Eassa, Fathy and Saeed, Faisal (2015) Exascale MPI-based program deadlock detection. In: The 1st ICRIL-International Conference on Innovation in Science and Technology (IICIST 2015), 20 April, 2015, Kuala Lumpur, Malaysia.
|
PDF
116kB |
Official URL: http://www.utm.my/iicist/
Abstract
Deadlock detection is one of the main issues of software testing in High Performance Computing (HPC) and also in exascale computing areas in the near future. Developing and testing programs for machines which have millions of cores is not an easy task. HPC program consists of thousands (or millions) of parallel processes which need to communicate with each other in the runtime. Message Passing Interface (MPI) is a standard library which provides this communication capability and it is frequently used in the HPC. Exascale programs are expected to be developed using MPI standard library. For parallel programs, deadlock is one of the expected problems. In this paper, we discussed the deadlock detection for exascale MPI-based programs where the scalability and efficiency are critical issues. The proposed method is implemented to detect and flag the processes and communication commands which are potential to cause deadlocks in a scalable and efficient manner. MPI benchmark programs were used to test the propose method.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | exascale systems, message passing interface |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computing |
ID Code: | 62001 |
Deposited By: | Fazli Masari |
Deposited On: | 30 May 2017 00:21 |
Last Modified: | 30 May 2017 00:21 |
Repository Staff Only: item control page