A QA-Assisted Job Scheduler for Minimizing the Impact of Urgent Computing on HPC System Operation
Nov 26, 2024Citations per year
0 Citations
Abstract: (IEEE)
In recent years, there has been an increase in large-scale natural disasters such as earthquakes, tsunamis, and storms, raising the importance of disaster prevention and mitigation. Thus, extensive studies on urgent computing use High Performance Computing (HPC) systems for rapid simulations and prompt countermeasures. To meet the deadlines of urgent jobs, job schedulers may have some features so that urgent jobs could have a higher priority of execution than other jobs. As a result, urgent job execution could have negative impacts on the execution of other jobs with lower priorities and, hence, on overall HPC system operation. The goal of this paper is to efficiently execute urgent jobs to meet their deadlines while minimizing the negative impacts on HPC system operation. This paper assumes that some of the running jobs can be suspended (and resumed later) to immediately execute an urgent job. Another important assumption is that there is a possibility that the suspended job could be terminated if necessary to meet the deadline, and thus its intermediate computation results are lost if the total memory usage of urgent and suspended jobs exceeds the memory capacity. The proposed method employs quantum annealing or quantum-inspired annealing (QA) techniques to find an appropriate combination of jobs to be suspended so as to minimize the loss of computational results while meeting the deadlines of urgent jobs. The evaluation results show that the proposed method can properly select an appropriate combination of jobs to be suspended so that it can minimize the computational losses. The results also demonstrate that the superiority of the proposed method becomes more remarkable in practical situations where the power-saving feature of HPC systems is enabled.- Annealing
- Storms
- Processor scheduling
- Systems operation
- Prevention and mitigation
- Disasters
- High performance computing
- Resumes
- Quantum annealing
- Tsunami
References(31)
Figures(0)
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [20]
- [21]
- [22]
- [23]