AutoAdmin: Automatic and Dynamic Resource Reservation Admission Control in Hadoop YARN Clusters
Document Type
Article
Publication Date
1-1-2018
Abstract
Hadoop YARN is an Apache Software Foundation's open project that provides a resource management framework for large scale parallel data processing, such as MapReduce jobs. Fair scheduler is a dispatcher which has been widely used in YARN to assign resources fairly and equally to applications. However, there exists a problem of the Fair scheduler when the resource requisition of applications is beyond the amount that the cluster can provide. In such a case, the YARN system will be halted if all resources are occupied by Application Masters, a special task of each job that negotiates resources for processing tasks and coordinates job execution. To solve this problem, we propose an automatic and dynamic admission control mechanism to prevent the ceasing situation happened when the requested amount of resources exceeds the cluster's resource capacity, and dynamically reserve resources for processing tasks in order to obtain good performance, e.g., reducing makespans of MapReduce jobs. After collecting resource usage information of each work node, our mechanism dynamically predicts the amount of reserved resources for processing tasks and automatically controls running jobs based on the prediction. We implement the new mechanism in Hadoop YARN and evaluate it with representative MapReduce benchmarks. The experimental results show the effectiveness and robustness of this mechanism under both homogeneous and heterogeneous workloads.
DOI
10.12694/scpe.v19i1.1393
Montclair State University Digital Commons Citation
Yang, Zhengyu; Bhimani, Janki; Yao, Yi; Lin, Cho Hsien; Wang, Jiayin; Mi, Ningfang; and Sheng, Bo, "AutoAdmin: Automatic and Dynamic Resource Reservation Admission Control in Hadoop YARN Clusters" (2018). Department of Computer Science Faculty Scholarship and Creative Works. 131.
https://digitalcommons.montclair.edu/compusci-facpubs/131