Fault-Tolerant Bandwidth Reservation Strategies for Data Transfers in High-Performance Networks
Many next-generation e-science applications require fast and reliable transfer of large volumes of data with guaranteed performance, which is typically enabled by the bandwidth reservation service in high-performance networks. One prominent issue in such network environments with large footprints is that node and link failures are inevitable, hence potentially degrading the quality of data transfer. We consider two generic types of bandwidth reservation requests (BRRs) concerning data transfer reliability: (i) to achieve the highest data transfer reliability under a given data transfer deadline, and (ii) to achieve the earliest data transfer completion time while satisfying a given data transfer reliability requirement. We propose two periodic bandwidth reservation algorithms with rigorous optimality proofs to optimize the scheduling of individual BRRs within BRR batches. The efficacy of the proposed algorithms is illustrated through extensive simulations in comparison with scheduling algorithms widely adopted in production networks in terms of various performance metrics.
MSU Digital Commons Citation
Zuo, Liudong; Zhu, Michelle; Wu, Chase Q.; and Zurawski, Jason, "Fault-Tolerant Bandwidth Reservation Strategies for Data Transfers in High-Performance Networks" (2017). Department of Computer Science Faculty Scholarship and Creative Works. 288.