Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/97591
Title: | The design of a fault management framework for cloud | Authors: | Chalermarrewong, Thanyalak Achalakul, Tiranee See, Simon Chong Wee |
Issue Date: | 2012 | Source: | Chalermarrewong, T., Achalakul, T., & See, S. C. W. (2012). The design of a fault management framework for cloud. 2012 9th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON). | Conference: | International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (9th : 2012 : Phetchaburi, Thailand) | Abstract: | High performance computing systems can have high failure rates as they feature a large number of servers and components with intensive workload. The availability of the system can be easily compromised if the failure of these subsystems is not handled correctly. This research proposes a framework of proactive fault tolerance for enterprise cloud computing systems. The main idea is to create an effective prediction model focusing on hardware failure. The proposed framework features two major components: monitoring and availability analysis. For each machine, the availability analysis module tracks historical states, and predicts the machine future state. Depending on the predicted state, the resource manager decides whether the machine requires task migration to prevent possible losses. By using task migration, the framework eliminates the cost of job replication and back up. The framework also includes the adequacy checking function into availability analysis in order to periodically evaluate and adjust the prediction model. The framework can thus be adopted by heterogeneous datacenters. The energy efficiency can be improved as the impact of the failure to the datacenters reduces. | URI: | https://hdl.handle.net/10356/97591 http://hdl.handle.net/10220/11862 |
DOI: | 10.1109/ECTICon.2012.6254358 | Schools: | School of Mechanical and Aerospace Engineering | Rights: | © 2012 IEEE. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | MAE Conference Papers |
SCOPUSTM
Citations
20
18
Updated on Apr 28, 2025
Page view(s) 50
629
Updated on May 4, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.