Conference proceedings
A systematic review of fault tolerance solutions for communication errors in open source cloud computing
Date
2020-06-01
Registered in:
Iberian Conference on Information Systems and Technologies, CISTI, v. 2020-June.
2166-0735
2166-0727
10.23919/CISTI49556.2020.9140933
2-s2.0-85089036532
Author
Universidade Estadual Paulista (Unesp)
Humber Institute of Technology and Advanced Learning
Institution
Abstract
Cloud systems, like any other system, must be reliable. This means that the system should respond correctly in the presence of failures, which are quite probable in a distributed system of largely independent components, as cloud systems are. It is therefore important that cloud systems be fault tolerant and recover safely from failures. Because failures in clouds may arise from several different sources, with communication failures playing a major role, the techniques that can be applied to ensure reliability also vary widely. This survey presents a systematic review of solutions for providing fault tolerance in open source clouds. Our goal with this review is to give cloud managers a guided approach to choosing a solution for a given problem or system.