Artículos de revistas
Discuss And Fuse: Considering Modularity, Genericness, And Adaptation In The Development Of Consensus And Fault Detection Services
Registro en:
Lecture Notes In Computer Science (including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics). , v. 2847, n. , p. 234 - 253, 2003.
3029743
2-s2.0-0142156606
Autor
Camargos L.J.
Madeira E.R.M.
Institución
Resumen
Although fault tolerant systems are badly needed, their development is not a trivial task. Considering fault tolerant distributed systems, this difficulty is even bigger, and any artifact that could make this task easier becomes highly valuable. In this paper, we propose and model a distributed consensus service and a fault detection service, namely, DisCusS and FuSe, that can be used as building blocks in the development of distributed fault tolerant applications. We also show the compliance of FuSe to FT-CORBA fault detection, and give some insights on the use of the proposed consensus service in a possible FT-CORBA implementation. Moreover, this paper presents some comparative tests of the influence of adaptive and non-adaptive fault detectors over consensus. © Springer-Verlag Berlin Heidelberg 2003. 2847
234 253 Oestereich, B., (1999) Developing Software with UML, , Addison-Wesley, Harlow, England Marin, O., Bertier, M., Pierre, S., Implementation and performance evaluation of an adaptable failure detector (2002) Proc. of the Int. Conference on Dependable Systems and Networks (DSN'02), p. 354. , Washington, D.C., USA, June Brasileiro, F., Greve, F., Mostefaoui, A., Raynal, M., Consensus in one communication step (2001) LNCS, pp. 42-50. , 6th Parallel Computing Technologies, 6th International Conference, PaCT 2001, number 2127 Barcelona, Spain, September Springer Verlag Chandra, T.D., Toueg, S., Unreliable failure detectors for reliable distributed systems (1996) Journal of the ACM, 43 (2), pp. 225-267 Chen, W., Toueg, S., Aguilera, M.K., On the quality of service of failure detectors (2002) IEEE Transactions on Computers, 51 (5), pp. 561-580 Dolev, D., Dwork, C., Stockmeyer, L., On the minimal synchronism needed for distributed consensus (1987) Journal of the ACM (JACM), 34 (1), pp. 77-97 Dwork, C., Lynch, N., Stockmeyer, L., Consensus in the presence of partial synchrony (1988) Journal of the ACM (JACM), 35 (2), pp. 288-323 Esteffanel, L.A.B., Jansch-Pôrto, I., On the evaluation of failure detectors performance (2001) Proc. of IX Brazilian Symposium of Fault Tolerant Computing, , Florianopolis, Brazil, March Felber, P., (1998) The CORBA Object Group Service: A Service Approach to Object Groups in CORBA, , PhD thesis, École Polytechnique Fédérale de Lausanne, Switzerland Fischer, M.J., Lynch, N.A., Paterson, M.S., Impossibility of distributed consensus with one faulty process (1985) Journal of the ACM (JACM), 32 (2), pp. 374-382 Gamma, E., Helm, R., Johnson, R., Vlissides, J., (1994) Design Patterns: Elements of Reusable Object-Oriented Software, , Addison-Wesley Professional Computing Series. Addison-Wesley Gartner, F.C., Fundamentals of fault-tolerant distributed computing in asynchronous environments (1999) ACM Computing Surveys, 31 (1), pp. 1-26 Guerraoui, R., Revisiting the relationship between non-blocking atomic commitment and consensus (1995) LNCS, pp. 87-100. , Proc. of the 9th International Workshop on Distributed Algorithms (WDAG-9), number 972 Le Mont-St-Michel, France, September Springer-Verlag Guerraoui, R., Oliveira, R., Schiper, A., (1998) Stubborn Communication Channels, , Technical Report 98/272, École Polytechnique Fédérale de Lausanne, Switzerland, March Guerraoui, R., Schiper, A., The generic consensus service (2001) IEEE Transactions on Software Engineering, 27 (1), pp. 29-41. , January Lamport, L., The part-time parliament (1998) ACM Transactions on Computer Systems (TOCS), 16 (2), pp. 133-169 Lung, C., Fraga, J., Farines, J.-M., Ogg, M., Ricciardi, A., Cosnamingft - A fault-tolerant corba naming service (1999) Proc. 18th IEEE International Symposium on Reliable Distributed Systems (SRDS'99), pp. 254-262. , IEEE Computer Society Hurfin, M., Macêdo, R., Raynal, M., Tronel, F., A general framework to solve agreement problems (1999) Proc. 18th IEEE International Symposium on Reliable Distributed Systems (SRDS'99), pp. 55-65. , IEEE Computer Society (2001) The Common Object Request Broker: Architecture and Specification, , Technical Report 2.6, Object Management Group, December (2001) Fault Tolerant Corba. Corba 2.6, , Technical Report 2.6, Object Management Group, December Schiper, A., Early consensus in an asynchronous system with a weak failure detector (1997) Distributed Computing, 10 (3), pp. 149-157 Schmidt, D.C., (2002), http://www.cs.wustl.edu/schmidt/tao.html, Internet site, DecSergent, N., Défago, X., Schiper, A., Impact of a failure detection mechanism on the performance of consensus (2001) Proc. IEEE Pacific Rim Symp. on Dependable Computing (PRDC), , Seoul, Korea, December Sommerlad, P., Buschmann, F., Manager design pattern (1996) 3rd Annual PLoP, , Allenton Park, Illinois, September Sotoma, I., Madeira, E.R.M., ADAPTATION - Algorithms to ADAPTive FAuIT MonItOriNg and Their Implementation on CORBA (2001) Proc. IEEE of the 3rd International Symposium on Distributed Objects and Applications (DOA01), pp. 219-228. , Rome, Italy, September Sotoma, I., Madeira, E.R.M., DPCP(Discard Past Consider Present) - A Novel Approach to Adaptive Fault Detection in Distributed Systems (2001) Proc. of the 8th IEEE Workshop on Future Trends of Distributed Computing Systems (FTDCS2001), pp. 76-82. , Bologna, Italy, November Urbán, P., Défago, X., Schiper, A., Contention-aware metrics for distributed algorithms: Comparison of atomic broadcast algorithms (2000) Proc. of the 9th IEEE International Conference on Computer Communications and Networks (IC3N 2000), pp. 582-589. , October