In this paper, we provide a review of the fault tolerance and reliability assessing techniques of mesh MCSs. A number of fault-tolerant routing techniques are analyzed including the dimension ordering, the turn model, and the block-fault model. In reliability analysis, we consider the sub-mesh reliability exact and approximate models and the task-based reliability computation. It is expected that some of the techniques and algorithms covered in this paper can have applications in the domain of wireless mesh networks. Abstract Multi-computer systems MCSs are efficient in solving computing-intensive problems.
How to Cite this Article? Mostafa Abd-el-barr References . Allen, F. Blue Gene: a vision for protein science using a Petaflop Supercomputer. Almohammad, B. Fault-tolerant communication algorithms in toroidal networks. Al-Tawil, K. Ashraf, F. Introduction to Routing in Multicomputer Networks.
Bataineh, S. Reliability of mesh and torus topologies in the presence of faults. Journal of Telecommunication Systems, 10 , pp. Boppana, R. Fault-tolerant wormhole routing algorithms for mesh networks. Chalasani, S. Communication in multicomputers with nonconvex faults.
Chang, C. Chen, C-L. A fault-tolerant routing scheme for meshes with nonconvex faults.
Chen, J. Journal of Parallel and Distributed Computing, Vol. Chirivlla, V. Accurate reliability and availability models for direct interconnection networks. Chiu, G-M. The odd-even turn model for adaptive routing. Cray Research Inc. Also, Karamcheti, V. Dally, W. Principles and Practices of Interconnection Networks. Elsevier publications, New York.
De Mello, A. Technical Report Series, No. Duato, J. Fahmy, H.
Farahabady, M. Journal of Parallel Computing, 32 , December , pp. Fillo, M. The M-Machine Multicomputer. Gaugh, P. Glass, C. The turn model for adaptive routing. Proceedings of the 9th annual International Symposium on Computer Architecture, , pp.
Reliability of Computer Systems and Networks: Fault Tolerance, Analysis, and .. Fault-tolerant computing is a generic term describing redundant design tech-. Reliability of Computer Systems and Networks: Fault Tolerance, Analysis, and Design. Author(s). Martin L. Shooman Ph.D.,. First published
Gomez, M. Groscup, W. Ho, C.
Holland, G. Fault-Tolerant Multiprocessor Systems. Boland, A Reliability Comparison. Bonthond, J. Dieperink, L. Jansson, E. Bouissou and J. Bowles, A Survey of Reliability. Cassandras and S.
Kluwer Academic. Introduction to Discrete Events Systems Publisher, Choi, V. Kulkarni and K. Ajmone Marsan ed. Choi, B. Johnson and J. Profeta III.
Ciapala, F. Rodriguez-Mateos, R. Schmidt and J. Ciardo, R. Marie, B. Sericola and K. German and C. Coffman and E. Dugan, S.
Bavuso and M. Filippini and A. Filippini, B. Dehning, G. Guaglio, F. Schmidt, B. Todd, J. Uythoven, A. Vergara-Fernandez, M. German, D. Logothetis, and K. Goble and J.