Software-based fault-tolerant routing algorithm in multi-dimensional networks

F. Safaei, M. Rezazad, A. Khonsari, M. Fathy, M. Ould-Khaoua, N. Alzeidi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

Massively parallel computing systems are being built with hundreds or thousands of components such as nodes, links, memories, and connectors. The failure of a component in such systems not only reduces the computational power but also alters the network's topology. The Software-Based fault-tolerant routing algorithm is a popular routing scheme for achieving fault tolerance in networks. The algorithm was initially proposed only for two-dimensional networks [1]. Since higher-dimensional networks are widely employed in many contemporary massively parallel systems, this paper proposes an approach to extend this routing scheme to higher-dimensional networks. Deadlock and livelock freedom and the performance of the presented algorithm are investigated for networks with different dimensionality and various fault regions. Performance results are presented through simulation experiments.

Original language: English
Title of host publication: 20th International Parallel and Distributed Processing Symposium, IPDPS 2006
Volume: 2006
DOIs: https://doi.org/10.1109/IPDPS.2006.1639644
Publication status: Published - 2006

ASJC Scopus subject areas

  • Engineering(all)

  • Cite this

    Safaei, F., Rezazad, M., Khonsari, A., Fathy, M., Ould-Khaoua, M., & Alzeidi, N. (2006). Software-based fault-tolerant routing algorithm in multi-dimensional networks. In 20th International Parallel and Distributed Processing Symposium, IPDPS 2006 (Vol. 2006). [1639644] https://doi.org/10.1109/IPDPS.2006.1639644