A fault-tolerant communication scheme for regular cluster networks

Khaled Day*, Bassel Arafeh, Abderezak Touzene

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Large cluster systems with thousands of nodes have become a cost-effective alternative to traditional supercomputers. In these systems cluster nodes are interconnected using high-degree switches. Regular direct network topologies including tori (k-ary n-cubes) and meshes are among adapted choices for interconnecting these high-degree switches. We propose a general faulttolerant routing scheme applicable for regular direct interconnection networks satisfying some interconnection conditions. The scheme is based on the availability of efficiently identifiable disjoint routes between network nodes. The proposed scheme is first presented in general terms for any interconnection topology satisfying the presented connectivity conditions. The scheme is then illustrated on two examples of interconnection topologies namely the binary hypercube and the k-ary n-cube.

Original languageEnglish
Title of host publicationProceedings of the Sixth IASTED International Conference on Communications, Internet, and Information Technology, CIIT 2007
Pages147-152
Number of pages6
Publication statusPublished - 2007
Event6th IASTED International Conference on Communications, Internet, and Information Technology, CIIT 2007 - Banff, AB, Canada
Duration: Jul 2 2007Jul 4 2007

Publication series

NameProceedings of the Sixth IASTED International Conference on Communications, Internet, and Information Technology, CIIT 2007

Other

Other6th IASTED International Conference on Communications, Internet, and Information Technology, CIIT 2007
Country/TerritoryCanada
CityBanff, AB
Period7/2/077/4/07

Keywords

  • Cluster systems
  • Fault-tolerant routing
  • Interconnection networks
  • Network management

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'A fault-tolerant communication scheme for regular cluster networks'. Together they form a unique fingerprint.

Cite this