Communication delay analysis of fault-tolerant pipelined circuit switching in torus

F. Safaei, A. Khonsari, M. Fathy, M. Ould-Khaoua

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Large-scale parallel systems, Multiprocessors System-on-Chip (MP-SoCs), multicomputers, and cluster computers are often composed of hundreds or thousands of components (such as routers, channels and connectors) that collectively possess failure rates higher than what arise in the ordinary systems. One of the most important issues in the design of such systems is the development of the efficient fault-tolerant mechanisms that provide high throughput and low latency in communications to ensure that these systems will keep running in a degraded mode until the faulty components are repaired. Pipelined Circuit Switching (PCS) has been suggested as an efficient switching method for supporting inter-processor communications in networks due to its ability to preserve both communication performance and fault-tolerant demands in such systems. This paper presents a new mathematical model to investigate the effects of failures and capture the mean message latency in torus using PCS in the presence of faulty components. Simulation experiments confirm that the analytical model exhibits a good degree of accuracy under different working conditions.

Original languageEnglish
Pages (from-to)1131-1144
Number of pages14
JournalJournal of Computer and System Sciences
Volume73
Issue number8
DOIs
Publication statusPublished - Dec 2007

Keywords

  • Adaptive routing
  • Fault-tolerance
  • Large-scale parallel systems
  • Message latency
  • PCS
  • Performance evaluation
  • Queuing theory
  • Torus
  • Virtual channels

ASJC Scopus subject areas

  • Computational Theory and Mathematics

Fingerprint Dive into the research topics of 'Communication delay analysis of fault-tolerant pipelined circuit switching in torus'. Together they form a unique fingerprint.

  • Cite this