[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Fault-Tolerant Adaptive Routing in Dragonfly Networks

Published: 01 March 2019 Publication History

Abstract

Dragonfly networks have been widely used in the current high-performance computers or high-end servers. Fault-tolerant routing in dragonfly networks is essential. The rich interconnects provide good fault-tolerance ability for the network. A new deadlock-free adaptive fault-tolerant routing algorithm based on a new two-layer safety information model, is proposed by mapping routers in a group, and groups of the dragonfly network into two separate hypercubes. The new fault-tolerant routing algorithm tolerates static and dynamic faults. Our method can determine whether a packet can reach the destination at the source by using the new safety information model, which avoids dead-ends and aimless misrouting. Sufficient simulation results show that the proposed fault-tolerant routing algorithm even outperforms the previous minimal routing algorithm in fault-free networks in many cases.

Cited By

View all
  • (2024)A Graph Neural Network-Based Approach With Dynamic Multiqueue Optimization Scheduling (DMQOS) for Efficient Fault Tolerance and Load Balancing in Cloud ComputingInternational Journal of Intelligent Systems10.1155/int/63787202024Online publication date: 1-Jan-2024
  • (2024)Efficient Fault-Tolerant Path Embedding for 3D Torus Network Using Locally Faulty BlocksIEEE Transactions on Computers10.1109/TC.2024.341669573:9(2305-2319)Online publication date: 1-Sep-2024
  • (2023)Heterogeneous Die-to-Die Interfaces: Enabling More Flexible Chiplet Interconnection SystemsProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3614310(930-943)Online publication date: 28-Oct-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Dependable and Secure Computing
IEEE Transactions on Dependable and Secure Computing  Volume 16, Issue 2
March 2019
185 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 March 2019

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Graph Neural Network-Based Approach With Dynamic Multiqueue Optimization Scheduling (DMQOS) for Efficient Fault Tolerance and Load Balancing in Cloud ComputingInternational Journal of Intelligent Systems10.1155/int/63787202024Online publication date: 1-Jan-2024
  • (2024)Efficient Fault-Tolerant Path Embedding for 3D Torus Network Using Locally Faulty BlocksIEEE Transactions on Computers10.1109/TC.2024.341669573:9(2305-2319)Online publication date: 1-Sep-2024
  • (2023)Heterogeneous Die-to-Die Interfaces: Enabling More Flexible Chiplet Interconnection SystemsProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3614310(930-943)Online publication date: 28-Oct-2023
  • (2023)Disjoint Paths Construction and Fault-Tolerant Routing in BCube of Data Center NetworksIEEE Transactions on Computers10.1109/TC.2023.325184972:9(2467-2481)Online publication date: 1-Sep-2023
  • (2022)Exploring the Galaxyfly Family to Build Flexible-Scale Interconnection NetworksIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.310078333:5(1054-1068)Online publication date: 1-May-2022
  • (2021)Fault-Tolerant Secure Routing of BHn-Based Data Center NetworksSecurity and Communication Networks10.1155/2021/67239142021Online publication date: 1-Jan-2021
  • (2021)Fault-tolerant routing algorithm based on disjoint paths in 3-ary n-cube networks with structure faultsThe Journal of Supercomputing10.1007/s11227-021-03799-077:11(13090-13114)Online publication date: 1-Nov-2021
  • (2021)All-to-All Broadcast in Dragonfly NetworksComputing and Combinatorics10.1007/978-3-030-89543-3_2(13-24)Online publication date: 24-Oct-2021
  • (2020)Evaluation of Link Failure Resilience in Multirail Dragonfly-Class Networks through SimulationProceedings of the 2020 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation10.1145/3384441.3395989(105-116)Online publication date: 15-Jun-2020

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media