Totoro: A Scalable and Fault-Tolerant Data Center Network by Using Backup Port

Abstract: Scalability and fault tolerance have become fundamental challenges for data center network structures due to the explosive growth of data. Neither the structures proposed in the area of parallel computing nor those based on tree hierarchies are able to satisfy both demands. In this paper, we propose Totoro, a scalable and fault-tolerant network that addresses these challenges by using backup built-in Ethernet ports. We connect a group of servers to an intra-switch to form a basic partition. We then use half of the backup ports to connect those basic partitions through inter-switches, building a larger partition. Totoro is defined hierarchically and recursively: a high-level Totoro is constructed from many low-level Totoros. Totoro can scale to millions of nodes. We also design a fault-tolerant routing protocol whose capability is very close to the performance bound. Our experiments show that Totoro is a viable interconnection structure for data centers.
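The recursive construction described in the abstract can be illustrated with a rough sizing sketch. The abstract does not state the fan-out, so the code below assumes that every switch has n ports, that a level-0 partition holds n servers, and that each higher level joins n partitions of the level below while consuming half of the backup ports that are still free. The function names and the 48-port example are hypothetical and serve only as illustration.

```python
def totoro_servers(n: int, level: int) -> int:
    """Servers in a level-`level` partition.

    Assumption (not stated in the abstract): a level-0 partition has n
    servers, and each higher level joins n partitions of the level below.
    """
    if n < 1 or level < 0:
        raise ValueError("n must be >= 1 and level must be >= 0")
    return n ** (level + 1)


def free_backup_ports(n: int, level: int) -> int:
    """Backup ports still unused after building a level-`level` partition.

    Assumption: every server starts with one free backup port, and each
    level above 0 uses half of the ports that are still free. The integer
    division is exact when n is even and level <= 1, or more generally
    when n ** (level + 1) is divisible by 2 ** level.
    """
    return totoro_servers(n, level) // (2 ** level)


if __name__ == "__main__":
    # Hypothetical 48-port switches, levels 0 through 3.
    for level in range(4):
        print(level, totoro_servers(48, level), free_backup_ports(48, level))
```

Under these assumptions, hypothetical 48-port switches give 48^4 = 5,308,416 servers at level 3, which is in line with the abstract's claim of scaling to millions of nodes.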
Document type:
Conference paper
Ching-Hsien Hsu; Xiaoming Li; Xuanhua Shi; Ran Zheng. 10th International Conference on Network and Parallel Computing (NPC), Sep 2013, Guiyang, China. Springer, Lecture Notes in Computer Science, LNCS-8147, pp.94-105, 2013, Network and Parallel Computing. 〈10.1007/978-3-642-40820-5_9〉
Cited literature [15 references]

https://hal.inria.fr/hal-01513881
Contributor: Hal Ifip
Submitted on: Tuesday, April 25, 2017 - 15:11:05
Last modified on: Friday, November 3, 2017 - 22:24:07
Document(s) archived on: Wednesday, July 26, 2017 - 14:22:51

File

978-3-642-40820-5_9_Chapter.pd...
Files produced by the author(s)

License


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

HAL Id: hal-01513881
DOI: 10.1007/978-3-642-40820-5_9

Citation

Junjie Xie, Yuhui Deng, Ke Zhou. Totoro: A Scalable and Fault-Tolerant Data Center Network by Using Backup Port. Ching-Hsien Hsu; Xiaoming Li; Xuanhua Shi; Ran Zheng. 10th International Conference on Network and Parallel Computing (NPC), Sep 2013, Guiyang, China. Springer, Lecture Notes in Computer Science, LNCS-8147, pp.94-105, 2013, Network and Parallel Computing. 〈10.1007/978-3-642-40820-5_9〉. 〈hal-01513881〉

Metrics

Record views: 104
File downloads: 52