[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3106989.3107002acmotherconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

Combining ECN and RTT for Datacenter Transport

Published: 03 August 2017 Publication History

Abstract

Datacenter transports should provide low average and tail flow completion times (FCT) to achieve desired application performance. While most prior datacenter transports take either ECN or RTT as congestion signal, this paper makes a case that both signals are indispensable: ECN, as a per-hop signal, is more effective to prevent packet loss; while RTT, as an end-to-end signal, controls end-to-end queueing delay better. As persistent low flow completion times imply low queueing delay and near zero packet loss, we introduce EAR, a new datacenter transport that hears and reacts to both ECN and RTT. Our preliminary results show that: 1) compared to delay-based DCTCP, EAR achieves up to 91% lower packet losses and 93% fewer timeouts; 2) compared to ECN-based DCTCP, EAR reduces RTT by up to 32% for cross-rack traffic in a 4-level fattree. As a result, EAR delivers persistent low average and tail completion times under various scenarios in large scale simulations.

References

[1]
Google Data Centers. https://www.google.com/about/datacenters.
[2]
Microsoft Datacenter Infrastructure. https://www.microsoft.com/en-us/cloud-platform/global-datacenters.
[3]
The Network Simulator NS-2. http://www.isi.edu/nsnam/ns.
[4]
Mohammad Al-Fares, Alexander Loukissas, and Amin Vahdat. 2008. A scalable, commodity data center network architecture. In SIGCOMM.
[5]
Mohammad Alizadeh, Albert Greenberg, David A Maltz, Jitendra Padhye, Parveen Patel, Balaji Prabhakar, Sudipta Sengupta, and Murari Sridharan. 2010. Data center tcp (dctcp). In SIGCOMM.
[6]
Mohammad Alizadeh, Abdul Kabbani, Tom Edsall, Balaji Prabhakar, Amin Vahdat, and Masato Yasuda. 2012. Less is more: trading a little bandwidth for ultra-low latency in the data center. In NSDI.
[7]
Mohammad Alizadeh, Shuang Yang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar, and Scott Shenker. 2013. pfabric: Minimal near-optimal datacenter transport. In SIGCOMM.
[8]
Wei Bai, Kai Chen, Hao Wang, Li Chen, Dongsu Han, and Chen Tian. 2015. Information-Agnostic Flow Scheduling for Commodity Data Centers. In NSDI.
[9]
Wei Bai, Li Chen, Kai Chen, and Haitao Wu. 2016. Enabling ecn in multi-service multi-queue data centers. In NSDI.
[10]
Sally Floyd. 2003. HighSpeed TCP for large congestion windows. In RFC 3649.
[11]
Sangtae Ha, Injong Rhee, and Lisong Xu. 2008. CUBIC: a new TCP-friendly high-speed TCP variant. In SIGOPS.
[12]
Dongsu Han, Robert Grandl, Aditya Akella, and Srinivasan Seshan. 2013. FCP: A Flexible Transport Framework for Accommodating Diversity. In SIGCOMM.
[13]
Chi-Yao Hong, Matthew Caesar, and P Godfrey. 2012. Finishing flows quickly with preemptive scheduling. In SIGCOMM.
[14]
Michael Isard, Vijayan Prabhakaran, Jon Currey, Udi Wieder, Kunal Talwar, and Andrew Goldberg. 2009. Quincy: fair scheduling for distributed computing clusters. In SOSP.
[15]
Van Jacobson. 1988. Congestion avoidance and control. In SIGCOMM.
[16]
Virajith Jalaparti, Peter Bodik, Ishai Menache, Sriram Rao, Konstantin Makarychev, and Matthew Caesar. 2015. Network-aware scheduling for data-parallel jobs: Plan when you can. In SIGCOMM.
[17]
Glenn Judd. 2015. Attaining the Promise and Avoiding the Pitfalls of TCP in the Datacenter. In NSDI.
[18]
Changhyun Lee, Chunjong Park, Keon Jang, Sue Moon, and Dongsu Han. 2016. DX: Latency-Based Congestion Control for Datacenters. In ToN.
[19]
Vincent Liu, Daniel Halperin, Arvind Krishnamurthy, and Thomas E Anderson. 2013. F10: A Fault-Tolerant Engineered Network. In NSDI.
[20]
Radhika Mittal, Nandita Dukkipati, Emily Blem, Hassan Wassel, Monia Ghobadi, Amin Vahdat, Yaogong Wang, David Wetherall, David Zats, and others. 2015. TIMELY: RTT-based Congestion Control for the Datacenter. In SIGCOMM.
[21]
Ali Munir, Ghufran Baig, Syed M Irteza, Ihsan A Qazi, Alex X Liu, and Fahad R Dogar. 2014. Friends, not foes: synthesizing existing transport strategies for data center networks. In SIGCOMM.
[22]
Ali Munir, Ting He, Ramya Raghavendra, Franck Le, and Alex X Liu. 2016. Network Scheduling Aware Task Placement in Datacenters. In CoNEXT.
[23]
Ali Munir, Ihsan A Qazi, Zartash A Uzmi, Aisha Mushtaq, Saad N Ismail, M Safdar Iqbal, and Basma Khan. 2013. Minimizing flow completion times in data centers. In INFOCOM.
[24]
Rong Pan, Balaji Prabhakar, and Ashvin Laxmikantha. 2007. QCN: Quantized congestion notification. In IEEE802.
[25]
Arjun Roy, Hongyi Zeng, Jasmeet Bagga, George Porter, and Alex C Snoeren. 2015. Inside the social network's (datacenter) network. In SIGCOMM.
[26]
Arjun Singh, Joon Ong, Amit Agarwal, Glen Anderson, Ashby Armistead, Roy Bannon, Seb Boving, Gaurav Desai, Bob Felderman, Paulie Germano, and others. 2015. Jupiter rising: A decade of clos topologies and centralized control in google's datacenter network. In SIGCOMM.
[27]
Balajee Vamanan, Jahangir Hasan, and TN Vijaykumar. 2012. Deadline-aware datacenter tcp (d2tcp). In SIGCOMM.
[28]
Christo Wilson, Hitesh Ballani, Thomas Karagiannis, and Ant Rowtron. 2011. Better never than late: Meeting deadlines in datacenter networks. In SIGCOMM.
[29]
Lisong Xu, Khaled Harfoush, and Injong Rhee. 2004. Binary increase congestion control (BIC) for fast long-distance networks. In INFOCOM.
[30]
Matei Zaharia, Dhruba Borthakur, Joydeep Sen Sarma, Khaled Elmeleegy, Scott Shenker, and Ion Stoica. 2010. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling. In EuroSys.
[31]
Yibo Zhu, Haggai Eran, Daniel Firestone, Chuanxiong Guo, Marina Lipshteyn, Yehonatan Liron, Jitendra Padhye, Shachar Raindel, Mohamad Haj Yahia, and Ming Zhang. 2015. Congestion control for large-scale RDMA deployments. In SIGCOMM.
[32]
Yibo Zhu, Monia Ghobadi, Vishal Misra, and Jitendra Padhye. 2016. ECN or Delay: Lessons Learnt from Analysis of DCQCN and TIMELY. In CoNEXT.

Cited By

View all
  • (2024)RECC: Joint Congestion Control Based on RTT and ECN for High-speed RDMA NetworksProceedings of the ACM on Networking10.1145/36964022:CoNEXT4(1-18)Online publication date: 1-Dec-2024
  • (2024)MPVSched: Multipath Transmissions and Video Frame Scheduling for Content Delivery Networks2024 International Conference on Networking, Architecture and Storage (NAS)10.1109/NAS63802.2024.10781355(1-4)Online publication date: 9-Nov-2024
  • (2024)DCCSComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2024.110457247:COnline publication date: 18-Jul-2024
  • Show More Cited By

Index Terms

  1. Combining ECN and RTT for Datacenter Transport

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    APNet '17: Proceedings of the First Asia-Pacific Workshop on Networking
    August 2017
    127 pages
    ISBN:9781450352444
    DOI:10.1145/3106989
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Congestion Signal
    2. Datacenter Networks
    3. ECN
    4. RTT
    5. Transport Protocol

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    APNet'17
    APNet'17: First Asia-Pacific Workshop on Networking
    August 3 - 4, 2017
    Hong Kong, China

    Acceptance Rates

    Overall Acceptance Rate 50 of 118 submissions, 42%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)24
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)RECC: Joint Congestion Control Based on RTT and ECN for High-speed RDMA NetworksProceedings of the ACM on Networking10.1145/36964022:CoNEXT4(1-18)Online publication date: 1-Dec-2024
    • (2024)MPVSched: Multipath Transmissions and Video Frame Scheduling for Content Delivery Networks2024 International Conference on Networking, Architecture and Storage (NAS)10.1109/NAS63802.2024.10781355(1-4)Online publication date: 9-Nov-2024
    • (2024)DCCSComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2024.110457247:COnline publication date: 18-Jul-2024
    • (2024)Deep Reinforcement Learning Based Load Balancing for Heterogeneous Traffic in Datacenter NetworksAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_16(270-289)Online publication date: 1-Mar-2024
    • (2023)A Receiver-Driven Transport Protocol With High Link Utilization Using Anti-ECN Marking in Data Center NetworksIEEE Transactions on Network and Service Management10.1109/TNSM.2022.321834320:2(1898-1912)Online publication date: Jun-2023
    • (2023)An INT-based TCP window modulator for congestion control in data center networksJournal of Network and Computer Applications10.1016/j.jnca.2023.103688217(103688)Online publication date: Aug-2023
    • (2022)Congestion Control for Cross-Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2022.316158030:5(2074-2089)Online publication date: Oct-2022
    • (2022)Aeolus: A Building Block for Proactive Transport in Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2021.311998630:2(542-556)Online publication date: Apr-2022
    • (2022)Cutting Tail Latency in Commodity Datacenters with CloudburstIEEE INFOCOM 2022 - IEEE Conference on Computer Communications10.1109/INFOCOM48880.2022.9796898(600-609)Online publication date: 2-May-2022
    • (2022)Edge-Based Congestion-Aware Datacenter Load Balancing with Smart ProbingThe Computer Journal10.1093/comjnl/bxac13566:12(2908-2920)Online publication date: 21-Oct-2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media