[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3338533.3366606acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Generalizing Rate Control Strategies for Realtime Video Streaming via Learning from Deep Learning

Published: 10 January 2020 Publication History

Abstract

The leading learning-based rate control method, i.e., QARC, achieves state-of-the-art performances but fails to interpret the fundamental principles, and thus lacks the abilities to further improve itself efficiently. In this paper, we propose EQARC (Explainable QARC) via reconstructing QARC's modules, aiming to demystify how QARC works. In details, we first utilize a novel hybrid attention-based CNN+GRU model to re-characterize the original quality prediction network and reasonably replace the QARC's 1D-CNN layers with 2D-CNN layers. Using trace-driven experiment, we demonstrate the superiority of EQARC over existing state-of-the-art approaches. Next, we collect several useful information from each interpretable modules and learn the insight of EQARC. Following this step, we further propose AQARC (Advanced QARC), which is the light-weighted version of QARC. Experimental results show that AQARC achieves the same performances as the QARC with an overhead reduction of 90%. In short, through learning from deep learning, we generalize a rate control method which can both reach high performance and reduce computation cost.

References

[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR abs/1409.0473 (2014). arXiv:1409.0473 http://arxiv.org/abs/1409.0473
[2]
Lawrence S. Brakmo and Larry L. Peterson. 1995. TCP Vegas: End to end congestion avoidance on a global Internet. IEEE Journal on selected Areas in communications 13, 8 (1995), 1465--1480.
[3]
Gaetano Carlucci, Luca De Cicco, Stefan Holmer, and Saverio Mascolo. 2016. Analysis and design of the google congestion control for web real-time communication (WebRTC). In Proceedings of the 7th International Conference on Multimedia Systems. ACM, 13.
[4]
Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arXiv: Neural and Evolutionary Computing (2014).
[5]
Mark Handley, Sally Floyd, Jitendra Padhye, and Jörg Widmer. 2002. TCP friendly rate control (TFRC): Protocol specification. Technical Report.
[6]
Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, and Lifeng Sun. 2018. Qarc: Video quality aware rate control for real-time video streaming based on deep reinforcement learning. In 2018 ACM Multimedia Conference. ACM, 1208--1216.
[7]
Eymen Kurdoglu, Yong Liu, Yao Wang, Yongfang Shi, ChenChen Gu, and Jing Lyu. 2016. Realtime bandwidth prediction and rate adaptation for video calls over cellular networks. In Proceedings of the 7th International Conference on Multimedia Systems. ACM, 12.
[8]
Aleksandar Kuzmanovic and Edward W Knightly. 2006. TCP-LP: low-priority service via end-point congestion control. IEEE/ACM Transactions on Networking (TON) 14, 4 (2006), 739--752.
[9]
Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. 2017. A structured self-attentive sentence embedding. arXiv preprint arXiv:1703.03130 (2017).
[10]
Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective Approaches to Attention-based Neural Machine Translation. CoRR abs/1508.04025 (2015). arXiv:1508.04025 http://arxiv.org/abs/1508.04025
[11]
Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning. 1928--1937.
[12]
R. Rejaie, M. Handley, and D. Estrin. 1999. RAP: An end-to-end rate-based congestion control mechanism for real-time streams in the Internet. In INFOCOM 99, IEEE, Vol. 3. 1337--1345 vol.3. https://doi.org/10.1109/INFCOM.1999.752152
[13]
Dario Rossi, Claudio Testa, Silvio Valenti, and Luca Muscariello. 2010. LEDBAT: The New BitTorrent Congestion Control Protocol. In ICCCN. 1--6.
[14]
International Telecommunications. 2007. R-REC-BT.1788. https://www.itu.int/rec/R-REC-BT.1788-0-200701-I/en. (2007).
[15]
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning. 2048--2057.
[16]
Chaoyun Zhang, Paul Patras, and Hamed Haddadi. 2018. Deep Learning in Mobile and Wireless Networking: A Survey. arXiv preprint arXiv:1803.04311 (2018).
[17]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In CVPR.

Cited By

View all
  • (2020)Referenceless Rate-Distortion Modeling with Learning from Bitstream and Pixel FeaturesProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3413545(2481-2489)Online publication date: 12-Oct-2020

Index Terms

  1. Generalizing Rate Control Strategies for Realtime Video Streaming via Learning from Deep Learning
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia
          December 2019
          403 pages
          ISBN:9781450368414
          DOI:10.1145/3338533
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 10 January 2020

          Permissions

          Request permissions for this article.

          Check for updates

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Funding Sources

          • Beijing Key Lab of Networked Multimedia
          • Kuaishou-Tsinghua Joint Project
          • National Key R&D Program of China
          • NSFC

          Conference

          MMAsia '19
          Sponsor:
          MMAsia '19: ACM Multimedia Asia
          December 15 - 18, 2019
          Beijing, China

          Acceptance Rates

          MMAsia '19 Paper Acceptance Rate 59 of 204 submissions, 29%;
          Overall Acceptance Rate 59 of 204 submissions, 29%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)6
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 23 Dec 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2020)Referenceless Rate-Distortion Modeling with Learning from Bitstream and Pixel FeaturesProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3413545(2481-2489)Online publication date: 12-Oct-2020

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media