More Web Proxy on the site http://driver.im/

research-article

Generalizing Rate Control Strategies for Realtime Video Streaming via Learning from Deep Learning

Authors:

Rui-Xiao Zhang,

Lifeng SunAuthors Info & Claims

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia

Article No.: 52, Pages 1 - 6

https://doi.org/10.1145/3338533.3366606

Published: 10 January 2020 Publication History

Abstract

The leading learning-based rate control method, i.e., QARC, achieves state-of-the-art performances but fails to interpret the fundamental principles, and thus lacks the abilities to further improve itself efficiently. In this paper, we propose EQARC (Explainable QARC) via reconstructing QARC's modules, aiming to demystify how QARC works. In details, we first utilize a novel hybrid attention-based CNN+GRU model to re-characterize the original quality prediction network and reasonably replace the QARC's 1D-CNN layers with 2D-CNN layers. Using trace-driven experiment, we demonstrate the superiority of EQARC over existing state-of-the-art approaches. Next, we collect several useful information from each interpretable modules and learn the insight of EQARC. Following this step, we further propose AQARC (Advanced QARC), which is the light-weighted version of QARC. Experimental results show that AQARC achieves the same performances as the QARC with an overhead reduction of 90%. In short, through learning from deep learning, we generalize a rate control method which can both reach high performance and reduce computation cost.

References

[1]

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR abs/1409.0473 (2014). arXiv:1409.0473 http://arxiv.org/abs/1409.0473

[2]

Lawrence S. Brakmo and Larry L. Peterson. 1995. TCP Vegas: End to end congestion avoidance on a global Internet. IEEE Journal on selected Areas in communications 13, 8 (1995), 1465--1480.

Digital Library

[3]

Gaetano Carlucci, Luca De Cicco, Stefan Holmer, and Saverio Mascolo. 2016. Analysis and design of the google congestion control for web real-time communication (WebRTC). In Proceedings of the 7th International Conference on Multimedia Systems. ACM, 13.

Digital Library

[4]

Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arXiv: Neural and Evolutionary Computing (2014).

[5]

Mark Handley, Sally Floyd, Jitendra Padhye, and Jörg Widmer. 2002. TCP friendly rate control (TFRC): Protocol specification. Technical Report.

[6]

Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, and Lifeng Sun. 2018. Qarc: Video quality aware rate control for real-time video streaming based on deep reinforcement learning. In 2018 ACM Multimedia Conference. ACM, 1208--1216.

Digital Library

[7]

Eymen Kurdoglu, Yong Liu, Yao Wang, Yongfang Shi, ChenChen Gu, and Jing Lyu. 2016. Realtime bandwidth prediction and rate adaptation for video calls over cellular networks. In Proceedings of the 7th International Conference on Multimedia Systems. ACM, 12.

Digital Library

[8]

Aleksandar Kuzmanovic and Edward W Knightly. 2006. TCP-LP: low-priority service via end-point congestion control. IEEE/ACM Transactions on Networking (TON) 14, 4 (2006), 739--752.

Digital Library

[9]

Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. 2017. A structured self-attentive sentence embedding. arXiv preprint arXiv:1703.03130 (2017).

[10]

Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective Approaches to Attention-based Neural Machine Translation. CoRR abs/1508.04025 (2015). arXiv:1508.04025 http://arxiv.org/abs/1508.04025

[11]

Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning. 1928--1937.

Digital Library

[12]

R. Rejaie, M. Handley, and D. Estrin. 1999. RAP: An end-to-end rate-based congestion control mechanism for real-time streams in the Internet. In INFOCOM 99, IEEE, Vol. 3. 1337--1345 vol.3. https://doi.org/10.1109/INFCOM.1999.752152

[13]

Dario Rossi, Claudio Testa, Silvio Valenti, and Luca Muscariello. 2010. LEDBAT: The New BitTorrent Congestion Control Protocol. In ICCCN. 1--6.

[14]

International Telecommunications. 2007. R-REC-BT.1788. https://www.itu.int/rec/R-REC-BT.1788-0-200701-I/en. (2007).

[15]

Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning. 2048--2057.

Digital Library

[16]

Chaoyun Zhang, Paul Patras, and Hamed Haddadi. 2018. Deep Learning in Mobile and Wireless Networking: A Survey. arXiv preprint arXiv:1803.04311 (2018).

[17]

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In CVPR.

Cited By

Sun YLi LLi ZLiu SWen Chen CCucchiara RHua XQi GRicci EZhang ZZimmermann R(2020)Referenceless Rate-Distortion Modeling with Learning from Bitstream and Pixel FeaturesProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3413545(2481-2489)Online publication date: 12-Oct-2020
https://dl.acm.org/doi/10.1145/3394171.3413545

Index Terms

Generalizing Rate Control Strategies for Realtime Video Streaming via Learning from Deep Learning

Index terms have been assigned to the content through auto-classification.

Recommendations

Joint Deep Multi-View Learning for Image Clustering
In this paper, a novel <bold>D</bold>eep <bold>M</bold>ulti-view <bold>J</bold>oint <bold>C</bold>lustering (<bold>DMJC</bold>) framework is proposed, where multiple deep embedded features, multi-view fusion mechanism, and clustering assignments can be ...
Advanced Deep Learning with Keras: Apply deep learning techniques, autoencoders, GANs, variational autoencoders, deep reinforcement learning, policy gradients, and more
Deep Learning Essentials: Your hands-on guide to the fundamentals of deep learning and neural network modeling

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia

December 2019

403 pages

ISBN:9781450368414

DOI:10.1145/3338533

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 January 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

Beijing Key Lab of Networked Multimedia
Kuaishou-Tsinghua Joint Project
National Key R&D Program of China
NSFC

Conference

MMAsia '19

Sponsor:

SIGMM

MMAsia '19: ACM Multimedia Asia

December 15 - 18, 2019

Beijing, China

Acceptance Rates

MMAsia '19 Paper Acceptance Rate 59 of 204 submissions, 29%;

Overall Acceptance Rate 59 of 204 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
169
Total Downloads

Downloads (Last 12 months)6
Downloads (Last 6 weeks)1

Reflects downloads up to 23 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Sun YLi LLi ZLiu SWen Chen CCucchiara RHua XQi GRicci EZhang ZZimmermann R(2020)Referenceless Rate-Distortion Modeling with Learning from Bitstream and Pixel FeaturesProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3413545(2481-2489)Online publication date: 12-Oct-2020
https://dl.acm.org/doi/10.1145/3394171.3413545

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents