[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Practical Learning-based Approach for Viewer Scheduling in the Crowdsourced Live Streaming

Published: 21 June 2020 Publication History

Abstract

Scheduling viewers effectively among different Content Delivery Network (CDN) providers is challenging owing to the extreme diversity in the crowdsourced live streaming (CLS) scenarios. Abundant algorithms have been proposed in recent years, which, however, suffer from a critical limitation: Due to their inaccurate feature engineering or naive rules, they cannot optimally schedule viewers. To address this concern, we put forward LTS (Learn to Schedule), a novel scheduling algorithm that can adapt to the dynamics from both viewer traffics and CDN performance. In detail, we first propose LTS-RL, an approach that schedules CLS viewers based on deep reinforcement learning (DRL). Since LTS-RL is trained in an end-to-end way, it can automatically learn scheduling algorithms without any pre-programmed models or assumptions about the environment dynamics. At the same time, to practically deploy LTS-RL, we then use the decision tree and imitation learning to convert LTS-RL into a more light-weighted and interpretable model, which is denoted as Fast-LTS. After the extensive evaluation of the real data from a leading CLS platform in China, we demonstrate that our proposed model (both LTS-RL and Fast-LTS) can improve the average quality of experience (QoE) over state-of-the-art approaches by 8.71--15.63%. At the same time, we also demonstrate that Fast-LTS can faithfully convert the complicated LTS-RL with slight performance degradation (< 2%), while significantly reducing the decision time (×7--10).

References

[1]
Vijay Kumar Adhikari et al. 2012. Unreeling netflix: Understanding and improving multi-cdn movie delivery. In Proceedings of the IEEE International Conference on Computer Communications (INFOCOM’12). IEEE, 1620--1628.
[2]
Vijay Kumar Adhikari et al. 2012. A tale of three CDNs: An active measurement study of Hulu and its CDNs. In Proceedings of the 2012 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS’12). IEEE, 7--12.
[3]
Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Feifei, and Silvio Savarese. 2016. Social LSTM: Human trajectory prediction in crowded spaces. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 961--971.
[4]
Hendrik Blockeel and Luc De Raedt. 1998. Top-down induction of first-order logical decision trees. Artif. Intell. 101, 1--2 (1998), 285--297.
[5]
Leo Breiman. 2017. Classification and Regression Trees. Routledge.
[6]
Arnaud Dethise, Marco Canini, and Srikanth Kandula. 2019. Cracking open the black box: What observations can tell us about reinforcement learning agents. In Proceedings of the 2019 Workshop on Network Meets AI 8 ML. ACM, 29--36.
[7]
Florin Dobrian and et al. 2011. Understanding the impact of video quality on user engagement. In ACM SIGCOMM Computer Communication Review, Vol. 41. ACM, 362--373.
[8]
Mnih et al. 2016. Asynchronous methods for deep reinforcement learning. In Proceedings of the International Conference on Machine Learning. 1928--1937.
[9]
Chen Fei et al. 2015. Cloud-assisted live streaming for crowdsourced multimedia content. IEEE Trans. Multimedia 17, 9 (2015), 1471--1483.
[10]
Rui Fu, Zuo Zhang, and L. Li. 2016. Using LSTM and GRU neural network methods for traffic flow prediction. In 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC’16). IEEE, 324–328.
[11]
Aurélien Garivier and Eric Moulines. 2008. On upper-confidence bound policies for non-stationary bandit problems. arXiv preprint arXiv:0805.3415 (2008).
[12]
Wenbo Guo, Dongliang Mu, Jun Xu, Purui Su, Gang Wang, and Xinyu Xing. 2018. Lemna: Explaining deep learning based security applications. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security. ACM, 364--379.
[13]
Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, and Lifeng Sun. 2018. QARC: Video quality aware rate control for real-time video streaming based on deep reinforcement learning. arXiv preprint arXiv:1805.02482 (2018).
[14]
Tianchi Huang, Chao Zhou, Rui-Xiao Zhang, Chenglei Wu, Xin Yao, and Lifeng Sun. 2019. Comyco: Quality-aware adaptive video streaming via imitation learning. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 429--437.
[15]
Junchen Jiang et al. 2016. CFA: A practical prediction system for video QoE optimization. In Proceedings of the USENIX Symposium on Networked Systems Design and Implementation (NSDI’16). 137--150.
[16]
Junchen Jiang et al. 2017. Pytheas: Enabling data-driven quality of experience optimization using group-based exploration-exploitation. In Proceedings of the USENIX Symposium on Networked Systems Design and Implementation (NSDI’17), Vol. 1. 3.
[17]
Jessica Klein. 2018. Twitch Ended 2017 With 15 Million Daily Visitors, 27K Partnered Streamers. Retrieved February, 6, 2018 from https://www.tubefilter.com/2018/02/06/twitch-2017-year-in-review/.
[18]
Hongqiang Harry Liu, Ye Wang, Yang Richard Yang, Hao Wang, and Chen Tian. 2012. Optimizing cost and performance for content multihoming. ACM Spec. Interest Group Data Commun. 42, 4 (2012), 371--382.
[19]
Xi Liu et al. 2012. A case for a coordinated internet video control plane. ACM SIGCOMM Comput. Commun. Rev. 42, 4 (2012), 359--370.
[20]
Scott M. Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems. 4765--4774.
[21]
Zoltán Ádám Mann. 2015. Allocation of virtual machines in cloud data centers—a survey of problem models and optimization algorithms. ACM Comput. Surv. 48, 1 (2015), 11.
[22]
Hongzi Mao, Ravi Netravali, and Mohammad Alizadeh. 2017. Neural adaptive video streaming with pensieve. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication. ACM, 197--210.
[23]
Zili Meng, Jing Chen, Yaning Guo, Chen Sun, Hongxin Hu, and Mingwei Xu. 2019. PiTree: Practical implementation of ABR algorithms using decision trees. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 2431--2439.
[24]
Zili Meng, Minhu Wang, Mingwei Xu, Hongzi Mao, Jiasong Bai, and Hongxin Hu. 2019. Explaining deep learning-based networked systems. arXiv preprint arXiv:1910.03835 (2019).
[25]
Volodymyr Mnih et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529.
[26]
Haitian Pang, Cong Zhang, Fangxin Wang, Han Hu, Zhi Wang, Jiangchuan Liu, and Lifeng Sun. 2018. Optimizing personalized interaction experience in crowd-interactive livecast: A cloud-edge approach. In Proceedings of the 26th ACM International Conference on Multimedia. 1217--1225.
[27]
Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. Why should i trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1135--1144.
[28]
Stéphane Ross, Geoffrey Gordon, and Drew Bagnell. 2011. A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. 627--635.
[29]
David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra, and Martin Riedmiller. 2014. Deterministic policy gradient algorithms. In Proceedings of the 31st International Conference on International Conference on Machine Learning, Vol. 32. I--387.
[30]
Mark Stemm and et al. 2000. A network measurement architecture for adaptive applications. In Proceedings of the IEEE International Conference on Computer Communications (INFOCOM’00). Vol. 1. IEEE, 285--294.
[31]
Richard S. Sutton and Andrew G. Barto. 1998. Reinforcement Learning: An Introduction. Vol. 1. MIT Press, Cambridge, MA.
[32]
Fengxiao Tang, Bomin Mao, Zubair Md Fadlullah, Nei Kato, Osamu Akashi, Takeru Inoue, and Kimihiro Mizutani. 2017. On removing routing protocol from future wireless networks: A real-time deep learning approach for intelligent traffic control. IEEE Wireless Commun. 25, 1 (2017), 154--160.
[33]
Ruben Torres et al. 2011. Dissecting video server selection strategies in the youtube cdn. In Proceedings of the 2011 31st International Conference on Distributed Computing Systems (ICDCS’11). IEEE, 248--257.
[34]
Andrew Trask et al. 2018. Neural arithmetic logic units. In Advances in Neural Information Processing Systems. 8046--8055.
[35]
Jason Min Wang, Jun Zhang, and Brahim Bensaou. 2014. Content multi-homing: An alternative approach. In Proceedings of the 2014 IEEE International Conference on Communications (ICC’14). IEEE, 3118--3123.
[36]
Yifei Wei and et al. 2018. User scheduling and resource allocation in HetNets with hybrid energy supply: An actor-critic reinforcement learning approach. IEEE Trans. Wireless Commun. 17, 1 (2018), 680--692.
[37]
Cheng-Zhong Xu, Jia Rao, and Xiangping Bu. 2012. URL: A unified reinforcement learning approach for autonomic cloud management. J. Parallel Distrib. Comput. 72, 2 (2012), 95--105.
[38]
Zhiyuan Xu and et al. 2018. Experience-driven networking: A deep reinforcement learning based approach. arXiv preprint arXiv:1801.05757 (2018).
[39]
Bo Yan and et al. 2017. LiveJack: Integrating CDNs and edge clouds for live content broadcasting. In Proceedings of the 2017 ACM on Multimedia Conference. ACM, 73--81.
[40]
Francis Y. Yan, Jestin Ma, Greg D. Hill, Deepti Raghavan, Riad S. Wahby, Philip Levis, and Keith Winstein. 2018. Pantheon: The training ground for Internet congestion-control research. In Proceedings of the 2018 USENIX Annual Technical Conference (USENIX ATC’18). 731--743.
[41]
Zhu Yifei and et al. 2017. When cloud meets uncertain crowd: An auction approach for crowdsourced livecast transcoding. In Proceedings of the 25th ACM International Conference on Multimedia. ACM, 1372--1380.
[42]
Rui-Xiao Zhang, Tianchi Huang, Ming Ma, Haitian Pang, Xin Yao, Chenglei Wu, and Lifeng Sun. 2019. Enhancing the crowdsourced live streaming: A deep reinforcement learning approach. In Proceedings of the 29th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video. ACM, 55--60.
[43]
Rui-Xiao Zhang, Tianchi Huang, Ming Ma, Haitian Pang, Xin Yao, Chenglei Wu, and Lifeng Sun. 2019. Enhancing the crowdsourced live streaming: A deep reinforcement learning approach. In Proceedings of the 29th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video. ACM, 55--60.
[44]
Ying Zheng, Ziyu Liu, Xinyu You, Yuedong Xu, and Junchen Jiang. 2018. Demystifying deep learning in networking. In Proceedings of the 2nd Asia-Pacific Workshop on Networking. ACM, 1--7.

Cited By

View all
  • (2024)Reviving Peer-to-Peer Networking for Scalable Crowdsourced Live Video StreamingIEEE/ACM Transactions on Networking10.1109/TNET.2024.338039532:4(3205-3220)Online publication date: Aug-2024
  • (2022)Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission ServicesIEEE Transactions on Multimedia10.1109/TMM.2021.306362024(1350-1365)Online publication date: 2022
  • (2021)Green Communication for Next-Generation Wireless SystemsWireless Communications & Mobile Computing10.1155/2021/55285842021Online publication date: 1-Jan-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications
ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 16, Issue 2s
Special Issue on Smart Communications and Networking for Future Video Surveillance and Special Section on Extended MMSYS-NOSSDAV 2019 Best Papers
April 2020
291 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/3407689
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 June 2020
Online AM: 07 May 2020
Accepted: 01 April 2020
Revised: 01 March 2020
Received: 01 December 2019
Published in TOMM Volume 16, Issue 2s

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Crowdsourced live streaming
  2. reinforcement learning
  3. scheduling

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Key R8D Program of China
  • NSFC
  • Beijing Key Lab of Networked Multimedia

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)3
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Reviving Peer-to-Peer Networking for Scalable Crowdsourced Live Video StreamingIEEE/ACM Transactions on Networking10.1109/TNET.2024.338039532:4(3205-3220)Online publication date: Aug-2024
  • (2022)Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission ServicesIEEE Transactions on Multimedia10.1109/TMM.2021.306362024(1350-1365)Online publication date: 2022
  • (2021)Green Communication for Next-Generation Wireless SystemsWireless Communications & Mobile Computing10.1155/2021/55285842021Online publication date: 1-Jan-2021
  • (2021)Alignment Enhancement Network for Fine-grained Visual CategorizationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/344620817:1s(1-20)Online publication date: 31-Mar-2021
  • (2021)Augmented Queue-Based Transmission and Transcoding Optimization for Livecast Services Based on Cloud-Edge-Crowd IntegrationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2020.304785931:11(4470-4484)Online publication date: Nov-2021

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media