QARC: Video quality aware rate control for real-time video streaming based on deep reinforcement learning

T Huang, RX Zhang, C Zhou, L Sun - Proceedings of the 26th ACM …, 2018 - dl.acm.org
Proceedings of the 26th ACM international conference on Multimedia, 2018dl.acm.org
Real-time video streaming is now one of the main applications in all network environments.
Due to the fluctuation of throughput under various network conditions, how to choose a
proper bitrate adaptively has become an upcoming and interesting issue. To tackle this
problem, most proposed rate control methods work for providing high video bitrates instead
of video qualities. Nevertheless, we notice that there exists a trade-off between sending
bitrate and video quality, which motivates us to focus on how to reach a balance between …
Real-time video streaming is now one of the main applications in all network environments. Due to the fluctuation of throughput under various network conditions, how to choose a proper bitrate adaptively has become an upcoming and interesting issue. To tackle this problem, most proposed rate control methods work for providing high video bitrates instead of video qualities. Nevertheless, we notice that there exists a trade-off between sending bitrate and video quality, which motivates us to focus on how to reach a balance between them.
In this paper, we propose QARC (video Quality Aware Rate Control), a rate control algorithm that aims to obtain a higher perceptual video quality with possible lower sending rate and transmission latency. Starting from scratch, QARC uses deep reinforcement learning(DRL) algorithm to train a neural network to select future bitrates based on previously observed network status and past video frames. To overcome the "state explosion problem'', we design a neural network to predict future perceptual video quality as a vector for taking the place of the raw picture in the DRL's inputs.
We evaluate QARC via trace-driven simulation, outperforming existing approach with improvements in average video quality of 18% - 25% and decreasing in average latency with 23% -45%. Meanwhile, comparing QARC with offline optimal high bitrate method on various network conditions, we find that QARC also yields a solid result.
ACM Digital Library