[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3625468.3647619acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article
Open access

QV4: QoE-based Viewpoint-Aware V-PCC-encoded Volumetric Video Streaming

Published: 17 April 2024 Publication History

Abstract

Volumetric videos allow six degrees of freedom (6DoF) movement for viewers, enabling numerous applications in domains such as entertainment, healthcare, and education. MPEG's Video-based Point Cloud Compression (V-PCC) is a recent new standard for volumetric video compression that achieves a considerable compression rate while maintaining the quality of the point cloud sequence. However, V-PCC is hard to fit into existing tiling-based volumetric video streaming framework due to the lack of proper user viewing adaptive techniques. In this paper, we propose QV4, a Quality-of-Experience (QoE) based streaming pipeline for viewpoint-aware V-PCC-encoded volumetric video. Specifically, we leverage the intermediate results produced by the V-PCC encoder to achieve effective and efficient viewpoint-aware tiling for V-PCC. We then build a QoE model and a 6DoF movement model based on real-world user data, to predict the users' viewing experience and behaviors, respectively. The proposed QoE model and 6DoF movement model are combined with viewpoint-aware V-PCC tiling to maximize the visual quality of volumetric videos. Extensive simulations show that by enabling viewpoint-aware adaptation and optimization for V-PCC-encoded volumetric videos, QV4 can achieve up to 14.67% improvement in structural similarity index (SSIM) and 7.39% improvement in video multi-method assessment fusion (VMAF) over highly dynamic viewing behaviors in a network with limited and fluctuating bandwidth.

References

[1]
Anne Aaron, Zhi Li, Megha Manohara, Joe Yuchieh Lin, Eddy Chi-Hao Wu, and C.-C. Jay Kuo. Challenges in cloud based ingest and encoding for high quality streaming media. In Proceedings of the 2015 IEEE International Conference on Image Processing, ICIP 2015, Quebec City, QC, Canada, September 27-30, 2015, pages 1732--1736. IEEE, 2015.
[2]
Evangelos Alexiou and Touradj Ebrahimi. Exploiting user interactivity in quality assessment of point cloud imaging. In Proceedings of the 11th International Conference on Quality of Multimedia Experience QoMEX 2019, Berlin, Germany, June 5-7, 2019, pages 1--6. IEEE, 2019.
[3]
Evangelos Alexiou, Irene Viola, Tomás M Borges, Tiago A Fonseca, Ricardo L De Queiroz, and Touradj Ebrahimi. A comprehensive study of the rate-distortion performance in mpeg point cloud compression. APSIPA Transactions on Signal and Information Processing, 8:e27, Nov 2019.
[4]
Jon Louis Bentley. Multidimensional binary search trees used for associative searching. Commun. ACM, 18(9):509--517, Sept 1975.
[5]
Luca De Cicco, Vito Caldaralo, Vittorio Palmisano, and Saverio Mascolo. ELASTIC: A client-side controller for dynamic adaptive streaming over HTTP (DASH). In Proceedings of the 20th International Packet Video Workshop, PV 2013, San Jose, CA, USA, December 12-13, 2013, pages 1--8. IEEE, 2013.
[6]
Samuel Rhys Cox, May Lim, and Wei Tsang Ooi. VOLVQAD: an MPEG V-PCC volumetric video quality assessment dataset. In Proceedings of the 14th Conference on ACM Multimedia Systems, MMSys 2023, Vancouver, BC, Canada, June 7-10, 2023, pages 357--362. ACM, 2023.
[7]
DASH Industry Forum. Guidelines for implementation: DASH-AVC/264 test cases and vectors. http://dashif.org/guidelines/, 2014.
[8]
Olivier Devillers and Pierre-Marie Gandoin. Geometric compression for interactive transmission. In Proceedings of the 11th IEEE Visualization Conference, IEEE Vis 2000, Salt Lake City, UT, USA, October 8-13, 2000, Proceedings, pages 319--326. IEEE Computer Society and ACM, 2000.
[9]
Eugene d'Eon, Bob Harrison, Taos Myers, and Philip A Chou. 8i voxelized full bodies-a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document WG11M40059/WG1M74006, 7(8):11, 2017.
[10]
Eugene d'Eon, Bob Harrison, Taos Myers, and Philip A Chou. Common test conditions for point cloud compression. ISO/IEC JTC1/SC29/WG11 w17766, 2018.
[11]
Danillo Graziosi, Ohji Nakagami, Satoru Kuma, Alexandre Zaghetto, Teruhiko Suzuki, and Ali Tabatabai. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Transactions on Signal and Information Processing, 9:e13, Apr 2020.
[12]
Serhan Gül, Dimitri Podborski, Thomas Buchholz, Thomas Schierl, and Cornelius Hellge. Low-latency cloud-based volumetric video streaming using head motion prediction. In Proceedings of the 30th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video, NOSSDAV 2020, Istanbul, Turkey, June 10-11, 2020, pages 27--33. ACM, 2020.
[13]
Bo Han, Yu Liu, and Feng Qian. ViVo: visibility-aware mobile volumetric video streaming. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, MobiCom 2020, London, United Kingdom, September 21-25, 2020, pages 11:1-11:13. ACM, 2020.
[14]
Hugues Hoppe, Tony DeRose, Tom Duchamp, John McDonald, and Werner Stuetzle. Surface reconstruction from unorganized points. In Proceedings of the 19th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH 1992, Chicago, IL, USA, July 27-31, 1992, pages 71--78. ACM, 1992.
[15]
Te-Yuan Huang, Ramesh Johari, Nick McKeown, Matthew Trunnell, and Mark Watson. A buffer-based approach to rate adaptation: evidence from a large video streaming service. In Proceedings of the 2014 ACM conference on SIGCOMM, SIGCOMM 2014, Chicago, IL, USA, August 17-22, 2014, pages 187--198. ACM, 2014.
[16]
Yakun Huang, Yuanwei Zhu, Xiuquan Qiao, Zhijie Tan, and Boyuan Bai. AITransfer: Progressive AI-powered transmission for real-time point cloud video streaming. In Proceedings of the 29th ACM Multimedia Conference, MM 2021, Virtual Event, China, October 20 - 24, 2021, pages 3989--3997. ACM, 2021.
[17]
Gazi Karam Illahi, Ashutosh Vaishnav, Teemu Kämäräinen, Matti Siekkinen, and Mario Di Francesco. Learning to predict head pose in remotely-rendered virtual reality. In Proceedings of the 14th Conference on ACM Multimedia Systems, MMSys 2023, Vancouver, BC, Canada, June 7-10, 2023, pages 27--38. ACM, 2023.
[18]
Kyungjin Lee, Juheon Yi, Youngki Lee, Sunghyun Choi, and Young Min Kim. GROOT: a real-time streaming system of high-fidelity volumetric videos. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, MobiCom 2020, London, United Kingdom, September 21-25, 2020, pages 57:1-57:14. ACM, 2020.
[19]
Jie Li, Cong Zhang, Zhi Liu, Richang Hong, and Han Hu. Optimal volumetric video streaming with hybrid saliency based tiling. IEEE Transactions on Multimedia, 25:2939--2953, Feb 2023.
[20]
Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, and Pheng-Ann Heng. PU-GAN: A point cloud upsampling adversarial network. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019, pages 7202--7211. IEEE, 2019.
[21]
Donald Meagher. Geometric modeling using octree encoding. Computer Graphics and Image Processing, 19(2):129--147, Jun 1982.
[22]
Jingliang Peng and Chung J. Kuo. Octree-based progressive geometry encoder. In Internet Multimedia Management Systems IV, volume 5242, pages 301--311. SPIE, Nov 2003.
[23]
David Pisinger. A minimal algorithm for the multiple-choice knapsack problem. European Journal of Operational Research, 83(2):394--410, Jan 1995.
[24]
Feng Qian, Bo Han, Jarrell Pair, and Vijay Gopalakrishnan. Toward practical volumetric video streaming on commodity smartphones. In Proceedings of the 20th International Workshop on Mobile Computing Systems and Applications, HotMobile 2019, Santa Cruz, CA, USA, February 27-28, 2019, pages 135--140. ACM, 2019.
[25]
Maurice Quach, Jiahao Pang, Dong Tian, Giuseppe Valenzise, and Frederic Dufaux. Survey on deep learning-based point cloud compression. Frontiers in Signal Processing, 2, Feb 2022.
[26]
Maurice Quach, Giuseppe Valenzise, and Frédéric Dufaux. Learning convolutional transforms for lossy point cloud geometry compression. In Proceedings of the 2019 IEEE International Conference on Image Processing, ICIP 2019, Taipei, Taiwan, September 22-25, 2019, pages 4320--4324. IEEE, 2019.
[27]
Zizheng Que, Guo Lu, and Dong Xu. VoxelContext-Net: An Octree based framework for point cloud compression. In Proceedings of the 2021 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021, pages 6042--6051.Computer Vision Foundation / IEEE, 2021.
[28]
Darijo Raca, Jason J. Quinlan, Ahmed H. Zahran, and Cormac J. Sreenan. Beyond throughput: a 4g LTE dataset with channel and context metrics. In Proceedings of the 9th ACM Multimedia Systems Conference, MMSys 2018, Amsterdam, The Netherlands, June 12-15, 2018, pages 460--465. ACM, 2018.
[29]
Silvia Rossi, Irene Viola, and Pablo César. Behavioural analysis in a 6-DoF VR system: Influence of content, quality and user disposition. In Proceedings of the 1st Workshop on Interactive eXtended Reality, IXR@MM 2022, Lisbon, Portugal, 14 October 2022, pages 3--10. ACM, 2022.
[30]
Michael Rudolph and Amr Rizk. View-adaptive streaming of point cloud scenes through combined decomposition and video-based coding. In Proceedings of the 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis, APCCPA 2022, October 28 - November 1, Lisbon, Portugal, APCCPA '22, page 41-49, New York, NY, USA, 2022. Association for Computing Machinery.
[31]
Xihua Sheng, Li Li, Dong Liu, Zhiwei Xiong, Zhu Li, and Feng Wu. Deep-PCAC: An end-to-end deep lossy compression framework for point cloud attributes. IEEE Transactions on Multimedia, 24:2617--2632, Jun 2022.
[32]
Yuang Shi, Pranav Venkatram, Yifan Ding, and Wei Tsang Ooi. Enabling low bitrate MPEG V-PCC-encoded volumetric video streaming with 3D sub-sampling. In Proceedings of the 14th Conference on ACM Multimedia Systems, MMSys 2023, Vancouver, BC, Canada, June 7-10, 2023, pages 108--118. ACM, 2023.
[33]
Shishir Subramanyam, Irene Viola, Alan Hanjalic, and Pablo César. User centered adaptive streaming of dynamic point clouds with low complexity tiling. In Proceedings of the 28th ACM International Conference on Multimedia, MM 2020, Virtual Event / Seattle, WA, USA, October 12-16, 2020, pages 3669--3677. ACM, 2020.
[34]
Cheng-Hao Wu, Chih-Fan Hsu, Tzu-Kuan Hung, Carsten Griwodz, Wei Tsang Ooi, and Cheng-Hsin Hsu. Quantitative comparison of point cloud compression algorithms with PCC arena. IEEE Transactions on Multimedia, 25:3073--3088, Feb 2023.
[35]
Praveen Kumar Yadav, Arash Shafiei, and Wei Tsang Ooi. QUETRA: A queuing theory approach to DASH rate adaptation. In Proceedings of the 2017 ACM on Multimedia Conference, MM 2017, Mountain View, CA, USA, October 23-27, 2017, pages 1130--1138. ACM, 2017.
[36]
Wei Yan, Shan Liu, Thomas H Li, Zhu Li, Ge Li, et al. Deep autoencoder-based lossy geometry compression for point clouds. arXiv preprint arXiv:1905.03691, 2019.
[37]
Mengyu Yang, Zhenxiao Luo, Miao Hu, Min Chen, and Di Wu. A comparative measurement study of point cloud-based volumetric video codecs. IEEE Transactions on Broadcasting, 69(3):715--726, May 2023.
[38]
Wang Yifan, Shihao Wu, Hui Huang, Daniel Cohen-Or, and Olga Sorkine-Hornung. Patch-based progressive 3D point set upsampling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5958--5967, 2019.
[39]
Emin Zerman, Radhika Kulkarni, and Aljosa Smolic. User behaviour analysis of volumetric video in augmented reality. In Proceedings of the 13th International Conference on Quality of Multimedia Experience, QoMEX 2021, Montreal, Canada, June 14-17, 2021, pages 129--132. IEEE, 2021.
[40]
Anlan Zhang, Chendong Wang, Bo Han, and Feng Qian. YuZu: Neural-enhanced volumetric video streaming. In Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2022, Renton, WA, USA, April 4-6, 2022, pages 137--154. USENIX Association, 2022.
[41]
Wenjie Zhu, Zhan Ma, Yiling Xu, Li Li, and Zhu Li. View-dependent dynamic point cloud compression. IEEE Transactions on Circuits and Systems for Video Technology, 31(2):765--781, Feb 2021.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MMSys '24: Proceedings of the 15th ACM Multimedia Systems Conference
April 2024
557 pages
ISBN:9798400704123
DOI:10.1145/3625468
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2024

Check for updates

Author Tags

  1. Volumetric video
  2. media streaming
  3. video-based point cloud compression

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Singapore Ministry of Education

Conference

MMSys '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 176 of 530 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 438
    Total Downloads
  • Downloads (Last 12 months)438
  • Downloads (Last 6 weeks)64
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media