[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3664647.3681225acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

ViewPCGC: View-Guided Learned Point Cloud Geometry Compression

Published: 28 October 2024 Publication History

Abstract

With the rise of immersive media applications such as digital museums, virtual reality, and interactive exhibitions, point clouds, as a three-dimensional data storage format, have gained increasingly widespread attention. The massive data volume of point clouds imposes extremely high requirements on transmission bandwidth in the above applications, gradually becoming a bottleneck for immersive media applications. Although existing learning-based point cloud compression methods have achieved specific successes in compression efficiency by mining the spatial redundancy of their local structural features, these methods often overlook the intrinsic connections between point cloud data and other modality data (such as image modality), thereby limiting further improvements in compression efficiency. To address the limitation, we innovatively propose a view-guided learned point cloud geometry compression scheme, namely ViewPCGC. We adopt a novel self-attention mechanism and cross-modality attention mechanism based on sparse convolution to align the modality features of the point cloud and the view image, removing view redundancy through Modality Redundancy Removal Module (MRRM). Simultaneously, side information of the view image is introduced into the Conditional Checkboard Entropy Model (CCEM), significantly enhancing the accuracy of the probability density function estimation for point cloud geometry. In addition, we design a View-Guided Quality Enhancement Module (VG-QEM) in the decoder, utilizing the contour information of the point cloud in the view image to supplement reconstruction details. The superior experimental performance demonstrates the effectiveness of our method. Compared to the state-of-the-art point cloud geometry compression methods, ViewPCGC exhibits an average performance gain exceeding 10% on D1-PSNR metric.

References

[1]
Evangelos Alexiou, Nanyang Yang, and Touradj Ebrahimi. 2020. PointXR: A toolbox for visualization and subjective evaluation of point clouds in virtual reality. In 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 1--6.
[2]
Johannes Ballé, Valero Laparra, and Eero P Simoncelli. 2016. End-to-end optimized image compression. arXiv preprint arXiv:1611.01704 (2016).
[3]
Marcin Barszcz, Krzysztof Dziedzic, Maria Skublewska-Paszkowska, and Pawel Powroznik. 2023. 3D scanning digital models for virtual museums. Computer Animation and Virtual Worlds, Vol. 34, 3--4 (2023), e2154.
[4]
G. Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. VCEG-M33, Austin, TX, USA (2001).
[5]
Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015).
[6]
Loop Charles, Cai Qin, O.Escolano Sergio, and A. Chou Philip. May 2016. Microsoft voxelized upper bodies - a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document m38673/M72012 ( May 2016).
[7]
Hui Chen and Yipeng Zuo. 2022. 3D-ARNet: An accurate 3D point cloud reconstruction network from a single-image. Multimedia Tools and Applications, Vol. 81, 9 (2022), 12127--12140.
[8]
Minghui Chen, Pingping Zhang, Zhuo Chen, Yun Zhang, Xu Wang, and Sam Kwong. 2022. End-to-end depth map compression framework via rgb-to-depth structure priors learning. In 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 3206--3210.
[9]
Jun Cheng, Honglei Su, and Jari Korhonen. 2023. No-Reference Point Cloud Quality Assessment via Weighted Patch Quality Prediction. arXiv preprint arXiv:2305.07829 (2023).
[10]
Christopher Choy, JunYoung Gwak, and Silvio Savarese. 2019. 4d spatio-temporal convnets: Minkowski convolutional neural networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3075--3084.
[11]
Xin Deng, Wenzhe Yang, Ren Yang, Mai Xu, Enpeng Liu, Qianhan Feng, and Radu Timofte. 2021. Deep homography for efficient stereo image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1492--1501.
[12]
Eugene d'Eon, Bob Harrison, Taos Myers, and Philip A. Chou. May 2016. 8i Voxelized Full Bodies - A Voxelized Point Cloud Dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document m38673/M72012 ( May 2016).
[13]
Achref Doula, Tobias Güdelhöfer, Andrii Matviienko, Max Mühlhäuser, and Alejandro Sanchez Guinea. 2022. Immersive-labeler: Immersive annotation of large-scale 3d point clouds in virtual reality. In ACM SIGGRAPH 2022 Posters. 1--2.
[14]
Chunyang Fu, Ge Li, Rui Song, Wei Gao, and Shan Liu. 2022. Octattention: Octree-based large-scale contexts model for point cloud compression. In Proceedings of the AAAI conference on artificial intelligence, Vol. 36. 625--633.
[15]
Linyao Gao, Tingyu Fan, Jianqiang Wan, Yiling Xu, Jun Sun, and Zhan Ma. 2021. Point cloud geometry compression via neural graph sampling. In 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 3373--3377.
[16]
Wei Gao, Shangkun Sun, Huiming Zheng, Yuyang Wu, Hua Ye, and Yongchi Zhang. 2023. OpenDMC: An open-source library and performance evaluation for deep-learning-based multi-frame compression. In Proceedings of the 31st ACM International Conference on Multimedia. 9685--9688.
[17]
Wei Gao, Hua Ye, Ge Li, Huiming Zheng, Yuyang Wu, and Liang Xie. 2022. Openpointcloud: An open-source algorithm library of deep learning based point cloud compression. In Proceedings of the 30th ACM international conference on multimedia. 7347--7350.
[18]
Wei Gao, Hang Yuan, Ge Li, Zhu Li, and Hui Yuan. 2023. Low complexity coding unit decision for video-based point cloud compression. IEEE Transactions on Image Processing, Vol. 33 (2023), 149--162.
[19]
Danillo Graziosi, Ohji Nakagami, Satoru Kuma, Alexandre Zaghetto, Teruhiko Suzuki, and Ali Tabatabai. 2020. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Transactions on Signal and Information Processing, Vol. 9 (2020), e13.
[20]
Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, and Hongwei Qin. 2021. Checkerboard context model for efficient learned image compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14771--14780.
[21]
Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, and Yanwei Fu. 2022. Density-preserving deep point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2333--2342.
[22]
Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu, and Raquel Urtasun. 2020. Octsqueeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1313--1323.
[23]
Tianxin Huang and Yong Liu. 2019. 3d point cloud geometry compression on deep learning. In Proceedings of the 27th ACM international conference on multimedia. 890--898.
[24]
Alireza Javaheri, Catarina Brites, Fernando Pereira, and Jo ao Ascenso. 2020. A Generalized Hausdorff Distance Based Quality Metric for Point Cloud Geometry. In International Conference on Quality of Multimedia Experience. IEEE, 1--6.
[25]
Wei Jiang, Jiayu Yang, Yongqi Zhai, Peirong Ning, Feng Gao, and Ronggang Wang. 2023. Mlic: Multi-reference entropy model for learned image compression. In Proceedings of the 31st ACM International Conference on Multimedia. 7618--7627.
[26]
Xuhao Jiang, Weimin Tan, Tian Tan, Bo Yan, and Liquan Shen. 2023. Multi-modality deep network for extreme learned image compression. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 1033--1041.
[27]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[28]
Yuhuan Lin, Tongda Xu, Ziyu Zhu, Yanghao Li, Zhe Wang, and Yan Wang. 2023. Your Camera Improves Your Point Cloud Compression. In ICASSP 2023--2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1--5.
[29]
Jerry Liu, Shenlong Wang, and Raquel Urtasun. 2019. Dsic: Deep stereo image compression. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 3136--3145.
[30]
Guo Lu, Tianxiong Zhong, Jing Geng, Qiang Hu, and Dong Xu. 2022. Learning based multi-modality image and video compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6083--6092.
[31]
Luke Melas-Kyriazi, Christian Rupprecht, and Andrea Vedaldi. 2023. Pc2: Projection-conditioned point cloud diffusion for single-image 3d reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12923--12932.
[32]
David Minnen, Johannes Ballé, and George D Toderici. 2018. Joint autoregressive and hierarchical priors for learned image compression. Advances in neural information processing systems, Vol. 31 (2018).
[33]
MPEG. 2017. MPEG-PCC-TMC13. https://github.com/MPEGGroup/mpeg-pcc-tmc13 (2017).
[34]
Xingming Mu, Wei Gao, Hang Yuan, Shunzhou Wang, and Ge Li. 2024. Adaptive LPU Decision for Dynamic Point Cloud Compression. IEEE Signal Processing Letters (2024).
[35]
Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021. Learning-based lossless compression of 3d point cloud geometry. In ICASSP 2021--2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4220--4224.
[36]
Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021. Multiscale deep context modeling for lossless point cloud geometry compression. In 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 1--6.
[37]
Rachna Pathak, Anil Saini, Arnav Wadhwa, Himanshu Sharma, and Dhiraj Sangwan. 2021. An object detection approach for detecting damages in heritage sites using 3-D point clouds and 2-D visual data. Journal of Cultural Heritage, Vol. 48 (2021), 74--82.
[38]
JPEG Pleno PCC. 2019. JPEG Pleno Point Cloud Coding Common Test Conditions. JPEG (ISO/IEC JTC 1/SC 29/WG1) (2019).
[39]
Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 652--660.
[40]
Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. Pointnet: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems, Vol. 30 (2017).
[41]
Shiyu Qin, Bin Chen, Yujun Huang, Baoyi An, Tao Dai, and Shu-Tao Via. 2023. Perceptual Image Compression with Cooperative Cross-Modal Side Information. arXiv preprint arXiv:2311.13847 (2023).
[42]
Maurice Quach, Giuseppe Valenzise, and Frederic Dufaux. 2020. Improved deep point cloud geometry compression. In 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP). IEEE, 1--6.
[43]
Sebastian Schwarz, Philip A. Chou, and Indranil Sinharoy. 2018. Common test conditions for point cloud compression. ISO/IEC JTC1/SC29/WG11 N18474 (2018).
[44]
Yashbir Singh, Colleen M Farrelly, Quincy A Hathaway, Tim Leiner, Jaidip Jagtap, Gunnar E Carlsson, and Bradley J Erickson. 2023. Topological data analysis in medical imaging: current state of the art. Insights into Imaging, Vol. 14, 1 (2023), 58.
[45]
Maria Skublewska-Paszkowska, Marek Milosz, Pawel Powroznik, and Edyta Lukasik. 2022. 3D technologies for intangible cultural heritage preservation?literature review for selected databases. Heritage Science, Vol. 10, 1 (2022), 3.
[46]
Fei Song, Yiting Shao, Wei Gao, Haiqiang Wang, and Thomas Li. 2021. Layer-wise geometry aggregation framework for lossless LiDAR point cloud compression. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4603--4616.
[47]
Rui Song, Chunyang Fu, Shan Liu, and Ge Li. 2023. Efficient Hierarchical Entropy Model for Learned Point Cloud Compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14368--14377.
[48]
Lvfang Tao, Wei Gao, Ge Li, and Chenhao Zhang. 2023. Adanic: Towards practical neural image compression via dynamic transform routing. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 16879--16888.
[49]
Sinh Van Nguyen, Son Thanh Le, Minh Khai Tran, and Ha Manh Tran. 2022. Reconstruction of 3D digital heritage objects for VR and AR applications. Journal of Information and Telecommunication, Vol. 6, 3 (2022), 254--269.
[50]
Sulaiman Vesal, Mingxuan Gu, Ronak Kosti, Andreas Maier, and Nishant Ravikumar. 2021. Adapt everywhere: unsupervised adaptation of point-clouds and entropy minimization for multi-modal cardiac image segmentation. IEEE Transactions on Medical Imaging, Vol. 40, 7 (2021), 1838--1851.
[51]
Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, and Zhan Ma. 2022. Sparse tensor-based multiscale representation for point cloud geometry compression. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022).
[52]
Jianqiang Wang, Dandan Ding, Zhu Li, and Zhan Ma. 2021. Multiscale point cloud geometry compression. In 2021 Data Compression Conference (DCC). IEEE, 73--82.
[53]
Jilong Wang, Wei Gao, and Ge Li. 2024. Zoom to Perceive Better: No-reference Point Cloud Quality Assessment via Exploring Effective Multiscale Feature. IEEE Transactions on Circuits and Systems for Video Technology (2024).
[54]
Jianqiang Wang, Hao Zhu, Haojie Liu, and Zhan Ma. 2021. Lossy point cloud geometry compression via end-to-end learning. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4909--4923.
[55]
Matthias Wödlinger, Jan Kotera, Jan Xu, and Robert Sablatnig. 2022. Sasic: Stereo image compression with latent shifts and stereo attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 661--670.
[56]
Yuyang Wu, Zhiyang Qi, Huiming Zheng, Lvfang Tao, and Wei Gao. 2021. Deep image compression with latent optimization and piece-wise quantization approximation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1926--1930.
[57]
Liang Xie, Wei Gao, and Huiming Zheng. 2022. End-to-End point cloud geometry compression and analysis with sparse tensor. In Proceedings of the 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis. 27--32.
[58]
Liang Xie, Wei Gao, Huiming Zheng, and Ge Li. 2024. SPCGC: Scalable Point Cloud Geometry Compression for Machine Vision. In 2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 17272--17278.
[59]
Liang Xie, Wei Gao, Huiming Zheng, and Hua Ye. 2024. Semantic-aware visual decomposition for point cloud geometry compression. In 2024 Data Compression Conference (DCC). IEEE, 595--595.
[60]
Wei Yan, Shan Liu, Thomas H Li, Zhu Li, Ge Li, et al. 2019. Deep autoencoder-based lossy geometry compression for point clouds. arXiv preprint arXiv:1905.03691 (2019).
[61]
Xu Yi, Lu Yao, and Wen Ziyu. October 2017. Owlii Dynamic human mesh sequence dataset. ISO/IEC JTC1/SC29/WG11 (MPEG/JPEG) m41658 ( October 2017).
[62]
Qian Yu, Chengzhuan Yang, and Hui Wei. 2022. Part-wise AtlasNet for 3D point cloud reconstruction from a single image. Knowledge-Based Systems, Vol. 242 (2022), 108395.
[63]
Xinjie Zhang, Shenyuan Gao, Zhening Liu, Xingtong Ge, Dailan He, Tongda Xu, Yan Wang, and Jun Zhang. 2024. Content-aware Masked Image Modeling Transformer for Stereo Image Compression. arXiv preprint arXiv:2403.08505 (2024).
[64]
Huiming Zheng and Wei Gao. 2024. End-to-End RGB-D Image Compression via Exploiting Channel-Modality Redundancy. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 7562--7570.

Cited By

View all
  • (2024)Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large ModelsProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3689172(11292-11293)Online publication date: 28-Oct-2024
  • (2024)Open-Source Projects for 3D Point CloudsDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_9(255-272)Online publication date: 10-Oct-2024
  • (2024)Point Cloud-Language Multi-modal LearningDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_8(227-254)Online publication date: 10-Oct-2024
  • Show More Cited By

Index Terms

  1. ViewPCGC: View-Guided Learned Point Cloud Geometry Compression

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
    October 2024
    11719 pages
    ISBN:9798400706868
    DOI:10.1145/3664647
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 October 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep learning
    2. geometry compression
    3. multimodal learning
    4. point cloud

    Qualifiers

    • Research-article

    Funding Sources

    • The Major Key Project of PCL
    • Natural Science Foundation of China
    • Shenzhen Science and Technology Program
    • CAAI-MindSpore Open Fund
    • Guangdong Province Pearl River Talent Program
    • Guangdong Basic and Applied Basic Research Foundation

    Conference

    MM '24
    Sponsor:
    MM '24: The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne VIC, Australia

    Acceptance Rates

    MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)71
    • Downloads (Last 6 weeks)49
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large ModelsProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3689172(11292-11293)Online publication date: 28-Oct-2024
    • (2024)Open-Source Projects for 3D Point CloudsDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_9(255-272)Online publication date: 10-Oct-2024
    • (2024)Point Cloud-Language Multi-modal LearningDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_8(227-254)Online publication date: 10-Oct-2024
    • (2024)Point Cloud Pre-trained Models and Large ModelsDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_7(195-225)Online publication date: 10-Oct-2024
    • (2024)Deep-Learning-Based Point Cloud Analysis IIDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_6(163-193)Online publication date: 10-Oct-2024
    • (2024)Deep-Learning-Based Point Cloud Analysis IDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_5(131-162)Online publication date: 10-Oct-2024
    • (2024)Deep-Learning-Based Point Cloud Enhancement IIDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_4(99-130)Online publication date: 10-Oct-2024
    • (2024)Deep-Learning-based Point Cloud Enhancement IDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_3(71-97)Online publication date: 10-Oct-2024
    • (2024)Learning Basics for 3D Point CloudsDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_2(29-70)Online publication date: 10-Oct-2024
    • (2024)Future Work on Deep Learning-Based Point Cloud TechnologiesDeep Learning for 3D Point Clouds10.1007/978-981-97-9570-3_11(301-315)Online publication date: 10-Oct-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media