[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3664647.3689172acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
abstract

Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large Models

Published: 28 October 2024 Publication History

Abstract

Point clouds have the strong capability for modeling 3D objects and scenes, which can be widely used in diverse applications and thus generate the burdens of transmission and storage. Efficient compression algorithms have been explored extensively, and research efforts have also been invested to enhancement algorithms. Moreover, the quality of point clouds can influence 3D analysis tasks, e.g., classification, segmentation, detection, and multimodal understanding, etc. Recent 3D multimodal large models can bring better perception optimizations. This tutorial will provide the fundamental knowledge for point cloud compression, enhancement and applications, and place emphasis on the influences of point cloud quality to human and machine perceptions. We will also discuss the progress of international standards and open source projects for point cloud technologies. From this tutorial, audiences are expected to grasp the basic knowledge and recent progress of point cloud technologies, and promote the research developments in both academia and industrial communities.

References

[1]
Songlin Fan and Wei Gao. 2023. Screen-based 3D Subjective Experiment Software. In Proceedings of the 31st ACM International Conference on Multimedia. 9672--9675.
[2]
Chunyang Fu, Ge Li, Rui Song, Wei Gao, and Shan Liu. 2022. Octattention: Octreebased large-scale contexts model for point cloud compression. In Proceedings of the AAAI conference on artificial intelligence, Vol. 36. 625--633.
[3]
Wei Gao, Ge Li, Hui Yuan, Raouf Hamzaoui, Zhu Li, and Shan Liu. 2022. APCCPA? 22: 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis. In Proceedings of the 30th ACM International Conference on Multimedia. 7392--7393.
[4]
Wei Gao, Hua Ye, Ge Li, Huiming Zheng, Yuyang Wu, and Liang Xie. 2022. OpenPointCloud: An open-source algorithm library of deep learning based point cloud compression. In Proceedings of the 30th ACM international conference on multimedia. 7347--7350.
[5]
Wei Gao, Hang Yuan, Ge Li, Zhu Li, and Hui Yuan. 2023. Low Complexity Coding Unit Decision for Video-Based Point Cloud Compression. IEEE Transactions on Image Processing 33 (2023), 149--162.
[6]
Ge Li, Gao Wei, and Wen Gao. 2024. Point Cloud Compression: Technologies and Standardization. Springer Nature.
[7]
Zhuangzi Li, Ge Li, Thomas H Li, Shan Liu, and Wei Gao. 2022. Semantic point cloud upsampling. IEEE Transactions on Multimedia (2022).
[8]
Wang Liu, Wei Gao, and Xingming Mu. 2024. Fast inter-frame motion prediction for compressed dynamic point cloud attribute enhancement. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 3720--3728.
[9]
Xijing Lu and Wei Gao. 2023. AttentiveNet: Detecting Small Objects for Li-DAR Point Clouds by Attending to Important Points. In 2023 IEEE International Conference on Visual Communications and Image Processing.
[10]
Shuqing Luo and Wei Gao. 2024. A general framework for rotation invariant point cloud analysis. In ICASSP 2024--2024 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 3665--3669.
[11]
Zhiyi Pan, Nan Zhang, Wei Gao, Shan Liu, and Ge Li. 2024. Less is more: label recommendation for weakly supervised point cloud semantic segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 4397--4405.
[12]
Changhao Peng and Wei Gao. 2024. Laplacian Matrix Learning for Point Cloud Attribute Compression with Ternary Search-Based Adaptive Block Partition. In Proceedings of the 32nd ACM International Conference on Multimedia.
[13]
Zhiyang Qi and Wei Gao. 2024. Variable-rate point cloud geometry compression based on feature adjustment and interpolation. In 2024 Data Compression Conference. IEEE, 63--72.
[14]
Yiting Shao, Ge Li, Qi Zhang, Wei Gao, and Shan Liu. 2023. Nonrigid Registration-Based Progressive Motion Compensation for Point Cloud Geometry Compression. IEEE Transactions on Geoscience and Remote Sensing (2023).
[15]
Yiting Shao, Xiaodong Yang, Wei Gao, Shan Liu, and Ge Li. 2024. 3D Point Cloud Attribute Compression Using Diffusion-based Texture-aware Intra Prediction. IEEE Transactions on Circuits and Systems for Video Technology (2024).
[16]
Fangyu Shen and Wei Gao. 2021. A rate control algorithm for video-based point cloud compression. In 2021 International Conference on Visual Communications and Image Processing. IEEE, 1--5.
[17]
Fei Song, Ge Li, Wei Gao, and Thomas H Li. 2022. Rate-distortion optimized graph for point cloud attribute coding. IEEE Signal Processing Letters (2022).
[18]
Fei Song, Ge Li, Xiaodong Yang, Wei Gao, and Shan Liu. 2023. Block-adaptive point cloud attribute coding with region-aware optimized transform. IEEE Transactions on Circuits and Systems for Video Technology (2023).
[19]
Jilong Wang, Wei Gao, and Ge Li. 2023. Applying collaborative adversarial learning to blind point cloud quality measurement. IEEE Transactions on Instrumentation and Measurement (2023).
[20]
Jilong Wang, Wei Gao, and Ge Li. 2024. Zoom to Perceive Better: No-reference Point Cloud Quality Assessment via Exploring Effective Multiscale Feature. IEEE Transactions on Circuits and Systems for Video Technology (2024).
[21]
Yang Wang, Wei Gao, Xingming Mu, and Hang Yuan. 2023. Rate Control Optimization for Joint Geometry and Attribute Coding of LiDAR Point Clouds. In 2023 IEEE International Conference on Visual Communications and Image Processing.
[22]
Liang Xie and Wei Gao. 2024. Learning PCC: A PyTorch Library for Learning-Based Point Cloud Compression. In Proceedings of the 32nd ACM International Conference on Multimedia.
[23]
Liang Xie and Wei Gao. 2024. PCHMVision: An Open-Source Library of Point Cloud Compression for Human and Machine Vision. In Proceedings of the 32nd ACM International Conference on Multimedia.
[24]
Liang Xie, Wei Gao, Songlin Fan, and Zhaojian Yao. 2024. Pdnet: Parallel dualbranch network for point cloud geometry compression and analysis. In 2024 Data Compression Conference. IEEE, 596--596.
[25]
Liang Xie, Wei Gao, Huiming Zheng, and Ge Li. 2024. ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision. In Proceedings of the 32nd ACM International Conference on Multimedia.
[26]
Liang Xie, Wei Gao, Huiming Zheng, and Ge Li. 2024. SPCGC: Scalable Point Cloud Geometry Compression for Machine Vision. In Proceedings of IEEE International Conference on Robotics and Automation.
[27]
Liang Xie, Wei Gao, Huiming Zheng, and Hua Ye. 2024. Semantic-aware visual decomposition for point cloud geometry compression. In 2024 Data Compression Conference. IEEE, 595--595.
[28]
Dinghao Yang, Wei Gao, Ge Li, Hui Yuan, Junhui Hou, and Sam Kwong. 2023. Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds. ACM Transactions on Multimedia Computing, Communications and Applications 19, 1s (2023), 1--21.
[29]
Zetao Yang, Wei Gao, and Xijing Lu. 2023. DANet: Density-Adaptive Network for Geometry-based Point Cloud Compression Artifacts Removal. In 2023 IEEE International Conference on Visual Communications and Image Processing.
[30]
Zhuozhen Yu and Wei Gao. 2024. When dynamic neural network meets point cloud compression: Computation-aware variable rate and checkerboard context. In 2024 Data Compression Conference. IEEE, 600--600.
[31]
Nan Zhang, Zhiyi Pan, Thomas H Li, Wei Gao, and Ge Li. 2023. Improving graph representation for point cloud segmentation via attentive filtering. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1244--1254.
[32]
Ruonan Zhang, Jingyi Chen, Wei Gao, Ge Li, and Thomas H Li. 2022. Pointot: Interpretable geometry-inspired point cloud generative model via optimal transport. IEEE Transactions on Circuits and Systems for Video Technology (2022).
[33]
Ruonan Zhang, Wei Gao, Ge Li, and Thomas H Li. 2022. Qinet: Decision surface learning and adversarial enhancement for quasi-immune completion of diverse corrupted point clouds. IEEE Transactions on Geoscience and Remote Sensing 60 (2022), 1--14.
[34]
Xiaoyu Zhang, Guibiao Liao, Wei Gao, and Ge Li. 2022. TDRnet: Transformerbased dual-branch restoration network for geometry based point cloud compression artifacts. In 2022 IEEE International Conference on Multimedia and Expo. IEEE, 1--6.
[35]
Yongchi Zhang, Wei Gao, and Ge Li. 2022. OpenPointCloud-V2: A deep learning based open-source algorithm library of point cloud processing. In Proceedings of the 1st InternationalWorkshop on Advances in Point Cloud Compression, Processing and Analysis. 51--55.
[36]
Huiming Zheng, Wei Gao, Zhuozhen Yu, Tiesong Zhao, and Ge Li. 2024. View-PCGC: View-Guided Learned Point Cloud Geometry Compression. In Proceedings of the 32nd ACM International Conference on Multimedia.

Index Terms

  1. Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large Models

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
      October 2024
      11719 pages
      ISBN:9798400706868
      DOI:10.1145/3664647
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 October 2024

      Check for updates

      Author Tags

      1. 3d human and machine perception
      2. large models
      3. point cloud applications
      4. point cloud compression
      5. point cloud enhancement

      Qualifiers

      • Abstract

      Funding Sources

      • Natural Science Foundation of China
      • Guangdong Basic and Applied Basic Research Foundation
      • Shenzhen Science and Technology Program
      • CAAI-MindSpore Open Fund, developed on OpenI Community
      • The Major Key Project of PCL
      • Guangdong Province Pearl River Talent Program

      Conference

      MM '24
      Sponsor:
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne VIC, Australia

      Acceptance Rates

      MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
      Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 110
        Total Downloads
      • Downloads (Last 12 months)110
      • Downloads (Last 6 weeks)86
      Reflects downloads up to 13 Dec 2024

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media