[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Weakly supervised semantic segmentation for point cloud based on view-based adversarial training and self-attention fusion

Published: 04 March 2024 Publication History

Abstract

Traditional methods of weakly supervised semantic segmentation (WSsegmentation) for point cloud scenes have several limitations including limited precision and difficulty in handling complex scenes due to imprecise labels or partial annotations. To address these issues, we perform view-based adversarial training on the original point cloud scene samples through view resampling and Gaussian noise perturbation to reduce overfitting. Combining a self-attention mechanism with multi-layer perceptrons and point cloud segmentation strategy, we can perform dimensionality enhancement and dimensionality reduction operations to better capture the local features of point cloud data. Finally, we can obtain the semantic segmentation results of point cloud scene by fusing local and global semantic features. In the design of the network loss function, we combine the Siamese loss and the smoothness loss and the cross-entropy loss to improve the ability and fidelity of semantic networks. Specifically, the Siamese loss is used to compute the distance between different augmented point cloud data in their feature embedding space and the smoothness loss is used to penalize the discontinuity of semantic information between adjacent regions. The proposed weakly supervised segmentation network achieves an overall segmentation accuracy close to fully supervised segmentation methods and outperforms most of the existing weakly supervised segmentation methods by 5% to 10% in scene segmentation in terms of mIoU on S3DIS, ShapeNet, and PartNet datasets. Extensive experiments demonstrate the robustness, effectiveness, and generalization of the proposed point cloud segmentation network.

Graphical abstract

Display Omitted

Highlights

We propose effective data augmentation methods, including view resampling and adversarial training, to tackle partial annotations.
To fully integrate the global and local features of point cloud, we employ high-dimensional feature self-attention mechanism and multi-layer perceptrons to fuse multi-level features.
We present a novel point cloud segmentation framework, which can achieve the higher segmentation accuracy than traditional segmentation networks for different point cloud scenes.
Extensive experiments demonstrate the robustness, effectiveness, and generalization of the proposed weakly supervised point cloud semantic segmentation network.

References

[1]
Miao Y., Xiao C., Geometric processing and shape modeling of 3D point-sampled models, Science Press, Beijing, 2014.
[2]
Zhu Y., Mottaghi R., Kolve E., Lim J.J., Gupta A., Fei-Fei L., Farhadi A., Target-driven visual navigation in indoor scenes using deep reinforcement learning, in: IEEE International conference on robotics and automation (ICRA), Singapore, May 29-Jun. 3, 2017, pp. 3357–3364.
[3]
Zhang Y., Qu Y., Xie Y., Li Z., Zheng S., Li C., Perturbed self-distillation: Weakly supervised large-scale point cloud semantic segmentation, in: IEEE/CVF international conference on computer vision (ICCV), Montreal, Canada, Oct. 10-17, 2021, pp. 15500–15508.
[4]
Kim S., Lee S., Hwang D., Lee J., Hwang S.J., Kim H.J., Point cloud augmentation with weighted local transformations, in: IEEE/CVF international conference on computer vision (ICCV), Montreal, Canada, Oct. 10-17, 2021, pp. 528–537.
[5]
Li R., Li X., Heng P.A., Fu C.W., PointAugment: An auto-augmentation framework for point cloud classification, in: IEEE/CVF conference on computer vision and pattern recognition (CVPR), Seattle, USA, Jun. 13-19, 2020, pp. 6377–6386.
[6]
Tchapmi L.P., Choy C.B., Armeni I., Gwak J., Savarese S., Segcloud: Semantic segmentation of 3d point clouds, in: International conference on 3D vision, Qingdao, China, Oct. 10-12, 2017, pp. 537–547.
[7]
Gong J., Ye Z., Ma L., Neighborhood co-occurrence modeling in 3d point cloud segmentation, Comput Vis Media 8 (2) (2022) 303–315.
[8]
Zhao Y., Ma X., Hu B., Zhang Q., Ye M., Zhou G., A large-scale point cloud semantic segmentation network via local dual features and global correlations, Comput Graph 111 (2023) 133–144.
[9]
Su H., Maji S., Kalogerakis E., Learned-Miller E., Multi-view convolutional neural networks for 3d shape recognition, in: IEEE international conference on computer vision (ICCV), Santiago, Chile, Dec.7-13, 2015, pp. 945–953.
[10]
Hou J., Dai A., Nießner M., 3D-SIS: 3D semantic instance segmentation of RGB-D scans, in: IEEE conference on computer vision and pattern recognition (CVPR), Long Beach, USA, Jun. 16-20, 2019, pp. 4421–4430.
[11]
Nan L., Xie K., Sharf A., A search-classify approach for cluttered indoor scene understanding, ACM Trans Graph 31 (6) (2012) 137:1–137:10.
[12]
Li Y., Dai A., Guibas L., Nießner M., Database-assisted object retrieval for real-time 3d reconstruction, Comput Graph Forum 34 (2) (2015) 435–446.
[13]
Mattausch O., Panozzo D., Mura C., Sorkine-Hornung O., Pajarola R., Object detection and classification from large-scale cluttered indoor scans, Comput Graph Forum 33 (2) (2014) 11–21.
[14]
Wu Y., Cai S., Yan Z., Li G., Yu Y., Han X., Cui S., Pointmatch: A consistency training framework for weakly supervised semantic segmentation of 3d point clouds, 2022, arXiv:2202.10705, 2022.
[15]
Guo M.H., Cai J.X., Liu Z.N., Mu T.J., Martin R.R., Hu S.M., PCT: Point cloud transformer, Comput Vis Media 7 (2) (2021) 187–199.
[16]
Vanian V., Zamanakos G., Pratikakis I., Improving performance of deep learning models for 3d point cloud semantic segmentation via attention mechanisms, Comput Graph 106 (2022) 277–287.
[17]
Hu Q., Yang B., Fang G., Guo Y., Leonardis A., Trigoni N., Markham A., Sqn: Weakly-supervised semantic segmentation of large-scale 3d point clouds., in: European conference on computer vision (ECCV), Tel Aviv, Israel; Oct. 23-27, 2022, pp. 600–619.
[18]
Zhou Z., Zhou Y., Wang D., Mu J., Zhou H., Self-attention feature fusion network for semantic segmentation, Neurocomputing 453 (2021) 50–59.
[19]
Chopra S., Hadsell R., LeCun Y., Learning a similarity metric discriminatively, with application to face verification, in: IEEE/CVF conference on computer vision and pattern recognition (CVPR), San Diego, USA, Jun. 20-25, 2005, pp. 539–546.
[20]
Armeni I., Sener O., Zamir A.R., Jiang H., Brilakis I., Fischer M., Savarese S., 3D semantic parsing of large-scale indoor spaces, in: IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, USA, Jun. 27-30, 2016, pp. 1534–1543.
[21]
Chang A.X., Funkhouser T.A., Guibas L.J., Hanrahan P., Huang Q., Li Z., Savarese S., Savva M., Song S., Su H., Xiao J., Yi L., Yu F., ShapeNet: An information-rich 3d model repository, 2015,. arXiv:1512.03012, 2015.
[22]
Mo K., Zhu S., Chang A.X., Yi L., Tripathi S., Guibas L.J., Su H., PartNet: A large-scale benchmark for fine-grained and hierarchical part-level 3d object understanding, in: IEEE conference on computer vision and pattern recognition (CVPR), Long Beach, USA, Jun. 16-20, 2019, pp. 909–918.
[23]
Qi C.R., Yi L., Su H., Guibas L.J., PointNet++: Deep hierarchical feature learning on point sets in a metric space, in: Annual conference on neural information processing systems (NeurIPS), Long Beach, USA, Dec. 4-9, 2017, pp. 5099–5108.
[24]
Phan A.V., Nguyen M.L., Nguyen Y.L.H., Bui L.T., DGCNN: A convolutional neural network over large-scale labeled graphs, Neural Netw 108 (2018) 533–543.
[25]
Li Y., Bu R., Sun M., Wu W., Di X., Chen B., Pointcnn: Convolution on X-transformed points, in: Annual conference on neural information processing systems (NeurIPS), Montréal, Canada, Dec. 3-8, 2018, pp. 828–838.
[26]
Dhanachandra N., Manglem K., Chanu Y.J., Image segmentation using K-means clustering algorithm and subtractive clustering algorithm, Procedia Comput Sci 54 (2015) 764–771.
[27]
Tarvainen A., Valpola H., Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, in: Annual conference on neural information processing systems (NeurIPS), Long Beach, USA, Dec. 4-9, 2017, pp. 1195–1204.
[28]
Xu X., Lee G.H., Weakly supervised semantic point cloud segmentation: Towards 10x fewer labels, in: IEEE/CVF conference on computer vision and pattern recognition (CVPR), Jun. 16-18, 2020, pp. 13706–13715.
[29]
Laine S., Aila T., Temporal ensembling for semi-supervised learning, 2016,. arXiv:1610.02242, 2016.
[30]
Su Y., Xu X., Jia K., Weakly supervised 3D point cloud segmentation via multi-prototype learning, IEEE Trans Circuits Syst Video Technol (2023),.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computers and Graphics
Computers and Graphics  Volume 116, Issue C
Nov 2023
518 pages

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 04 March 2024

Author Tags

  1. Point cloud
  2. Semantic segmentation
  3. Weakly supervised learning
  4. Self-attention mechanism
  5. Siamese loss
  6. Smoothness loss

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media