[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Unsupervised Detection of Distinctive Regions on 3D Shapes

Published: 31 May 2020 Publication History

Abstract

This article presents a novel approach to learn and detect distinctive regions on 3D shapes. Unlike previous works, which require labeled data, our method is unsupervised. We conduct the analysis on point sets sampled from 3D shapes, then formulate and train a deep neural network for an unsupervised shape clustering task to learn local and global features for distinguishing shapes with respect to a given shape set. To drive the network to learn in an unsupervised manner, we design a clustering-based nonparametric softmax classifier with an iterative re-clustering of shapes, and an adapted contrastive loss for enhancing the feature embedding quality and stabilizing the learning process. By then, we encourage the network to learn the point distinctiveness on the input shapes. We extensively evaluate various aspects of our approach and present its applications for distinctiveness-guided shape retrieval, sampling, and view selection in 3D scenes.

Supplementary Material

a158-li-suppl.pdf (li.zip)
Supplemental movie, appendix, image and software files for, Unsupervised Detection of Distinctive Regions on 3D Shapes

References

[1]
Martin Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A system for large-scale machine learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 265--283. https://www.usenix.org/system/files/conference/osdi16/osdi16-abadi.pdf.
[2]
Pierre Alliez, Mark Meyer, and Mathieu Desbrun. 2002. Interactive geometry remeshing. ACM Transactions on Graphics (SIGGRAPH) 21, 3 (2002), 347--354.
[3]
Marco Ancona, Enea Ceolini, Cengiz Oztireli, and Markus Gross. 2018. Towards better understanding of gradient-based attribution methods for deep neural networks. In International Conference on Learning Representations (ICLR).
[4]
Yasuhiro Aoki, Hunter Goforth, Rangaprasad Arun Srivatsan, and Simon Lucey. 2019. PointNetLK: Robust 8 efficient point cloud registration using PointNet. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 7163--7172.
[5]
Philip Bachman, R. Devon Hjelm, and William Buchwalter. 2019. Learning representations by maximizing mutual information across views. In International Conference on Neural Information Processing Systems (NIPS). 15509--15519.
[6]
Umberto Castellani, Marco Cristani, Simone Fantoni, and Vittorio Murino. 2008. Sparse points matching by combining 3D mesh saliency with statistical descriptors. Computer Graphics Forum (Eurographics) 27, 2 (2008), 643--652.
[7]
Xiaobai Chen, Abulhair Saparov, Bill Pang, and Thomas Funkhouser. 2012. Schelling points on 3D surface meshes. ACM Transactions on Graphics (SIGGRAPH) 31, 4 (2012), 29:1--29:12.
[8]
Carl Doersch, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei Efros. 2012. What makes Paris look like Paris? ACM Transactions on Graphics (SIGGRAPH) 31, 4 (2012), 101:1--101:9.
[9]
Helin Dutagaci, Chun Pan Cheung, and Afzal Godil. 2012. Evaluation of 3D interest point detection techniques via human-generated ground truth. The Visual Computer 28, 9 (2012), 901--917.
[10]
Ran Gal and Daniel Cohen-Or. 2006. Salient geometric features for partial shape matching and similarity. ACM Transactions on Graphics 25, 1 (2006), 130--150.
[11]
Michael Garland and Paul S. Heckbert. 1997. Surface simplification using quadric error metrics. In Proceedings of SIGGRAPH. 209--216.
[12]
Raia Hadsell, Sumit Chopra, and Yann LeCun. 2006. Dimensionality reduction by learning an invariant mapping. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1735--1742.
[13]
Olivier J. Hénaff, Ali Razavi, Carl Doersch, S. M. Eslami, and Aaron van den Oord. 2019. Data-efficient image recognition with contrastive predictive coding. arXiv preprint arXiv:1905.09272 (2019).
[14]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).
[15]
R. Devon Hjelm, Alex Fedorov, Samuel Lavoie-Marchildon, Karan Grewal, Phil Bachman, Adam Trischler, and Yoshua Bengio. 2019. Learning deep representations by mutual information estimation and maximization. In International Conference on Learning Representations (ICLR).
[16]
Binh-Son Hua, Minh-Khoi Tran, and Sai-Kit Yeung. 2018. Pointwise convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 984--993.
[17]
Mayank Juneja, Andrea Vedaldi, C. V. Jawahar, and Andrew Zisserman. 2013. Blocks that shout: Distinctive parts for scene classification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 923--930.
[18]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[19]
Truc Le and Ye Duan. 2018. PointGrid: A deep network for 3D shape understanding. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 9204--9214.
[20]
Chang Ha Lee, Amitabh Varshney, and David W. Jacobs. 2005. Mesh saliency. ACM Transactions on Graphics (SIGGRAPH) 24, 3 (2005), 659--666.
[21]
George Leifman, Elizabeth Shtrom, and Ayellet Tal. 2012. Surface regions of interest for viewpoint selection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 414--421.
[22]
Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. 2018. PointCNN: Convolution on -transformed points. In International Conference on Neural Information Processing Systems (NIPS). 828--838.
[23]
Yu Liu, Guanglu Song, Jing Shao, Xiao Jin, and Xiaogang Wang. 2018. Transductive centroid projection for semi-supervised large-scale recognition. In European Conference on Computer Vision (ECCV). 70--86.
[24]
Flora Ponjou Tasse, Jiri Kosinka, and Neil Dodgson. 2015. Cluster-based point set saliency. In IEEE International Conference on Computer Vision (ICCV). 163--171.
[25]
Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. 2017a. PointNet: Deep learning on point sets for 3D classification and segmentation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 652--660.
[26]
Charles R. Qi, Li Yi, Hao Su, and Leonidas J. Guibas. 2017b. PointNet++: Deep hierarchical feature learning on point sets in a metric space. In International Conference on Neural Information Processing Systems (NIPS). 5099--5108.
[27]
Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2017. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In IEEE International Conference on Computer Vision (ICCV). 618--626.
[28]
Yiru Shen, Chen Feng, Yaoqing Yang, and Dong Tian. 2018. Mining point cloud local structures by kernel correlation and graph pooling. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4548--4557.
[29]
Philip Shilane and Thomas Funkhouser. 2006. Selecting distinctive 3D shape descriptors for similarity retrieval. In IEEE International Conference on Shape Modeling and Applications (SMI). 18:1--18:10.
[30]
Philip Shilane and Thomas Funkhouser. 2007. Distinctive regions of 3D surfaces. ACM Transactions on Graphics 26, 2 (2007), 7:1--7:15.
[31]
Philip Shilane, Patrick Min, Michael Kazhdan, and Thomas Funkhouser. 2004. The Princeton shape benchmark. In IEEE International Conference on Shape Modeling and Applications (SMI). 167--178.
[32]
Avanti Shrikumar, Peyton Greenside, and Anshul Kundaje. 2017. Learning important features through propagating activation differences. In Proceedings of International Conference on Machine Learning (ICML). 3145--3153.
[33]
Elizabeth Shtrom, George Leifman, and Ayellet Tal. 2013. Saliency detection in large point sets. In IEEE International Conference on Computer Vision (ICCV). 3591--3598.
[34]
Zhenyu Shu, Shiqing Xin, Xin Xu, Ligang Liu, and Ladislav Kavan. 2019. Detecting 3D points of interest using multiple features and stacked auto-encoder. IEEE Transactions Visualization 8 Computer Graphics 25, 8 (2019), 2583--2596.
[35]
Saurabh Singh, Abhinav Gupta, and Alexei A. Efros. 2012. Unsupervised discovery of mid-level discriminative patches. In European Conference on Computer Vision (ECCV). 73--86.
[36]
Ran Song, Yonghuai Liu, and Paul Rosin. 2018. Distinction of 3D objects and scenes via classification network and Markov random field. IEEE Transactions Visualization 8 Computer Graphics (2018), To appear.
[37]
X. Yu Stella and Jianbo Shi. 2003. Multiclass spectral clustering. In IEEE International Conference on Computer Vision (ICCV). 313--320.
[38]
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, and Jan Kautz. 2018. SPLATNet: Sparse lattice networks for point cloud processing. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2530--2539.
[39]
Jian Sun and Jean Ponce. 2013. Learning discriminative part detectors for image classification and cosegmentation. In IEEE International Conference on Computer Vision (ICCV). 3400--3407.
[40]
Mukund Sundararajan, Ankur Taly, and Qiqi Yan. 2017. Axiomatic attribution for deep networks. In International Conference on Machine Learning (ICML). 3319--3328.
[41]
Johan W. H. Tangelder and Remco C. Veltkamp. 2004. A survey of content based 3D shape retrieval methods. In IEEE International Conference on Shape Modeling and Applications (SMI). 145--156.
[42]
Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and Computing 17, 4 (2007), 395--416.
[43]
Xi Wang, Sebastian Koch, Kenneth Holmqvist, and Marc Alexa. 2018. Tracking the gaze on objects in 3D: How do people really look at the Bunny? ACM Transactions on Graphics (SIGGRAPH Asia) 37, 6 (2018), 188:1--188:18.
[44]
Yaming Wang, Jonghyun Choi, Vlad Morariu, and Larry S. Davis. 2016. Mining discriminative triplets of patches for fine-grained classification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1163--1172.
[45]
Yue Wang and Justin M. Solomon. 2019. Deep closest point: Learning representations for point cloud registration. In IEEE International Conference on Computer Vision (ICCV). 3523--3532.
[46]
Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. 2019. Dynamic graph CNN for learning on point clouds. ACM Transactions on Graphics 38, 5 (2019), 146:1--146:12.
[47]
3D Warehouse. 2019. Retrieved January 2, 2019 from https://3dwarehouse.sketchup.com/.
[48]
Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A discriminative feature learning approach for deep face recognition. In European Conference on Computer Vision (ECCV). 499--515.
[49]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. CBAM: Convolutional block attention module. In European Conference on Computer Vision (ECCV). 3--19.
[50]
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3D ShapeNets: A deep representation for volumetric shapes. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1912--1920.
[51]
Zhirong Wu, Yuanjun Xiong, X. Yu Stella, and Dahua Lin. 2018. Unsupervised feature learning via non-parametric instance discrimination. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3733--3742.
[52]
Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, and Yu Qiao. 2018. SpiderCNN: Deep learning on point sets with parameterized convolutional filters. In European Conference on Computer Vision (ECCV). 90--105.
[53]
Yaoqing Yang, Chen Feng, Yiru Shen, and Dong Tian. 2018. FoldingNet: Point cloud auto-encoder via deep grid deformation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 206--215.
[54]
Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, and Pheng-Ann Heng. 2018. EC-Net: An edge-aware point set consolidation network. In European Conference on Computer Vision (ECCV). 398--414.
[55]
Matthew D. Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In European Conference on Computer Vision (ECCV). 818--833.
[56]
Jianming Zhang, Sarah Adel Bargal, Zhe Lin, Jonathan Brandt, Xiaohui Shen, and Stan Sclaroff. 2018. Top-down neural attention by excitation backprop. International Journal Computer Vision 126, 10 (2018), 1084--1102.
[57]
Xiaoting Zhang, Xinyi Le, Athina Panotopoulou, Emily Whiting, and Charlie C. L. Wang. 2015. Perceptual models of preference in 3D printing direction. ACM Transactions on Graphics (SIGGRAPH Asia) 34, 6 (2015), 215:1--215:12.
[58]
Hengshuang Zhao, Li Jiang, Chi-Wing Fu, and Jiaya Jia. 2019. PointWeb: Enhancing local neighborhood features for point cloud processing. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5565--5573.
[59]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2921--2929.

Cited By

View all
  • (2024)Self-supervised rotation-equivariant spherical vector network for learning canonical 3D point cloud orientationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107529128(107529)Online publication date: Feb-2024
  • (2023)Automatic Schelling Point Detection From MeshesIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.314414329:6(2926-2939)Online publication date: 1-Jun-2023
  • (2023)3D Visual Saliency: An Independent Perceptual Measure or A Derivative of 2D Image Saliency?IEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.3287356(1-17)Online publication date: 2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics
ACM Transactions on Graphics  Volume 39, Issue 5
October 2020
184 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3403637
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2020
Online AM: 07 May 2020
Accepted: 01 April 2020
Revised: 01 April 2020
Received: 01 July 2019
Published in TOG Volume 39, Issue 5

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Shape analysis
  2. distinctive regions
  3. learning
  4. neural network
  5. unsupervised

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • CUHK Research Committee Direct
  • ISF
  • Research Grants Council of the Hong Kong Special Administrative Region
  • Israel Science Foundation as part of the ISF-NSFC joint program

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)46
  • Downloads (Last 6 weeks)2
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Self-supervised rotation-equivariant spherical vector network for learning canonical 3D point cloud orientationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107529128(107529)Online publication date: Feb-2024
  • (2023)Automatic Schelling Point Detection From MeshesIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.314414329:6(2926-2939)Online publication date: 1-Jun-2023
  • (2023)3D Visual Saliency: An Independent Perceptual Measure or A Derivative of 2D Image Saliency?IEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.3287356(1-17)Online publication date: 2023
  • (2023)D-Net: Learning for distinctive point clouds by self-attentive point searching and learnable feature fusionComputer Aided Geometric Design10.1016/j.cagd.2023.102206(102206)Online publication date: May-2023
  • (2022)View-Agnostic Point Cloud Generation for Occlusion Reduction in Aerial LidarRemote Sensing10.3390/rs1413295514:13(2955)Online publication date: 21-Jun-2022
  • (2022)A Rotation-Invariant Framework for Deep Point Cloud AnalysisIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.309257028:12(4503-4514)Online publication date: 1-Dec-2022
  • (2022)AIM: an Auto-Augmenter for Images and Meshes2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52688.2022.00080(712-721)Online publication date: Jun-2022
  • (2021)PointSCNet: Point Cloud Structure and Correlation Learning Based on Space-Filling Curve-Guided SamplingSymmetry10.3390/sym1401000814:1(8)Online publication date: 22-Dec-2021
  • (2021)Dynamic neural garmentsACM Transactions on Graphics10.1145/3478513.348049740:6(1-15)Online publication date: 10-Dec-2021
  • (2021)DeepVecFontACM Transactions on Graphics10.1145/3478513.348048840:6(1-15)Online publication date: 10-Dec-2021
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media