[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article
Public Access

Learning hierarchical shape segmentation and labeling from online repositories

Published: 20 July 2017 Publication History

Abstract

We propose a method for converting geometric shapes into hierarchically segmented parts with part labels. Our key idea is to train category-specific models from the scene graphs and part names that accompany 3D shapes in public repositories. These freely-available annotations represent an enormous, untapped source of information on geometry. However, because the models and corresponding scene graphs are created by a wide range of modelers with different levels of expertise, modeling tools, and objectives, these models have very inconsistent segmentations and hierarchies with sparse and noisy textual tags. Our method involves two analysis steps. First, we perform a joint optimization to simultaneously cluster and label parts in the database while also inferring a canonical tag dictionary and part hierarchy. We then use this labeled data to train a method for hierarchical segmentation and labeling of new 3D shapes. We demonstrate that our method can mine complex information, detecting hierarchies in man-made objects and their constituent parts, obtaining finer scale details than existing alternatives. We also show that, by performing domain transfer using a few supervised examples, our technique outperforms fully-supervised techniques that require hundreds of manually-labeled models.

Supplementary Material

MP4 File (papers-0313.mp4)

References

[1]
Sugato Basu, Mikhail Bilenko, and Raymond J Mooney. 2004. A probabilistic framework for semi-supervised clustering. In Proc. KDD.
[2]
Serge Belongie, Jitendra Malik, and Jan Puzicha. 2002. Shape Matching and Object Recognition Using Shape Contexts. IEEE T-PAMI 24, 24 (2002), 509--521.
[3]
Yuri Boykov, Olga Veksler, and Ramin Zabih. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. PAMI (2001).
[4]
Olivier Cappé and Eric Moulines. 2009. On-line expectation-maximization algorithm for latent data models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 71, 3 (2009), 593--613.
[5]
Angel X. Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, Jianxiong Xiao, Li Yi, and Fisher Yu. 2015. ShapeNet: An Information-Rich 3D Model Repository. (2015). arXiv:1512.03012.
[6]
Ding-Yun Chen, Xiao-Pei Tian, Yu-Te Shen, and Ming Ouhyoung. 2003. On Visual Similarity Based 3D Model Retrieval. In Computer Graphics Forum (Eurographics).
[7]
Xiaobai Chen, Aleksey Golovinskiy and Thomas Funkhouser. 2009. A Benchmark for 3D Mesh Segmentation. In ACM SIGGRAPH (SIGGRAPH). Article 73, 73:1--73:12 pages.
[8]
Xinlei Chen and Abhinav Gupta. 2015. Webly Supervised Learning of Convolutional Networks. In Proc. ICCV.
[9]
Matthew Fisher, Manolis Savva, and Pat Hanrahan. 2011. Characterizing structural relationships in scenes using graph kernels. In ACM TOG, Vol. 30. 34.
[10]
Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In AISTATS.
[11]
Aleksey Golovinskiy and Thomas Funkhouser. 2009. Consistent Segmentation of 3D Models. Proc. SMI 33, 3 (2009), 262--269.
[12]
Kan Guo, Dongqing Zou, and Xiaowu Chen. 2015. 3D Mesh Labeling via Deep Convolutional Neural Networks. ACM TOG 35, 1, Article 3 (2015), 12 pages.
[13]
Ruizhen Hu, Lubin Fan, and Ligang Liu. 2012. Co-segmentation of 3D shapes via subspace clustering. SGP 31, 5 (2012), 1703--1713.
[14]
Qixing Huang, Vladlen Koltun, and Leonidas Guibas. 2011. Joint shape segmentation with linear programming. In ACM SIGGRAPH Asia. 125:1--125:12.
[15]
Qixing Huang, Fan Wang, and Leonidas Guibas. 2014. Functional Map Networks for Analyzing and Exploring Large Shape Collections. SIGGRAPH 33, 4 (2014).
[16]
Hamid Izadinia, Bryan C. Russell, Ali Farhadi, Matthew D. Hoffman, and Aaron Hertzmann. 2015. Deep Classifiers from Image Tags in the Wild. In Proc. Multimedia COMMONS.
[17]
Andrew E. Johnson and Martial Hebert. 1999. Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes. IEEE T-PAMI 21, 5 (1999), 433--449.
[18]
Evangelos Kalogerakis, Aaron Hertzmann, and Karan Singh. 2010. Learning 3D mesh segmentation and labeling. ACM Transactions on Graphics (TOG) 29, 4 (2010), 102.
[19]
Vladimir G Kim, Wilmot Li, Niloy J Mitra, Siddhartha Chaudhuri, Stephen DiVerdi, and Thomas Funkhouser. 2013. Learning part-based templates from large collections of 3D shapes. ACM Transactions on Graphics (TOG) 32, 4 (2013), 70.
[20]
Diederik P. Kingma and Jimmy Lei Ba. 2015. Adam: A Method for Stochastic Optimization. In Proc. ICLR.
[21]
Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, and Alberto Del Bimbo. 2016. Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement, and Retrieval. ACM Comput. Surv. 49, 1 (2016).
[22]
Tianqiang Liu, Siddhartha Chaudhuri, Vladimir G. Kim, Qi-Xing Huang, Niloy J. Mitra, and Thomas Funkhouser. 2014. Creating Consistent Scene Graphs Using a Probabilistic Grammar. SIGGRAPH Asia 33, 6 (2014).
[23]
Niloy J Mitra, Michael Wand, Hao Zhang, Daniel Cohen-Or, and Martin Bokeloh. 2013. Structure-aware shape processing. In Eurographics STARs. 175--197.
[24]
Radford M Neal and Geoffrey E Hinton. 1998. A view of the EM algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models. Springer, 355--368.
[25]
Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. 2011. Im2text: Describing images using 1 million captioned photographs. In Proc. NIPS.
[26]
Robert Osada, Thomas Funkhouser, Bernard Chazelle, and David Dobkin. 2002. Shape Distributions. ACM Transactions on Graphics (2002).
[27]
Oana Sidi, Oliver van Kaick, Yanir Kleiman, Hao Zhang, and Daniel Cohen-Or. 2011. Unsupervised Co-Segmentation of a Set of Shapes via Descriptor-Space Spectral Clustering. ACM SIGGRAPH Asia 30, 6 (2011), 126:1--126:9.
[28]
Jerry Talton, Lingfeng Yang, Ranjitha Kumar, Maxine Lim, Noah Goodman, and Radomír Měch. 2012. Learning design patterns with bayesian grammar induction. In UIST.
[29]
Oliver van Kaick, Kai Xu, Hao Zhang, Yanzhen Wang, Shuyang Sun, Ariel Shamir, and Daniel Cohen-Or. 2013. Co-hierarchical analysis of shape structures. ACM Transactions on Graphics (TOG) 32, 4 (2013), 69.
[30]
Yunhai Wang, Shmulik Asafi, Oliver van Kaick, Hao Zhang, Daniel Cohen-Or, and Baoquan Chenand. 2012. Active Co-Analysis of a Set of Shapes. SIGGRAPH Asia (2012).
[31]
Yanzhen Wang, Kai Xu, Jun Li, Hao Zhang, Ariel Shamir, Ligang Liu, Zhiquan Cheng, and Yueshan Xiong. 2011. Symmetry Hierarchy of Man-Made Objects. Eurographics 30, 2 (2011).
[32]
Kai Xu, Vladimir G. Kim, Qixing Huang, Niloy J. Mitra, and Evangelos Kalogerakis. 2016. Data-Driven Shape Analysis and Processing. SIGGRAPH Asia Course (2016).
[33]
Li Yi, Vladimir G Kim, Duygu Ceylan, I Shen, Mengyan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, and Leonidas Guibas. 2016. A scalable active framework for region annotation in 3D shape collections. TOG 35, 6 (2016), 210.
[34]
Mehmet Ersin Yumer, Won Chun, and Ameesh Makadia. 2014. Co-segmentation of textured 3D shapes with sparse annotations. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 240--247.
[35]
Qingnan Zhou and Alec Jacobson. 2016. Thingi10K: A Dataset of 10,000 3D-Printing Models. (2016). arxiv:1605.04797.

Cited By

View all
  • (2024)3D shape knowledge graph for cross‐domain 3D shape retrievalCAAI Transactions on Intelligence Technology10.1049/cit2.12326Online publication date: 2-Apr-2024
  • (2023)What’s in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models Through User-Provided Names in Computer Aided Design FilesJournal of Computing and Information Science in Engineering10.1115/1.406245424:1Online publication date: 23-Jun-2023
  • (2023)Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV56688.2023.00128(1226-1235)Online publication date: Jan-2023
  • Show More Cited By

Index Terms

  1. Learning hierarchical shape segmentation and labeling from online repositories

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Graphics
      ACM Transactions on Graphics  Volume 36, Issue 4
      August 2017
      2155 pages
      ISSN:0730-0301
      EISSN:1557-7368
      DOI:10.1145/3072959
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 July 2017
      Published in TOG Volume 36, Issue 4

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. hierarchical shape structure
      2. learning
      3. shape labeling
      4. siamese networks

      Qualifiers

      • Research-article

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)446
      • Downloads (Last 6 weeks)26
      Reflects downloads up to 10 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)3D shape knowledge graph for cross‐domain 3D shape retrievalCAAI Transactions on Intelligence Technology10.1049/cit2.12326Online publication date: 2-Apr-2024
      • (2023)What’s in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models Through User-Provided Names in Computer Aided Design FilesJournal of Computing and Information Science in Engineering10.1115/1.406245424:1Online publication date: 23-Jun-2023
      • (2023)Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV56688.2023.00128(1226-1235)Online publication date: Jan-2023
      • (2023)HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part Labeling2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00086(865-875)Online publication date: 1-Oct-2023
      • (2023)Semi-supervised 3D shape segmentation with multilevel consistency and part substitutionComputational Visual Media10.1007/s41095-022-0281-99:2(229-247)Online publication date: 3-Jan-2023
      • (2022)DSG-Net: Learning Disentangled Structure and Geometry for 3D Shape GenerationACM Transactions on Graphics10.1145/352621242:1(1-17)Online publication date: 12-Aug-2022
      • (2022)CarHoods10k: An Industry-Grade Data Set for Representation Learning and Design Optimization in Engineering ApplicationsIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.314701326:6(1221-1235)Online publication date: Dec-2022
      • (2022)A Deep Learning Driven Active Framework for Segmentation of Large 3D Shape CollectionsComputer-Aided Design10.1016/j.cad.2021.103179144:COnline publication date: 1-Mar-2022
      • (2022)A Hybrid Deep Learning Network CNN-SVM for 3D Mesh SegmentationAdvanced Intelligent Systems for Sustainable Development (AI2SD’2020)10.1007/978-3-030-90639-9_93(1146-1155)Online publication date: 10-Feb-2022
      • (2021)PTRProceedings of the 35th International Conference on Neural Information Processing Systems10.5555/3540261.3541594(17427-17440)Online publication date: 6-Dec-2021
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Full Access

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media