More Web Proxy on the site http://driver.im/

research-article

Public Access

Learning hierarchical shape segmentation and labeling from online repositories

Authors:

Leonidas Guibas,

Aaron Hertzmann,

Vladimir G. Kim,

Ersin YumerAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 36, Issue 4

Article No.: 70, Pages 1 - 12

https://doi.org/10.1145/3072959.3073652

Published: 20 July 2017 Publication History

Abstract

We propose a method for converting geometric shapes into hierarchically segmented parts with part labels. Our key idea is to train category-specific models from the scene graphs and part names that accompany 3D shapes in public repositories. These freely-available annotations represent an enormous, untapped source of information on geometry. However, because the models and corresponding scene graphs are created by a wide range of modelers with different levels of expertise, modeling tools, and objectives, these models have very inconsistent segmentations and hierarchies with sparse and noisy textual tags. Our method involves two analysis steps. First, we perform a joint optimization to simultaneously cluster and label parts in the database while also inferring a canonical tag dictionary and part hierarchy. We then use this labeled data to train a method for hierarchical segmentation and labeling of new 3D shapes. We demonstrate that our method can mine complex information, detecting hierarchies in man-made objects and their constituent parts, obtaining finer scale details than existing alternatives. We also show that, by performing domain transfer using a few supervised examples, our technique outperforms fully-supervised techniques that require hundreds of manually-labeled models.

Supplementary Material

MP4 File (papers-0313.mp4)

Download
296.02 MB

References

[1]

Sugato Basu, Mikhail Bilenko, and Raymond J Mooney. 2004. A probabilistic framework for semi-supervised clustering. In Proc. KDD.

Digital Library

[2]

Serge Belongie, Jitendra Malik, and Jan Puzicha. 2002. Shape Matching and Object Recognition Using Shape Contexts. IEEE T-PAMI 24, 24 (2002), 509--521.

Digital Library

[3]

Yuri Boykov, Olga Veksler, and Ramin Zabih. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. PAMI (2001).

[4]

Olivier Cappé and Eric Moulines. 2009. On-line expectation-maximization algorithm for latent data models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 71, 3 (2009), 593--613.

[5]

Angel X. Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, Jianxiong Xiao, Li Yi, and Fisher Yu. 2015. ShapeNet: An Information-Rich 3D Model Repository. (2015). arXiv:1512.03012.

[6]

Ding-Yun Chen, Xiao-Pei Tian, Yu-Te Shen, and Ming Ouhyoung. 2003. On Visual Similarity Based 3D Model Retrieval. In Computer Graphics Forum (Eurographics).

[7]

Xiaobai Chen, Aleksey Golovinskiy and Thomas Funkhouser. 2009. A Benchmark for 3D Mesh Segmentation. In ACM SIGGRAPH (SIGGRAPH). Article 73, 73:1--73:12 pages.

[8]

Xinlei Chen and Abhinav Gupta. 2015. Webly Supervised Learning of Convolutional Networks. In Proc. ICCV.

Digital Library

[9]

Matthew Fisher, Manolis Savva, and Pat Hanrahan. 2011. Characterizing structural relationships in scenes using graph kernels. In ACM TOG, Vol. 30. 34.

Digital Library

[10]

Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In AISTATS.

[11]

Aleksey Golovinskiy and Thomas Funkhouser. 2009. Consistent Segmentation of 3D Models. Proc. SMI 33, 3 (2009), 262--269.

[12]

Kan Guo, Dongqing Zou, and Xiaowu Chen. 2015. 3D Mesh Labeling via Deep Convolutional Neural Networks. ACM TOG 35, 1, Article 3 (2015), 12 pages.

[13]

Ruizhen Hu, Lubin Fan, and Ligang Liu. 2012. Co-segmentation of 3D shapes via subspace clustering. SGP 31, 5 (2012), 1703--1713.

Digital Library

[14]

Qixing Huang, Vladlen Koltun, and Leonidas Guibas. 2011. Joint shape segmentation with linear programming. In ACM SIGGRAPH Asia. 125:1--125:12.

Digital Library

[15]

Qixing Huang, Fan Wang, and Leonidas Guibas. 2014. Functional Map Networks for Analyzing and Exploring Large Shape Collections. SIGGRAPH 33, 4 (2014).

Digital Library

[16]

Hamid Izadinia, Bryan C. Russell, Ali Farhadi, Matthew D. Hoffman, and Aaron Hertzmann. 2015. Deep Classifiers from Image Tags in the Wild. In Proc. Multimedia COMMONS.

Digital Library

[17]

Andrew E. Johnson and Martial Hebert. 1999. Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes. IEEE T-PAMI 21, 5 (1999), 433--449.

Digital Library

[18]

Evangelos Kalogerakis, Aaron Hertzmann, and Karan Singh. 2010. Learning 3D mesh segmentation and labeling. ACM Transactions on Graphics (TOG) 29, 4 (2010), 102.

Digital Library

[19]

Vladimir G Kim, Wilmot Li, Niloy J Mitra, Siddhartha Chaudhuri, Stephen DiVerdi, and Thomas Funkhouser. 2013. Learning part-based templates from large collections of 3D shapes. ACM Transactions on Graphics (TOG) 32, 4 (2013), 70.

Digital Library

[20]

Diederik P. Kingma and Jimmy Lei Ba. 2015. Adam: A Method for Stochastic Optimization. In Proc. ICLR.

[21]

Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, and Alberto Del Bimbo. 2016. Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement, and Retrieval. ACM Comput. Surv. 49, 1 (2016).

Digital Library

[22]

Tianqiang Liu, Siddhartha Chaudhuri, Vladimir G. Kim, Qi-Xing Huang, Niloy J. Mitra, and Thomas Funkhouser. 2014. Creating Consistent Scene Graphs Using a Probabilistic Grammar. SIGGRAPH Asia 33, 6 (2014).

Digital Library

[23]

Niloy J Mitra, Michael Wand, Hao Zhang, Daniel Cohen-Or, and Martin Bokeloh. 2013. Structure-aware shape processing. In Eurographics STARs. 175--197.

[24]

Radford M Neal and Geoffrey E Hinton. 1998. A view of the EM algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models. Springer, 355--368.

[25]

Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. 2011. Im2text: Describing images using 1 million captioned photographs. In Proc. NIPS.

[26]

Robert Osada, Thomas Funkhouser, Bernard Chazelle, and David Dobkin. 2002. Shape Distributions. ACM Transactions on Graphics (2002).

[27]

Oana Sidi, Oliver van Kaick, Yanir Kleiman, Hao Zhang, and Daniel Cohen-Or. 2011. Unsupervised Co-Segmentation of a Set of Shapes via Descriptor-Space Spectral Clustering. ACM SIGGRAPH Asia 30, 6 (2011), 126:1--126:9.

[28]

Jerry Talton, Lingfeng Yang, Ranjitha Kumar, Maxine Lim, Noah Goodman, and Radomír Měch. 2012. Learning design patterns with bayesian grammar induction. In UIST.

Digital Library

[29]

Oliver van Kaick, Kai Xu, Hao Zhang, Yanzhen Wang, Shuyang Sun, Ariel Shamir, and Daniel Cohen-Or. 2013. Co-hierarchical analysis of shape structures. ACM Transactions on Graphics (TOG) 32, 4 (2013), 69.

Digital Library

[30]

Yunhai Wang, Shmulik Asafi, Oliver van Kaick, Hao Zhang, Daniel Cohen-Or, and Baoquan Chenand. 2012. Active Co-Analysis of a Set of Shapes. SIGGRAPH Asia (2012).

[31]

Yanzhen Wang, Kai Xu, Jun Li, Hao Zhang, Ariel Shamir, Ligang Liu, Zhiquan Cheng, and Yueshan Xiong. 2011. Symmetry Hierarchy of Man-Made Objects. Eurographics 30, 2 (2011).

[32]

Kai Xu, Vladimir G. Kim, Qixing Huang, Niloy J. Mitra, and Evangelos Kalogerakis. 2016. Data-Driven Shape Analysis and Processing. SIGGRAPH Asia Course (2016).

[33]

Li Yi, Vladimir G Kim, Duygu Ceylan, I Shen, Mengyan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, and Leonidas Guibas. 2016. A scalable active framework for region annotation in 3D shape collections. TOG 35, 6 (2016), 210.

Digital Library

[34]

Mehmet Ersin Yumer, Won Chun, and Ameesh Makadia. 2014. Co-segmentation of textured 3D shapes with sparse annotations. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 240--247.

Digital Library

[35]

Qingnan Zhou and Alec Jacobson. 2016. Thingi10K: A Dataset of 10,000 3D-Printing Models. (2016). arxiv:1605.04797.

Cited By

Chang RMa YHao TWang WNie W(2024)3D shape knowledge graph for cross‐domain 3D shape retrievalCAAI Transactions on Intelligence Technology10.1049/cit2.12326Online publication date: 2-Apr-2024
https://doi.org/10.1049/cit2.12326
Meltzer PLambourne JGrandi D(2023)What’s in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models Through User-Provided Names in Computer Aided Design FilesJournal of Computing and Information Science in Engineering10.1115/1.406245424:1Online publication date: 23-Jun-2023
https://doi.org/10.1115/1.4062454
Kim JMo KSung MWoo W(2023)Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV56688.2023.00128(1226-1235)Online publication date: Jan-2023
https://doi.org/10.1109/WACV56688.2023.00128
Show More Cited By

Index Terms

Learning hierarchical shape segmentation and labeling from online repositories
1. Computing methodologies
  1. Computer graphics
    1. Shape modeling
      1. Shape analysis
  2. Machine learning
    1. Machine learning approaches

Recommendations

Automatic online labeling images via co-active-learning
ICIMCS '09: Proceedings of the First International Conference on Internet Multimedia Computing and Service

The well-built dataset is a pre-requisite for computer vision research. However, the process of collecting and labeling the images is laborious and monotonous. In this paper, we aim to automatic labeling and collecting the images for the visual object ...
A Survey of Semi-Supervised Learning Methods
CIS '08: Proceedings of the 2008 International Conference on Computational Intelligence and Security - Volume 02

In traditional machine learning approaches to classification, one uses only a labelled set to train the classifier. Labelled instances however are often difficult, expensive, or time consuming to obtain, as they require the efforts of experienced human ...
A fuzzy set approach for shape-based image annotation
WILF'11: Proceedings of the 9th international conference on Fuzzy logic and applications

In this paper, we present a shape labeling approach for automatic image annotation. A fuzzy clustering process is applied to shapes represented by Fourier descriptors in order to derive a set of shape prototypes. Then, prototypes are manually annotated ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 36, Issue 4

August 2017

2155 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/3072959

Issue’s Table of Contents

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 July 2017

Published in TOG Volume 36, Issue 4

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

44
Total Citations
View Citations
1,122
Total Downloads

Downloads (Last 12 months)446
Downloads (Last 6 weeks)26

Reflects downloads up to 10 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chang RMa YHao TWang WNie W(2024)3D shape knowledge graph for cross‐domain 3D shape retrievalCAAI Transactions on Intelligence Technology10.1049/cit2.12326Online publication date: 2-Apr-2024
https://doi.org/10.1049/cit2.12326
Meltzer PLambourne JGrandi D(2023)What’s in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models Through User-Provided Names in Computer Aided Design FilesJournal of Computing and Information Science in Engineering10.1115/1.406245424:1Online publication date: 23-Jun-2023
https://doi.org/10.1115/1.4062454
Kim JMo KSung MWoo W(2023)Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV56688.2023.00128(1226-1235)Online publication date: Jan-2023
https://doi.org/10.1109/WACV56688.2023.00128
Yu FQian YGil-Ureta FJackson BBennett EZhang H(2023)HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part Labeling2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00086(865-875)Online publication date: 1-Oct-2023
https://doi.org/10.1109/ICCV51070.2023.00086
Sun CYang YGuo HWang PTong XLiu YShum H(2023)Semi-supervised 3D shape segmentation with multilevel consistency and part substitutionComputational Visual Media10.1007/s41095-022-0281-99:2(229-247)Online publication date: 3-Jan-2023
https://doi.org/10.1007/s41095-022-0281-9
Yang JMo KLai YGuibas LGao L(2022)DSG-Net: Learning Disentangled Structure and Geometry for 3D Shape GenerationACM Transactions on Graphics10.1145/352621242:1(1-17)Online publication date: 12-Aug-2022
https://dl.acm.org/doi/10.1145/3526212
Wollstadt PBujny MRamnath SShah JDetwiler DMenzel S(2022)CarHoods10k: An Industry-Grade Data Set for Representation Learning and Design Optimization in Engineering ApplicationsIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.314701326:6(1221-1235)Online publication date: Dec-2022
https://doi.org/10.1109/TEVC.2022.3147013
George DXie XLai YTam G(2022)A Deep Learning Driven Active Framework for Segmentation of Large 3D Shape CollectionsComputer-Aided Design10.1016/j.cad.2021.103179144:COnline publication date: 1-Mar-2022
https://dl.acm.org/doi/10.1016/j.cad.2021.103179
Abouqora YMoumoun L(2022)A Hybrid Deep Learning Network CNN-SVM for 3D Mesh SegmentationAdvanced Intelligent Systems for Sustainable Development (AI2SD’2020)10.1007/978-3-030-90639-9_93(1146-1155)Online publication date: 10-Feb-2022
https://doi.org/10.1007/978-3-030-90639-9_93
Hong YYi LTenenbaum JTorralba AGan CRanzato MBeygelzimer ADauphin YLiang PVaughan J(2021)PTRProceedings of the 35th International Conference on Neural Information Processing Systems10.5555/3540261.3541594(17427-17440)Online publication date: 6-Dec-2021
https://dl.acm.org/doi/10.5555/3540261.3541594
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Media

Figures

Other

Tables

View Issue’s Table of Contents