Segmentation over Detection by Coupled Global and Local Sparse Representations

Wei Xia, Zheng Song, Jiashi Feng, Loong-Fah Cheong & Shuicheng Yan

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7576)

Included in the following conference series:

European Conference on Computer Vision


Abstract

Motivated by the rising performance of object detection algorithms, we investigate how to precisely segment out the objects within the output bounding boxes. The task is formulated as a unified optimization problem that pursues a unique latent object mask in a non-parametric manner. For a given test image, the objects are first detected by detectors. Then, for each detected bounding box, the objects of the same category, along with their object masks, are extracted from the training set. The latent mask of the object within the bounding box is inferred based on three objectives: 1) the latent mask should be coherent, subject to sparse errors caused by within-category diversity, with the global bounding-box-level mask inferred by sparse representation over the bounding boxes of the same category within the training set; 2) the latent mask should be coherent with the local patch-level mask inferred by sparse representation of each individual patch over all spatially nearby patches of the same category in the training set (nearby patches are used to handle local deformations); and 3) the mask property within each sufficiently small super-pixel should be consistent. These three objectives are integrated into a unified optimization problem, and the sparse representation coefficients and the latent mask are then optimized alternately: the coefficients by Lasso optimization, and the mask by smooth approximation followed by the Accelerated Proximal Gradient method. Extensive experiments on the Pascal VOC object segmentation datasets, VOC2007 and VOC2010, show that the proposed algorithm achieves results competitive with state-of-the-art learning-based algorithms and is superior to other detection-based object segmentation algorithms.
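The first objective, a global mask inferred by sparse representation over same-category training bounding boxes, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature vectors, the regularization weight `lam`, the function names, and the use of plain ISTA as the Lasso solver are all assumptions made for illustration, and the toy data are random. The sketch codes a test bounding-box feature over a dictionary of training features and then combines the training masks with the resulting coefficients.

```python
import numpy as np


def soft_threshold(x, t):
    """Element-wise soft-thresholding, the proximal operator of t * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)


def lasso_ista(D, y, lam=0.1, n_iter=500):
    """Approximately solve min_a 0.5 * ||y - D a||_2^2 + lam * ||a||_1 with ISTA.

    D: (d, n) dictionary, one (hypothetical) training bounding-box feature per column.
    y: (d,) feature of the detected test bounding box.
    """
    # Step size 1 / L, where L = ||D||_2^2 is the Lipschitz constant of the gradient.
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - y)
        a = soft_threshold(a - step * grad, step * lam)
    return a


def transfer_mask(coeffs, train_masks):
    """Combine training masks (n, h, w) with sparse coefficients into a soft mask in [0, 1]."""
    soft = np.tensordot(coeffs, train_masks, axes=1)  # shape (h, w)
    return np.clip(soft, 0.0, 1.0)


# Toy data (assumed): 5 training exemplars with 20-dimensional features and 4x4 binary masks.
rng = np.random.default_rng(0)
D = rng.normal(size=(20, 5))
D /= np.linalg.norm(D, axis=0)          # unit-norm dictionary columns
train_masks = rng.integers(0, 2, size=(5, 4, 4)).astype(float)

y = D[:, 2].copy()                      # test feature identical to training exemplar 2
a = lasso_ista(D, y, lam=0.01)
mask = transfer_mask(a, train_masks)    # soft global mask for the test box
```

In this toy setting the test feature coincides with one training exemplar, so the sparse code concentrates on that exemplar and the transferred mask, thresholded at 0.5, reproduces its mask. The local patch-level term described in the abstract would apply the same kind of coding per patch over spatially nearby training patches.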



Author information

Authors and Affiliations

Department of ECE, National University of Singapore, Singapore
Wei Xia, Zheng Song, Jiashi Feng, Loong-Fah Cheong & Shuicheng Yan


Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

About this paper

Cite this paper

Xia, W., Song, Z., Feng, J., Cheong, LF., Yan, S. (2012). Segmentation over Detection by Coupled Global and Local Sparse Representations. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33715-4_48


DOI: https://doi.org/10.1007/978-3-642-33715-4_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33714-7
Online ISBN: 978-3-642-33715-4
eBook Packages: Computer Science, Computer Science (R0)
