More Web Proxy on the site http://driver.im/

Article

Inferring 3D body pose from silhouettes using activity manifold learning

Authors:

Ahmed Elgammal,

Chan-Su LeeAuthors Info & Claims

CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Pages 681 - 688

Published: 27 June 2004 Publication History

Abstract

We aim to infer 3D body pose directly from human silhouettes. Given a visual input (silhouette), the objective is to recover the intrinsic body configuration, recover the view point, reconstruct the input and detect any spatial or temporal outliers. In order to recover intrinsic body configuration (pose) from the visual input (silhouette), we explicitly learn view-based representations of activity manifolds as well as learn mapping functions between such central representations and both the visual input space and the 3D body pose space. The body pose can be recovered in a closed form in two steps by projecting the visual input to the learned representations of the activity manifold, i.e., finding the point on the learned manifold representation corresponding to the visual input, followed by interpolating 3D pose.

References

[1]

D. Beymer and T. Poggio. Image representations for visual learning. Science, 272(5250), 1996.

[2]

M. Brand. Shadow puppetry. In International Conference on Computer Vision, volume 2, page 1237, 1999.

[3]

M. Brand and K. Huang. A unifying theorem for spectral embedding and clustering. In Proc. of the Ninth International Workshop on AI and Statistics, 2003.

[4]

C. Bregler and S. M. Omohundro. Nonlinear manifold learning for visual speech recognition. In ICCV, 1995.

[5]

L. W. Campbell and A. F. Bobick. Recognition of human body motion using phase space constraints. In ICCV, pages 624-630, 1995.

[6]

T. Darrell and A. Pentland. Space-time gesture. In Proc IEEE CVPR, 1993.

[7]

A. Elgammal and C.-S. Lee. Separating style and content on a nonlinear manifold. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, June-July 2004.

[8]

D. Gavrila. The visual analysis of human movement: A survey. Computer Vision and Image Understanding, 73(1):82- 98, Jan 1999.

[9]

D. Gavrila and L. Davis. 3-d model-based tracking of humans in action: a multi-view approach. In IEEE Conference on Computer Vision and Pattern Recognition, 1996.

[10]

D. Hogg. Model-based vision: a program to see a walking person. Image and Vision Computing, 1(1):5-20, 1983.

[11]

Howe, Leventon, and W. Freeman. Bayesian reconstruction of 3d human motion from single-camera video. In Proc. NIPS, 1999.

[12]

J.O'Rourke and Badler. Model-based image analysis of human motion using constraint propagation. IEEE PAMI, 2(6), 1980.

[13]

S. X. Ju, M. J. Black, and Y. Yacoob. Cardboard people: A parameterized model of articulated motion. In International Conference on Automatic Face and Gesture Recognition, pages 38-44, Killington, Vermont, 1996.

[14]

I. A. Kakadiaris and D. Metaxas. Model-based estimation of 3D human motion with occlusion based on active multiviewpoint selection. In Proc. IEEE Conf. Computer Vision and Pattern Recognition, CVPR, pages 81-87, Los Alamitos, California, U.S.A., 18-20 1996. IEEE Computer Society.

[15]

T. D. Kristen Grauman, Gregory Shakhnarovich. Inferring 3d structure with a statistical image-based shape model. In ICCV, 2003.

[16]

G. Mori and J. Malik. Estimating human body configurations using shape context matching. In European Conference on Computer Vision, 2002.

[17]

S. Nayar, H. Murase, and S. Nene. Parametric appearance representation. In Early Visual Learning. Oxford University Press, February 1996.

[18]

S. Osher and N. Paragios. Geometric Level Set Methods. Springer, 2003.

[19]

T. Poggio and F. Girosi. Network for approximation and learning. Proceedings of the IEEE, 78(9):1481-1497, 1990.

[20]

J. M. Rehg and T. Kanade. Model-based tracking of self-occluding articulated objects. In ICCV, pages 612-617, 1995.

[21]

K. Rohr. Towards model-based recognition of human movements in image sequence. CVGIP, 59(1):94-115, 1994.

[22]

R. Rosales, V. Athitsos, and S. Sclaroff. 3D hand pose reconstruction using specialized mappings. In Proc. ICCV, 2001.

[23]

R. Rosales and S. Sclaroff. Specialized mappings and the estimation of human body pose from a single image. In Workshop on Human Motion, pages 19-24, 2000.

[24]

S. Roweis and L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Sciene, 290(5500):2323-2326, 2000.

[25]

G. Shakhnarovich, P. Viola, and T. Darrell. Fast pose estimation with parameter-sensitive hashing. In ICCV, 2003.

[26]

H. Sidenbladh, M. J. Black, and D. J. Fleet. Stochastic tracking of 3d human figures using 2d image motion. In ECCV (2), pages 702-718, 2000.

[27]

H. Sidenbladh, M. J. Black, and L. Sigal. Implicit probabilistic models of human motion for synthesis and tracking. In Proc. ECCV 2002, LNCS 2350, pages 784-800, 2002.

[28]

J. Tenenbaum. Mapping a manifold of perceptual observations. In Advances in Neural Information Processing, volume 10, pages 682-688, 1998.

[29]

K. Toyama and A. Blake. Probabilistic tracking in a metric space. In ICCV, pages 50-59, 2001.

[30]

Q. Wang, G. Xu, and H. Ai. Learning object interinsic structure for robust visual tracking. In CVPR, volume 2, page 227, 2003.

[31]

C. R. Wern, A. Azarbayejani, T. Darrell, and A. P. Pentland. Pfinder: Real-time tracking of human body. IEEE Transaction on Pattern Analysis and Machine Intelligence, 1997.

Cited By

Chen DLv JYin JZhang HLi X(2019)Angle-based embedding quality assessment method for manifold learningNeural Computing and Applications10.1007/s00521-017-3113-631:3(839-849)Online publication date: 1-Mar-2019
https://dl.acm.org/doi/10.1007/s00521-017-3113-6
Yu JSun JLiu SLuo S(2018)Multi-activity 3D human motion recognition and tracking in composite motion model with synthesized transition bridgesMultimedia Tools and Applications10.1007/s11042-017-4847-y77:10(12023-12055)Online publication date: 1-May-2018
https://dl.acm.org/doi/10.1007/s11042-017-4847-y
(2017)Scalable out-of-sample extension of graph embeddings using deep neural networksPattern Recognition Letters10.1016/j.patrec.2017.04.01694:C(1-6)Online publication date: 15-Jul-2017
https://dl.acm.org/doi/10.1016/j.patrec.2017.04.016
Show More Cited By

Recommendations

Estimating 3D Body Pose using Uncalibrated Cameras
Inferring Body Pose without Tracking Body Parts
Human pose estimation from corrupted silhouettes using a sub-manifold voting strategy in latent variable space

In this paper, a learning-based framework is proposed for human pose estimation in complicated environments. Human silhouettes extracted from input images are always incomplete and corrupted due to shadows, occlusions, motion blur, or foreground/...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

June 2004

1041 pages

Sponsors

IEEE-CS\DATC: IEEE Computer Society

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 June 2004

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

58
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chen DLv JYin JZhang HLi X(2019)Angle-based embedding quality assessment method for manifold learningNeural Computing and Applications10.1007/s00521-017-3113-631:3(839-849)Online publication date: 1-Mar-2019
https://dl.acm.org/doi/10.1007/s00521-017-3113-6
Yu JSun JLiu SLuo S(2018)Multi-activity 3D human motion recognition and tracking in composite motion model with synthesized transition bridgesMultimedia Tools and Applications10.1007/s11042-017-4847-y77:10(12023-12055)Online publication date: 1-May-2018
https://dl.acm.org/doi/10.1007/s11042-017-4847-y
(2017)Scalable out-of-sample extension of graph embeddings using deep neural networksPattern Recognition Letters10.1016/j.patrec.2017.04.01694:C(1-6)Online publication date: 15-Jul-2017
https://dl.acm.org/doi/10.1016/j.patrec.2017.04.016
Jáuregui DHorain P(2017)Real-time 3D motion capture by monocular vision and virtual renderingMachine Vision and Applications10.1007/s00138-017-0861-328:8(839-858)Online publication date: 1-Nov-2017
https://dl.acm.org/doi/10.1007/s00138-017-0861-3
Yu JGuo YTao DWan J(2015)Human pose recovery by supervised spectral embeddingNeurocomputing10.1016/j.neucom.2015.04.005166:C(301-308)Online publication date: 20-Oct-2015
https://dl.acm.org/doi/10.1016/j.neucom.2015.04.005
Zhang PSiu KZhang JLiu CChai J(2014)Leveraging depth cameras and wearable pressure sensors for full-body kinematics and dynamics captureACM Transactions on Graphics10.1145/2661229.266128633:6(1-14)Online publication date: 19-Nov-2014
https://dl.acm.org/doi/10.1145/2661229.2661286
Tian YSigal LDe La Torre FJia Y(2013)Editor's choice articleImage and Vision Computing10.1016/j.imavis.2012.06.00931:3(223-230)Online publication date: 1-Mar-2013
https://dl.acm.org/doi/10.1016/j.imavis.2012.06.009
Zhu MSun HDeng ZBoulic RKomura T(2012)Quaternion space sparse decomposition for motion compression and retrievalProceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation10.5555/2422356.2422383(183-192)Online publication date: 29-Jul-2012
https://dl.acm.org/doi/10.5555/2422356.2422383
Zhu MSun HDeng Z(2012)Quaternion space sparse decomposition for motion compression and retrievalProceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation10.5555/2421731.2421758(183-192)Online publication date: 29-Jul-2012
https://dl.acm.org/doi/10.5555/2421731.2421758
Ren RCollomosse JJose JChen XLebanon GWang HZaki M(2012)Topic based pose relevance learning in dance archivesProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398694(2571-2574)Online publication date: 29-Oct-2012
https://dl.acm.org/doi/10.1145/2396761.2398694
Show More Cited By

View Options

View options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents