[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1896300.1896398guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Inferring 3D body pose from silhouettes using activity manifold learning

Published: 27 June 2004 Publication History

Abstract

We aim to infer 3D body pose directly from human silhouettes. Given a visual input (silhouette), the objective is to recover the intrinsic body configuration, recover the view point, reconstruct the input and detect any spatial or temporal outliers. In order to recover intrinsic body configuration (pose) from the visual input (silhouette), we explicitly learn view-based representations of activity manifolds as well as learn mapping functions between such central representations and both the visual input space and the 3D body pose space. The body pose can be recovered in a closed form in two steps by projecting the visual input to the learned representations of the activity manifold, i.e., finding the point on the learned manifold representation corresponding to the visual input, followed by interpolating 3D pose.

References

[1]
D. Beymer and T. Poggio. Image representations for visual learning. Science, 272(5250), 1996.
[2]
M. Brand. Shadow puppetry. In International Conference on Computer Vision, volume 2, page 1237, 1999.
[3]
M. Brand and K. Huang. A unifying theorem for spectral embedding and clustering. In Proc. of the Ninth International Workshop on AI and Statistics, 2003.
[4]
C. Bregler and S. M. Omohundro. Nonlinear manifold learning for visual speech recognition. In ICCV, 1995.
[5]
L. W. Campbell and A. F. Bobick. Recognition of human body motion using phase space constraints. In ICCV, pages 624-630, 1995.
[6]
T. Darrell and A. Pentland. Space-time gesture. In Proc IEEE CVPR, 1993.
[7]
A. Elgammal and C.-S. Lee. Separating style and content on a nonlinear manifold. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, June-July 2004.
[8]
D. Gavrila. The visual analysis of human movement: A survey. Computer Vision and Image Understanding, 73(1):82- 98, Jan 1999.
[9]
D. Gavrila and L. Davis. 3-d model-based tracking of humans in action: a multi-view approach. In IEEE Conference on Computer Vision and Pattern Recognition, 1996.
[10]
D. Hogg. Model-based vision: a program to see a walking person. Image and Vision Computing, 1(1):5-20, 1983.
[11]
Howe, Leventon, and W. Freeman. Bayesian reconstruction of 3d human motion from single-camera video. In Proc. NIPS, 1999.
[12]
J.O'Rourke and Badler. Model-based image analysis of human motion using constraint propagation. IEEE PAMI, 2(6), 1980.
[13]
S. X. Ju, M. J. Black, and Y. Yacoob. Cardboard people: A parameterized model of articulated motion. In International Conference on Automatic Face and Gesture Recognition, pages 38-44, Killington, Vermont, 1996.
[14]
I. A. Kakadiaris and D. Metaxas. Model-based estimation of 3D human motion with occlusion based on active multiviewpoint selection. In Proc. IEEE Conf. Computer Vision and Pattern Recognition, CVPR, pages 81-87, Los Alamitos, California, U.S.A., 18-20 1996. IEEE Computer Society.
[15]
T. D. Kristen Grauman, Gregory Shakhnarovich. Inferring 3d structure with a statistical image-based shape model. In ICCV, 2003.
[16]
G. Mori and J. Malik. Estimating human body configurations using shape context matching. In European Conference on Computer Vision, 2002.
[17]
S. Nayar, H. Murase, and S. Nene. Parametric appearance representation. In Early Visual Learning. Oxford University Press, February 1996.
[18]
S. Osher and N. Paragios. Geometric Level Set Methods. Springer, 2003.
[19]
T. Poggio and F. Girosi. Network for approximation and learning. Proceedings of the IEEE, 78(9):1481-1497, 1990.
[20]
J. M. Rehg and T. Kanade. Model-based tracking of self-occluding articulated objects. In ICCV, pages 612-617, 1995.
[21]
K. Rohr. Towards model-based recognition of human movements in image sequence. CVGIP, 59(1):94-115, 1994.
[22]
R. Rosales, V. Athitsos, and S. Sclaroff. 3D hand pose reconstruction using specialized mappings. In Proc. ICCV, 2001.
[23]
R. Rosales and S. Sclaroff. Specialized mappings and the estimation of human body pose from a single image. In Workshop on Human Motion, pages 19-24, 2000.
[24]
S. Roweis and L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Sciene, 290(5500):2323-2326, 2000.
[25]
G. Shakhnarovich, P. Viola, and T. Darrell. Fast pose estimation with parameter-sensitive hashing. In ICCV, 2003.
[26]
H. Sidenbladh, M. J. Black, and D. J. Fleet. Stochastic tracking of 3d human figures using 2d image motion. In ECCV (2), pages 702-718, 2000.
[27]
H. Sidenbladh, M. J. Black, and L. Sigal. Implicit probabilistic models of human motion for synthesis and tracking. In Proc. ECCV 2002, LNCS 2350, pages 784-800, 2002.
[28]
J. Tenenbaum. Mapping a manifold of perceptual observations. In Advances in Neural Information Processing, volume 10, pages 682-688, 1998.
[29]
K. Toyama and A. Blake. Probabilistic tracking in a metric space. In ICCV, pages 50-59, 2001.
[30]
Q. Wang, G. Xu, and H. Ai. Learning object interinsic structure for robust visual tracking. In CVPR, volume 2, page 227, 2003.
[31]
C. R. Wern, A. Azarbayejani, T. Darrell, and A. P. Pentland. Pfinder: Real-time tracking of human body. IEEE Transaction on Pattern Analysis and Machine Intelligence, 1997.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
June 2004
1041 pages

Sponsors

  • IEEE-CS\DATC: IEEE Computer Society

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 June 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Angle-based embedding quality assessment method for manifold learningNeural Computing and Applications10.1007/s00521-017-3113-631:3(839-849)Online publication date: 1-Mar-2019
  • (2018)Multi-activity 3D human motion recognition and tracking in composite motion model with synthesized transition bridgesMultimedia Tools and Applications10.1007/s11042-017-4847-y77:10(12023-12055)Online publication date: 1-May-2018
  • (2017)Scalable out-of-sample extension of graph embeddings using deep neural networksPattern Recognition Letters10.1016/j.patrec.2017.04.01694:C(1-6)Online publication date: 15-Jul-2017
  • (2017)Real-time 3D motion capture by monocular vision and virtual renderingMachine Vision and Applications10.1007/s00138-017-0861-328:8(839-858)Online publication date: 1-Nov-2017
  • (2015)Human pose recovery by supervised spectral embeddingNeurocomputing10.1016/j.neucom.2015.04.005166:C(301-308)Online publication date: 20-Oct-2015
  • (2014)Leveraging depth cameras and wearable pressure sensors for full-body kinematics and dynamics captureACM Transactions on Graphics10.1145/2661229.266128633:6(1-14)Online publication date: 19-Nov-2014
  • (2013)Editor's choice articleImage and Vision Computing10.1016/j.imavis.2012.06.00931:3(223-230)Online publication date: 1-Mar-2013
  • (2012)Quaternion space sparse decomposition for motion compression and retrievalProceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation10.5555/2422356.2422383(183-192)Online publication date: 29-Jul-2012
  • (2012)Quaternion space sparse decomposition for motion compression and retrievalProceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation10.5555/2421731.2421758(183-192)Online publication date: 29-Jul-2012
  • (2012)Topic based pose relevance learning in dance archivesProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398694(2571-2574)Online publication date: 29-Oct-2012
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media