Article

MODEC: Multimodal Decomposable Models for Human Pose Estimation

Authors: Ben Sapp, Ben Taskar

CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition

Pages 3674 - 3681

https://doi.org/10.1109/CVPR.2013.471

Published: 23 June 2013

Abstract

We propose a multimodal, decomposable model for articulated human pose estimation in monocular images. A typical approach to this problem is to use a linear structured model, which struggles to capture the wide range of appearance present in realistic, unconstrained images. In this paper, we instead propose a model of human pose that explicitly captures a variety of pose modes. Unlike other multimodal models, our approach includes both global and local pose cues and uses a convex objective and joint training for mode selection and pose estimation. We also employ a cascaded mode selection step which controls the trade-off between speed and accuracy, yielding a 5x speedup in inference and learning. Our model outperforms state-of-the-art approaches across the accuracy-speed trade-off curve for several pose datasets. This includes our newly-collected dataset of people in movies, FLIC, which contains an order of magnitude more labeled data for training and testing than existing datasets.
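The cascaded mode selection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `cascaded_mode_inference`, the `keep` parameter, the toy scores, and the simple additive combination of a global mode score with a per-mode pose score are all assumptions made for illustration. The idea shown is only that pruning to a few high-scoring modes before running the more expensive per-mode pose inference trades accuracy for speed.

```python
from typing import Callable, List, Sequence, Tuple


def cascaded_mode_inference(
    mode_scores: Sequence[float],
    pose_solver: Callable[[int], Tuple[float, List[str]]],
    keep: int,
) -> Tuple[int, List[str], float]:
    """Return (mode, pose, total score) after pruning to the `keep` best modes.

    Hypothetical sketch: `mode_scores` stands in for a cheap global cue and
    `pose_solver` for the expensive per-mode pose inference.
    """
    if keep < 1:
        raise ValueError("keep must be at least 1")
    # Cascade step: rank modes by the cheap score and keep only the top few.
    ranked = sorted(range(len(mode_scores)), key=lambda m: mode_scores[m], reverse=True)
    survivors = ranked[:keep]

    best = None
    for m in survivors:
        pose_score, pose = pose_solver(m)  # expensive step, run only for survivors
        total = mode_scores[m] + pose_score  # assumed additive combination
        if best is None or total > best[2]:
            best = (m, pose, total)
    return best


# Toy data (made up): four modes with a global score and a local pose score each.
scores = [2.0, 1.0, 3.0, 0.0]
local_scores = [0.5, 4.0, 1.0, 0.0]


def toy_solver(mode: int) -> Tuple[float, List[str]]:
    return local_scores[mode], [f"pose-for-mode-{mode}"]


fast = cascaded_mode_inference(scores, toy_solver, keep=2)
full = cascaded_mode_inference(scores, toy_solver, keep=4)
print(fast[0], full[0])
```

In this toy setup, keeping two modes runs the solver twice and selects mode 2, while keeping all four runs it four times and selects mode 1, whose combined score is higher. A smaller `keep` is faster but can prune the mode that would have won.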



Published In

CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition

June 2013, 3752 pages

ISBN: 9780769549897

Publisher: IEEE Computer Society, United States


