Abstract
We present a novel approach to the three-dimensional human body model acquisition from three mutually orthogonal views. Our technique is based on the spatiotemporal analysis of the deforming apparent contour of a human moving according to a protocol of movements. For generality and robustness our technique does not use a prior model of the human body and a prior body part segmentation is not assumed. Therefore, our technique applies to humans of any anthropometric dimension. To parameterize and segment over time a deforming apparent contour, we introduce a new shape representation technique based on primitive composition. The composed deformable model allows us to represent large local deformations and their evolution in a compact and intuitive way. In addition, this representation allows us to hypothesize an underlying part structure and test this hypothesis against the relative motion (due to forces exerted from the image data) of the defining primitives of the composed model. Furthermore, we develop a Human Body Part Decomposition Algorithm (HBPDA) that recovers all the body parts of a subject by monitoring the changes over time to the shape of the deforming silhouette. In addition, we modularize the process of simultaneous two-dimensional part determination and shape estimation by employing the Supervisory Control Theory of Discrete Event Systems. Finally, we present a novel algorithm which selectively integrates the (segmented by the HBPDA) apparent contours from three mutually orthogonal viewpoints to obtain a three-dimensional model of the subject's body parts. The effectiveness of the approach is demonstrated through a series of experiments where a subject performs a set of movements according to a protocol that reveals the structure of the human body.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Akita, K. 1984. Image sequence analysis of realworld human motion. Pattern Recognition, 17:73-83.
Azuola, F., Badler, N.I., Ho, P, Kakadiaris, I.A., Metaxas, D., and Ting, B. 1994. Building anthropometry-based virtual human models. In Proceedings of the IMAGE VII Society Conference, Tucson, AZ.
Barr, A.H. 1981. Superquadrics and angle-preserving transformations. IEEE Computer Graphics and Applications, 1(1):11-23.
Barr, A.H. 1984. Global and local deformations of solid primitives. Computer Graphics, 18(3):21-30.
Besl, P. and McKay, N.D. 1992. A method for registration of 3-D shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(2).
Canny, J. 1986. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(6):679-698.
Gavrila, D.M. and Davis, L.S. 1996. 3-D model-based tracking of humans in action: A multi-view approach. In Proceedings of the 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 73-80, IEEE Computer Society Press: New York, NY.
Goncalves, L., Di Bernardom, E., Ursella, E., and Perona, P. 1995. Monocular tracking of the human arm in 3D. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 764-770.
Hogg, D. 1983. Model-based vision: A program to see a walking person. Image and Vision Computing, 1(1):5-20.
Kakadiaris, I.A. 1997. Motion-based part segmentation, shape and motion estimation of multi-part objects. Ph.D. Dissertation, Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA.
Kakadiaris, I.A. and Metaxas, D. 1995. 3D Human body model acquisition from multiple views. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 618-623.
Kakadiaris, I.A. and Metaxas, D. 1996. Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In Proceedings of the 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 81-87.
Kakadiaris, I.A., Metaxas, D., and Bajcsy, R. 1994. Active part-decomposition, shape and motion estimation of articulated objects: A physics-based approach. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA. pp. 980-984.
Kakadiaris, I.A., Metaxas, D., and Bajcsy, R. 1997. Inferring 2D object structure from the deformation of apparent contours. Computer Vision and Image Understanding, 65(2):129-147.
Leung, M.K. and Yang, Y.H. 1987a. Human body motion segmentation in a complex scene. Pattern Recognition, 20(1):55-64.
Leung, M.K. and Yang, Y.H. 1987b. A region based approach for human body motion analysis. Pattern Recognition, 20(3):321- 339.
Leung, M.K. and Yang, Y.H. 1995. First sight: A human body outline labeling system. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(4):369-377.
Mann, R., Jepson, A., and Siskind, J.M. 1996. Richard Mann, Allan Jepson, and Jeffrey Mark Siskind. Computational perception of scene dynamics. In Proc. of the Fourth European Conference on Computer Vision, Cambridge, UK, Bernard Buxton and Robert Cipola, (Eds.), Lecture Notes in Computer Science, Springer, pp. II:528-539.
Metaxas, D. 1992. Physics-based modeling of nonrigid objects for vision and graphics. Ph.D. Dissertation, Department of Computer Science, University of Toronto.
Metaxas, D. and Terzopoulos, D. 1993. Shape and nonrigid motion estimation through physics-based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6):580-591.
NASA. 1978. Anthropometric source book. volume II: A handbook of anthropometric data. Technical Report NASA Reference Publication 1024, NASA Scientific and Technical Information Office, Johnson Space Center, Houston, TX.
O’Rourke, J. and Badler, N.I. 1980. Model-based image analysis of human motion using constraint propagation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2(6):522-536.
Pito, R.A. 1996. Mesh integration based on comeasurement. In IEEE International Conference on Image Processing, Vienna, Austria, pp. II:397-400.
Prasad, M. 1991. Intersection of line segments. In Graphics Gems II, James Arvo (Ed.), Academic Press.
Press, W.H., Teukolsky, S.A., Vetterling, W.T., and Flannery, B.P. 1992. Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press.
Ramadge, P.J. and Wonham, W.M. 1989. The control of discrete event systems. Proceedings of the IEEE, 77(1):81-97.
Rehg, J.M. and Kanade, T. 1994. Visual tracking of high DOF articulated structures: An application to human hand tracking. In Proceedings of the Third European Conference on Computer Vision, Jan-Olof Eklundh (Ed.), Stockholm, Sweden, pp. 35-46.
Rehg, J.M. and Kanade, T. 1995. Model-based tracking of selfoccluding articulated objects. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 612-617.
Rohr, K. 1994. Towards model-based recognition of human movements in image sequences. Computer Vision, Graphics, and Image Processing: Image Understanding, 59(1):94-115.
Russell, K., Starner, T., and Pentland, A. 1995. Unencumbered virtual environments. In IJCAI-95 Workshop on Entertainment and AI/Alife, IEEE Computer Society Press.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Kakadiaris, I.A., Metaxas, D. Three-Dimensional Human Body Model Acquisition from Multiple Views. International Journal of Computer Vision 30, 191–218 (1998). https://doi.org/10.1023/A:1008071332753
Issue Date:
DOI: https://doi.org/10.1023/A:1008071332753