Abstract
Immersive virtual environments with life-like interaction capabilities have very demanding requirements including high-precision motion capture and high-processing speed. These issues raise many challenges for computer vision-based motion estimation algorithms. In this study, we consider the problem of hand tracking using multiple cameras and estimating its 3D global pose (i.e., position and orientation of the palm). Our interest is in developing an accurate and robust algorithm to be employed in an immersive virtual training environment, called “Virtual GloveboX” (VGX) (Twombly et al. in J Syst Cybern Inf 2:30–34, 2005), which is currently under development at NASA Ames. In this context, we present a marker-based, hand tracking and 3D global pose estimation algorithm that operates in a controlled, multi-camera, environment built to track the user’s hand inside VGX. The key idea of the proposed algorithm is tracking the 3D position and orientation of an elliptical marker placed on the dorsal part of the hand using model-based tracking approaches and active camera selection. It should be noted that, the use of markers is well justified in the context of our application since VGX naturally allows for the use of gloves without disrupting the fidelity of the interaction. Our experimental results and comparisons illustrate that the proposed approach is more accurate and robust than related approaches. A byproduct of our multi-camera ellipse tracking algorithm is that, with only minor modifications, the same algorithm can be used to automatically re-calibrate (i.e., fine-tune) the extrinsic parameters of a multi-camera system leading to more accurate pose estimates.
Similar content being viewed by others
References
Twombly A., Smith J., Montgomery K., Boyle R.D.: The virtual glovebox (vgx): a semi-immersive virtual environment for training astronauts in life science experiments. J. Syst. Cybern. Info. 2, 30–34 (2005)
Pavlovic V.I., Sharma R., Huang T.S.: Visual interpretation of hand gestures for human-computer interaction: A review. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 677–695 (1997)
Erol A., Bebis G., Nicolescu M., Boyle R.D., Twombly A.: Vision-based hand pose estimation: a review. Comp. Vis. Image Understanding 108(1-2), 52–73 (2007)
Chua, C.S., Guan, H.Y., Ho, Y.K.: Model-based finger posture estimation. IEEE Asian Comp. Vis. Conf. pp 43–48 (2000)
Chua C.S., Guan H., Ho Y.K.: Model-based 3d hand posture estimation from a single 2d image. Image Vis. Comput. 20(3), 191–202 (2002)
Holden, E.: Visual Recognition of Hand Motion. PhD Thesis, Department of Computer Science, University of Western Australia (1997)
Lien C.C., Huang C.L.: Model based articulated hand motion tracking for gesture recognition. Image Vis. Comput. 16, 121–134 (1998)
Lee, J., Kunii, T.: Constraint-based hand animation. In: Models and Techniques in Computer Animation pp. 110–127. Springer, Tokyo, 1993
Kim, H., Fellner, D.W.: Interaction with hand gesture for a back-projection wall. Comp. Graph. Int. pp. 395–402, 2004.
Lien C.C.: A scalable model-based hand posture analysis system. Mach. Vis. Appl. 16(3), 157–169 (2005)
Lowe D.G.: Fitting parameterized three-dimensional models to images. IEEE Trans. Pattern Anal. Mach. Intell. 13(5), 441–450 (1991)
Ma S.D.: Conics-based stereo, motion estimation and pose determination. Int. J. Comp. Vis. 10(1), 7–25 (1993)
Quan L.: Conic reconstruction and correspondence from two views. IEEE Trans. Pattern Anal. Mach. Intell. 18(2), 151–160 (1996)
Maggioni, C., Kammerer, B.: Gesturecomputer - history, design and applications. In: Computer Vision for Human-Machine Interaction, pp. 312–325. Cambridge University, Cambridge (1998)
Bouguet, J.-Y.: Camera calibration toolbox for matlab.
Svoboda T., Martinec D., Pajdla T.: A convenient multi-camera self-calibration for virtual environments. PRESENCE: Teleoperators Virt. Environ. 14(4), 407–422 (2005)
O’Rourke J., Badler N.I.: Model-based image analysis of human motion using constraint propagation. IEEE Trans. Pattern Anal. Mach. Intell. 2(6), 522–536 (1980)
Gavrila, D.M., Davis, L.S.: 3D Model-based tracking of humans in action: a multi-view approach. Comp. Vis. Pattern Recogn. Conf. pp. 73–80 (1996)
Martin, F., Horaud, R.: Multiple-camera tracking of rigid objects. Research Report 4268 INRIA Montbonnot, France, September (2001)
Canny J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Fitzgibbon A.W., Pilu M., Fisher R.B.: Direct least square fitting of ellipses. IEEE Trans. Pattern Analy. Mach. Intell. 21(5), 476–480 (1999)
Gnu scientific library. http://www.gnu.org/software/gsl/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Usabiaga, J., Erol, A., Bebis, G. et al. Global hand pose estimation by multiple camera ellipse tracking. Machine Vision and Applications 21, 1 (2009). https://doi.org/10.1007/s00138-008-0137-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00138-008-0137-z