DOI: 10.5555/794189.794447

Multi-Modal Tracking of Faces for Video Communications

Published: 17 June 1997

Abstract

This paper describes a system that uses multiple visual processes to detect and track faces for video compression and transmission. The system is based on an architecture in which a supervisor selects and activates visual processes in a cyclic manner. Control of visual processes is made possible by a confidence factor that accompanies each observation. Fusion of results into a unified estimate for tracking is made possible by estimating a covariance matrix with each observation. The visual processes described for face tracking use blink detection, normalized color histogram matching, and cross-correlation (SSD and NCC). Ensembles of visual processes are organized into processing states so as to provide robust tracking. Transitions between states are triggered by events detected by the processes. The result of face detection is fed into a recursive estimator (Kalman filter). The output of the estimator drives a PD controller for a pan/tilt/zoom camera. The resulting system provides robust and precise tracking and operates continuously at approximately 20 images per second on a 150 MHz computer workstation.
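The abstract states that fusion is made possible by a covariance matrix attached to each observation, but it does not give the fusion rule. One common way to combine such observations is inverse-covariance (information) weighting, sketched below as an illustration only; it is an assumption, not necessarily the method used in the paper. The names `inv2` and `fuse`, the 2D image-position state, and the example numbers are all hypothetical.

```python
from typing import Sequence, Tuple

Vec2 = Tuple[float, float]
Mat2 = Tuple[Tuple[float, float], Tuple[float, float]]


def inv2(m: Mat2) -> Mat2:
    """Invert a 2x2 matrix; raise ValueError if it is (nearly) singular."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("covariance matrix is singular")
    return ((d / det, -b / det), (-c / det, a / det))


def fuse(observations: Sequence[Tuple[Vec2, Mat2]]) -> Tuple[Vec2, Mat2]:
    """Fuse 2D position estimates by inverse-covariance weighting.

    Assumed rule (not taken from the paper):
        fused_cov  = (sum_i C_i^-1)^-1
        fused_mean = fused_cov * sum_i (C_i^-1 * x_i)
    """
    info = [[0.0, 0.0], [0.0, 0.0]]  # accumulated information matrix
    vec = [0.0, 0.0]                 # accumulated information vector
    for (x, y), cov in observations:
        w = inv2(cov)                # weight = inverse covariance
        for r in range(2):
            for c in range(2):
                info[r][c] += w[r][c]
            vec[r] += w[r][0] * x + w[r][1] * y
    fused_cov = inv2(((info[0][0], info[0][1]), (info[1][0], info[1][1])))
    fused_mean = (
        fused_cov[0][0] * vec[0] + fused_cov[0][1] * vec[1],
        fused_cov[1][0] * vec[0] + fused_cov[1][1] * vec[1],
    )
    return fused_mean, fused_cov


# Hypothetical observations of a face centre, in pixels:
# a coarse color-histogram estimate and a tighter correlation estimate.
color_obs = ((100.0, 80.0), ((16.0, 0.0), (0.0, 16.0)))
corr_obs = ((104.0, 82.0), ((4.0, 0.0), (0.0, 4.0)))
mean, cov = fuse([color_obs, corr_obs])
print(mean, cov)
```

With these made-up inputs the fused position lies closer to the lower-variance correlation estimate, and the fused variance is smaller than either input variance. An estimate of this kind could then serve as the measurement passed to a recursive estimator such as the Kalman filter mentioned above.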

Published In

CVPR '97: Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
June 1997
ISBN: 0818678224

Publisher

IEEE Computer Society

United States

Author Tags

  1. Active and Real-Time Vision
  2. Integration and Control of Visual Processes

Qualifiers

  • Article

Cited By

  • (2009) A Reliable Skin Detection Using Dempster-Shafer Theory of Evidence. Proceedings of the International Conference on Computational Science and Its Applications: Part II, pp. 764-779. DOI: 10.1007/978-3-642-02457-3_63. Online publication date: 9-Jul-2009
  • (2008) A skin detection approach based on color distance map. EURASIP Journal on Advances in Signal Processing, 2008, pp. 1-10. DOI: 10.1155/2008/814283. Online publication date: 1-Jan-2008
  • (2007) Ensemble Tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29:2, pp. 261-271. DOI: 10.1109/TPAMI.2007.35. Online publication date: 1-Feb-2007
  • (2006) Speaker localization for microphone array-based ASR. Proceedings of the 8th international conference on Multimodal interfaces, pp. 35-38. DOI: 10.1145/1180995.1181004. Online publication date: 2-Nov-2006
  • (2006) Social Perception. Queue, 4:6, pp. 34-43. DOI: 10.1145/1147518.1147531. Online publication date: 1-Jul-2006
  • (2005) Facial Expression Analysis in E-Learning Systems - The Problems and Feasibility. Proceedings of the Fifth IEEE International Conference on Advanced Learning Technologies, pp. 442-446. DOI: 10.1109/ICALT.2005.150. Online publication date: 5-Jul-2005
  • (2005) A robust method for detecting arbitrarily tilted human faces in color images. Pattern Recognition Letters, 26:16, pp. 2518-2536. DOI: 10.1016/j.patrec.2005.05.008. Online publication date: 1-Dec-2005
  • (2004) Generic multimedia multimodal agents paradigms and their dynamic reconfiguration at the architectural level. EURASIP Journal on Advances in Signal Processing, 2004, pp. 1688-1707. DOI: 10.1155/S1110865704402212. Online publication date: 1-Jan-2004
  • (2003) Information selection and probabilistic 2D - 3D integration in mobile mapping. Proceedings of the 3rd international conference on Computer vision systems, pp. 151-161. DOI: 10.5555/1765473.1765491. Online publication date: 1-Apr-2003
  • (2003) Facial recognition in video. Proceedings of the 4th international conference on Audio- and video-based biometric person authentication, pp. 505-514. DOI: 10.5555/1762222.1762290. Online publication date: 9-Jun-2003
