Robust Visual Tracking by Integrating Multiple Cues Based on Co-Inference Learning

Published: 01 June 2004

Abstract

Visual tracking can be treated as a parameter estimation problem that infers target states from image observations in video sequences. A richer target representation may improve the chances of successful tracking in cluttered and dynamic environments, and thus enhance robustness. Richer representations can be constructed either by specifying a detailed model of a single cue or by combining a set of rough models of multiple cues. Both approaches increase the dimensionality of the state space, which dramatically increases the computation required. To investigate the integration of rough models from multiple cues and to explore computationally efficient algorithms, this paper formulates the problem of multiple cue integration and tracking in a probabilistic framework based on a factorized graphical model. Structured variational analysis of such a graphical model factorizes different modalities and suggests a co-inference process among these modalities. Based on the importance sampling technique, a sequential Monte Carlo algorithm is proposed to provide an efficient simulation and approximation of the co-inference of multiple cues. This algorithm runs in real time at around 30 Hz. Our extensive experiments show that the proposed algorithm performs robustly in a large variety of tracking scenarios. The approach presented in this paper has the potential to solve other problems, including sensor fusion problems.
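To make the idea of sequential Monte Carlo tracking with more than one cue concrete, the following is a minimal sketch and not the paper's co-inference algorithm. It is a generic bootstrap particle filter on a one-dimensional toy state that fuses two noisy cues by multiplying their likelihoods. The random-walk motion model, the Gaussian likelihoods, the noise levels, and all function names are assumptions chosen for illustration.

```python
import math
import random


def gaussian_likelihood(observation, predicted, sigma):
    """Unnormalized Gaussian likelihood of an observation given a predicted state."""
    return math.exp(-0.5 * ((observation - predicted) / sigma) ** 2)


def particle_filter_step(particles, obs_a, obs_b, rng,
                         motion_sigma=1.0, sigma_a=2.0, sigma_b=3.0):
    """One predict / weight / resample step that fuses two cues (toy setup)."""
    # Predict: propagate each particle with an assumed random-walk motion model.
    predicted = [x + rng.gauss(0.0, motion_sigma) for x in particles]

    # Weight: the two cues are assumed conditionally independent given the
    # state, so the importance weight is the product of their likelihoods.
    weights = [gaussian_likelihood(obs_a, x, sigma_a) *
               gaussian_likelihood(obs_b, x, sigma_b) for x in predicted]
    total = sum(weights)
    if total <= 0.0:
        # All weights underflowed; fall back to uniform weights.
        weights = [1.0 / len(predicted)] * len(predicted)
    else:
        weights = [w / total for w in weights]

    # Estimate: weighted mean of the particle set.
    estimate = sum(w * x for w, x in zip(weights, predicted))

    # Resample: draw a new equally weighted particle set.
    resampled = rng.choices(predicted, weights=weights, k=len(predicted))
    return resampled, estimate


def track(true_positions, num_particles=500, seed=0):
    """Simulate two noisy cues for a known trajectory and return the estimates."""
    rng = random.Random(seed)
    particles = [rng.uniform(-10.0, 10.0) for _ in range(num_particles)]
    estimates = []
    for x_true in true_positions:
        obs_a = x_true + rng.gauss(0.0, 2.0)  # simulated cue A measurement
        obs_b = x_true + rng.gauss(0.0, 3.0)  # simulated cue B measurement
        particles, estimate = particle_filter_step(particles, obs_a, obs_b, rng)
        estimates.append(estimate)
    return estimates
```

Calling `track([0.1 * t for t in range(50)])` returns one position estimate per frame. The abstract's co-inference process goes further than this product-of-likelihoods fusion: it factorizes the modalities and lets them inform one another, which this sketch does not attempt to reproduce.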

Published In

International Journal of Computer Vision, Volume 58, Issue 1
Special Issue on Computer Vision Research at the Beckman Institute of Advanced Science and Technology
June 2004
77 pages

Publisher

Kluwer Academic Publishers

United States

Author Tags

  1. co-inference
  2. factorized graphical model
  3. importance sampling
  4. sequential Monte Carlo
  5. variational analysis
  6. visual tracking

Qualifiers

  • Article
