Abstract
Models of object recognition in cortex have so far been mostly applied to tasks involving the recognition of isolated objects presented on blank backgrounds. However, ultimately models of the visual system have to prove themselves in real world object recognition tasks. Here we took a first step in this direction: We investigated the performance of the HMAX model of object recognition in cortex recently presented by Riesenhuber & Poggio [1],[2] on the task of face detection using natural images. We found that the standard version of hmax performs rather poorly on this task, due to the low specificity of the hardwired feature set of C2 units in the model (corresponding to neurons in intermediate visual area V4) that do not show any particular tuning for faces vs. background. We show how visual features of intermediate complexity can be learned in HMAX using a simple learning rule. Using this rule, hmax outperforms a classical machine vision face detection system presented in the literature. This suggests an important role for the set of features in intermediate visual areas in object recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
M. Riesenhuber and T. Poggio. Hierarchical models of object recognition in cortex. Nat. Neurosci., 2(11):1019–25, 1999.
M. Riesenhuber and T. Poggio. Models of object recognition. Nature Neuroscience, 3 supp.:1199–1204, 2000.
B. Heisele, T. Serre, M. Pontil, and T. Poggio. Component-based face detection. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, volume 1, pages 657–62, Hawaii, 2001.
K.-K. Sung. Learning and Example Selection for Object and Pattern Recognition. PhD thesis, MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA, 1996.
D. Hubel and T. Wiesel. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Phys., 160:106–54, 1962.
T. J. Gawne and J. M. Martin. Response of primate visual cortical V4 neurons to simultaneously presented stimuli. To appear in J. Neurophysiol., 2002.
D. J. Freedman, M. Riesenhuber, T. Poggio, and E. K. Miller. Categorical representation of visual stimuli in the primate prefrontal cortex. Science, 291:312–16, 2001.
V. Vapnik. The nature of statistical learning. Springer Verlag, 1995.
T. Vetter. Synthesis of novel views from a single face. International Journal of Computer Vision, 28(2):103–116, 1998.
S. Ullman, M. Vidal-Naquet, and E. Sali. Visual features of intermediate complexity and their use in classification. Nat. Neurosci., 5(7):682–87, 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Serre, T., Riesenhuber, M., Louie, J., Poggio, T. (2002). On the Role of Object-Specific Features for Real World Object Recognition in Biological Vision. In: Bülthoff, H.H., Wallraven, C., Lee, SW., Poggio, T.A. (eds) Biologically Motivated Computer Vision. BMCV 2002. Lecture Notes in Computer Science, vol 2525. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36181-2_39
Download citation
DOI: https://doi.org/10.1007/3-540-36181-2_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00174-4
Online ISBN: 978-3-540-36181-7
eBook Packages: Springer Book Archive