Abstract
In this paper, we present a methodology to categorize camera captured documents into pre-defined logo classes. Unlike scanned documents, camera captured documents suffer from intensity variations, partial occlusions, cluttering, and large scale variations. Furthermore, the existence of non-uniform folds and the lack of document being flat make this task more challenging. We present the selection of robust local features and the corresponding parameters by comparisons among SIFT, SURF, MSER, Hessian-affine, and Harris-affine. We evaluate the system not only with respect to amount of space required to store the local features information but also with respect to categorization accuracy. Moreover, the system handles the identification of multiple logos on the document at the same time. Experimental results on a challenging set of real images demonstrate the efficiency of our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agam, G., Argamon, S., Frieder, O., Grossman, D., Lewis, D.: The IIT Complex Document Image Processing (CDIP) Test Collection Project. Illinois Institute of Technology, USA (2006)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Speeded-up robust features (surf). Computer Vision and Image Understanding 110(3), 346–359 (2008)
Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 304–317. Springer, Heidelberg (2008)
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C., Silverman, R., Wu, A.Y.: The analysis of a simple k-means clustering algorithm. In: Proc. the Sixteenth Annual Symposium on Computational Geometry, pp. 100–109. Hong Kong University of Science and Technology (June 2000)
Li, Z., Schulte-Austum, M., Neschen, M.: Fast logo detection and recognition in document images. In: Proc. 20th International Conference on Pattern Recognition, Istanbul, Turkey, pp. 2716–2719 (August 2010)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Gool, L.V.: A comparison of affine region detectors. International Journal of Computer Vision 65(1-2), 43–72 (2005)
Olsen, D.L., Delen, D.: Advanced Data Mining Techniques, 1st edn. Springer, Heidelberg (2008)
Rusinol, M., Llados, J.: Logo spotting by a bag-of-words approach for document categorization. In: Proc. 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 111–115 (July 2009)
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: Proc. 9th International Conference on Computer Vision, Nice, France, pp. 1470–1477 (October 2003)
Wang, H.: Document logo detection and recognition using bayesian model. In: Proc. 20th International Conference on Pattern Recognition, Istanbul, Turkey, pp. 1961–1964 (August 2010)
Wang, H., Chen, Y.: Logo detecion in document images based on boundary extension of feature rectangles. In: Proc. 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 1335–1339 (July 2009)
Wu, Z., Ke, Q., Isard, M., Sun, J.: Bundling features for large scale partial-duplicate web image search. In: Proc. IEEE Intl. Conf. on Computer Vision and Pattern Recognition, Miami, FL, USA, pp. 25–32 (June 2009)
Zhu, G., Doermann, D.: Automatic document logo detection. In: Proc. 9th International Conference on Document Analysis and Recognition, Curitiba, Brazil, pp. 864–868 (September 2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Edupuganti, V.G., Shih, F.Y., Kompalli, S. (2011). Categorization of Camera Captured Documents Based on Logo Identification. In: Real, P., Diaz-Pernil, D., Molina-Abril, H., Berciano, A., Kropatsch, W. (eds) Computer Analysis of Images and Patterns. CAIP 2011. Lecture Notes in Computer Science, vol 6855. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23678-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-23678-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23677-8
Online ISBN: 978-3-642-23678-5
eBook Packages: Computer ScienceComputer Science (R0)