[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Skew detection and correction in document images based on straight-line fitting

Published: 01 August 2003 Publication History

Abstract

During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a concept of Eigen-point is introduced. After the relations between the successive Eigen-points in every text line within a suitable sub-region were analyzed, the Eigen-points most possibly laid on the baselines are selected as samples for the straight-line fitting. The average of these baseline directions is computed, which corresponds to the degree of skew of the whole document image. Then a fast skew correction method based on the scanning line model is also presented. Experiments prove that the proposed approaches are fast and accurate.

References

[1]
Chen, M., Ding, X., 1999. A robust skew detection algorithm for grayscale document image. In: Proc. 5th Internat. Conf. on Document Anal. and Recognition, Bangalore, India, 20-22 September, pp. 617-620.
[2]
Ciardiello, G., Scafur, G., Degrandi, M.T., Spada, M.R., Roccoteli, M.P., 1988. An Experimental System for Office Document Handling and Text Recognition. In: Proc. 9th Internat. Conf. on Pattern Recognition, Rome, Italy, 14-17 November pp. 739-743.
[3]
Gatos, B., Papamarkos, N., Chamzas, C., 1997. Skew detection and text line position determination in digitized documents. Pattern Recognition 30 (9), 1505-1519.
[4]
Hinds, S.C., Fisher, J.L., D'Amato, D.F., 1990. A document skew detection method using run-length encoding and the Hough transform. In: Proc. 10th Internat. Conf. on Pattern Recognition, Atlantic City, New York, 16-21 June, pp. 464-468.
[5]
Le, D.S., Thoma, G.R., Wechsler, H., 1997. Automated page orientation and skew angle detection for binary document images. Pattern Recognition 27 (10), 1325-1344.
[6]
O'Gorman, L., 1993. The document spectrum for page layout analysis. IEEE Trans. on Pattern Anal. Machine Intell. 15 (11), 1162-1173.
[7]
Okun, O., Pietikainen, M., Sauvola, J., 1999. Robust, skew estimation on low-resolutioon document images. In: Proc. 5th Internat. Cong. on Document Anal. Recognition, Bangalore, India, 20-22 September, pp. 621-624.
[8]
Pstl, W., 1986. Detection of linear oblique structure and skew scan in digitized documents. In: Proc. 8th Internat. Conf. on Pattern Recognition, Paris, France, 27-31 October, pp. 687-689.
[9]
Steiherz, T., Intrator, N., Rivlin, E., 1999. Skew detection via principal component analysis. In: Proc. 5th Internat. Conf. on Document Anal. and Recognition, Bangalore, India, 20-22 September, pp. 153-156.
[10]
Sun, C., Si, D., 1997. Skew and slant correction for document image using gradient direction. In: Proc. 4th Internat. Conf. on Document Anal. Recognition, Ulm, Germany, 18-20 August, pp. 170-174.
[11]
Wang, S., Cao, Y., Cai, S., 2002. An approach to page segmentation and classification. J. Comput.-Aided Design Comput. Graphics 14 (1), 17-20.
[12]
Yan, H., 1993. Skew correction of document images using inerline cross-correlation. Comput. Vision Graphics Image Process. 55 (6), 538-543.
[13]
Yu, B., Jain, A.K., 1996. A robust and fast skew detection algorithm for generic documents. Pattern Recognition 29 (10), 1599-1629.

Cited By

View all
  • (2019)Document Layout AnalysisACM Computing Surveys10.1145/335561052:6(1-36)Online publication date: 16-Oct-2019
  • (2018)Ultra-fast basic geometrical transformations on linear image data structureExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.09.01191:C(322-346)Online publication date: 1-Jan-2018
  • (2017)On the Farey sequence and its augmentation for applications to image analysisInternational Journal of Applied Mathematics and Computer Science10.1515/amcs-2017-004527:3(637-658)Online publication date: 1-Sep-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Pattern Recognition Letters
Pattern Recognition Letters  Volume 24, Issue 12
August 2003
289 pages

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 August 2003

Author Tags

  1. Eigen-point
  2. connected component
  3. document analysis
  4. skew correction
  5. skew detection

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Document Layout AnalysisACM Computing Surveys10.1145/335561052:6(1-36)Online publication date: 16-Oct-2019
  • (2018)Ultra-fast basic geometrical transformations on linear image data structureExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.09.01191:C(322-346)Online publication date: 1-Jan-2018
  • (2017)On the Farey sequence and its augmentation for applications to image analysisInternational Journal of Applied Mathematics and Computer Science10.1515/amcs-2017-004527:3(637-658)Online publication date: 1-Sep-2017
  • (2015)Small Eigenvalue Based Skew Estimation of Handwritten Devanagari WordsProceedings of the Third International Conference on Mining Intelligence and Knowledge Exploration - Volume 946810.1007/978-3-319-26832-3_21(216-225)Online publication date: 9-Dec-2015
  • (2010)Identification of scripts and orientations of degraded document imagesPattern Analysis & Applications10.5555/2736769.273690913:4(469-475)Online publication date: 1-Nov-2010
  • (2008)A rotation method for binary document images using DDA algorithmProceedings of the eighth ACM symposium on Document engineering10.1145/1410140.1410198(267-270)Online publication date: 16-Sep-2008
  • (2007)Language independent skew estimation technique based on Gaussian mixture modelsProceedings of the 2nd international conference on Pattern recognition and machine intelligence10.5555/1781034.1781101(487-494)Online publication date: 18-Dec-2007
  • (2007)An accurate and efficient skew estimation technique for South Indian documentsInternational Journal of Robotics and Automation10.5555/1739839.173984122:4(272-280)Online publication date: 1-Sep-2007
  • (2007)Language Independent Skew Estimation Technique Based on Gaussian Mixture Models: A Case Study on South Indian ScriptsPattern Recognition and Machine Intelligence10.1007/978-3-540-77046-6_60(487-494)Online publication date: 18-Dec-2007
  • (2005)Sequential Correction of Perspective Warp in Camera-based DocumentsProceedings of the Eighth International Conference on Document Analysis and Recognition10.1109/ICDAR.2005.216(394-398)Online publication date: 31-Aug-2005

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media