[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Geometric Distortion Correction Technique of Text Images

  • Conference paper
  • First Online:
Advances in Intelligent Systems and Computing V (CSIT 2020)

Abstract

The rapid distribution of digital cameras has caused several new problems related to text recognition. Based on experimental studies, it was revealed that existing OCR systems cannot cope with a complex perspective and geometric distortions that arise during photographing text documents. Therefore, it is necessary to apply text documents pre-processing, so the text lines were straight and horizontal. This article briefly considers the methods document pre-processing, and found that it depends on the type of distortion and is not universal. We proposed information technology and a new method involving the mathematical raising of straightened text lines on the image and heterogeneous distortion correction based on a page surface transformation model. This technology is more reliable than others as it is universal and corrects any type of geometric distortion, including a combination of several types of distortion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 103.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 129.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Sharma, M.D.A., Kumar, R.: A comparative study of edge detectors in digital image processing. In: 2017 4th International Conference on Signal Processing, Computing and Control (ISPCC), Solan, pp. 246–250 (2017)

    Google Scholar 

  2. Yadav, S., Bhanushali, P., Jain, S., Kaur, T.: Word matching and retrieval from images. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, pp. 318–323(2017)

    Google Scholar 

  3. Plaza, J.. Plaza, A.P., Sanchez, S.: Parallel Hyperspectral Image and Signal Processing [Applications Corner]. In: IEEE Signal Processing Magazine, vol. 28, no. 3, pp. 119–126, May 2011

    Google Scholar 

  4. Nykolaichuk, Y., Krulikovskyi, B., Gryga, V., Davletova, A.: Computational accelerators for analog-to-digital and digital processing of sensor signals in information measuring systems. In: 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET), Lviv-Slavske, Ukraine, pp. 148–151 (2018)

    Google Scholar 

  5. Cheung, G., Magli, E., Tanaka, Y., Ng, M.K.: Graph spectral image processing. In: Proceedings of the IEEE, vol. 106, no. 5, pp. 907–930, May 2018

    Google Scholar 

  6. Elsalamony, H.A.: Detecting distorted and benign blood cells using the Hough transform based on neural networks and decision trees. In: Emerging Trends in Image Processing, Computer Vision and Pattern Recognition, pp. 457–473 (2015)

    Google Scholar 

  7. Mori, S.: Historical review of OCR research and development. In: Mori, S., Suen, C.Y., Yamamoto, K. (eds.) Proceedings of the IEEE, vol. 80, no. 7, pp. 1029–1058 (1992)

    Google Scholar 

  8. Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of 7th International Conference on Document Analysis and Recognition, Scotland, pp. 71–75 (2003)

    Google Scholar 

  9. Zhang, L., Yip, A.M., Brown, M.S., Tan, C.L.: A unified framework for document restoration using in painting and shape-from-shading. In: Pattern Recognition, vol. 42, no. 11, pp. 2961–2978 (2009)

    Google Scholar 

  10. A model based book dewarping method using text line detection. In: Brown, B., Wu, M., Rongfeng, L., et al. (eds.) Proceedings of the Second International Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, pp. 63–70 (2007)

    Google Scholar 

  11. Active contours network to straighten distorted text lines. In: Lavialle, O., Molines, X., Angella, F., et al. (eds.) Proceedings of International Conference on Image Processing. – Thessaloniki, Greece, pp. 748–751 (2001)

    Google Scholar 

  12. Zhang, L., Zhang, Y., Tan, C.L.: An improved physically-based method for geometric restoration of distorted document images. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 4, pp. 728–734 (2008)

    Google Scholar 

  13. Ulges, A.: Document capture using stereo vision. In: Ulges, A., Lampert, C.H., Breuel, T.M. (eds.) Proceedings of the ACM Symposium on Document Engineering, pp. 198–200. ACM (2004)

    Google Scholar 

  14. Brown, M.S., Seales, W.B.: Document restoration using 3D shape: A general deskewing algorithm for arbitrarily warped documents. In: Proceedings of International Conference on Computer Vision, vol. 2, pp. 367–374, July 2001

    Google Scholar 

  15. Liang, J., De Menthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)

    Article  Google Scholar 

  16. Wu, C., Agam, G.: Document image dewarping for text/graphics recognition. In: IAPR International Workshop on Structural, Syntactic and Statistical Pattern Recognition, Windsor, Canada, 2002, pp. 348–357 (2002)

    Google Scholar 

  17. Mischke, L., Luther, W.: Document image de-warping based on detection of distorted text lines. In: Proceedings of International Conference on Image Analysis and Processing, Cagliari, Italy, pp. 1068–1075 (2005)

    Google Scholar 

  18. Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of 8th International Conference on Document Analysis and Recognition, Seoul, Korea, 2005, pp. 1001–1005

    Google Scholar 

  19. Zhang, Y., Liu, C., Ding, X., Zou, Y.: Arbitrary warped document image restoration based on segmentation and Thin-Plate Splines. In: Proceedings of 19-th International Conference on Pattern Recognition, Florida, USA, 2008, pp. 1–4 (2008)

    Google Scholar 

  20. Zhang, Zh., Tan, C.L.: Correcting document image warping based on regression of curved text lines. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 589–563 (2003)

    Google Scholar 

  21. Minaee, S., Wang, Y.: Text extraction from texture images using masked signal decomposition. In: 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Montreal, QC, 2017, pp. 1210–1214 (2017)

    Google Scholar 

  22. Elagouni, K., Garcia, C., Mamalet, F., Sebillot, P.: Combining multi-scale character recognition and linguistic knowledge for natural scene text OCR. In: 2012 10th IAPR International Workshop on Document Analysis Systems, Gold Cost, QLD, pp. 120–124 (2012)

    Google Scholar 

  23. Zhu, W., Liu, Y., Hao, L.: A novel OCR Approach based on document layout analysis and text block classification. In: 2016 12th International Conference on Computational Intelligence and Security (CIS), Wuxi, pp. 91–94 (2016)

    Google Scholar 

  24. Mai, B.Q.L., Huynh, T.H., Doan, A.D.: A study about the reconstruction of remote, low resolution mobile captured text images for OCR. In: 2014 International Conference on Advanced Technologies for Communications (ATC 2014), Hanoi, pp. 286–291 (2014)

    Google Scholar 

  25. Naganjaneyulu, G.V.S.S.K.R., Narasimhadhan, A.V., Venkatesh, K.: Performance evaluation of OCR on poor resolution text document images using different pre processing steps. In: 2014 IEEE Region 10 Conference, ENCON 2014, Bangkok, pp. 1–4 (2014)

    Google Scholar 

  26. Image data sets. http://www.iupr.com/archived- 2009/datasets

Download references

Acknowledgments

The authors are appreciative to colleagues for their support and appropriate suggestions, which allowed to improve the materials of the article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Orest Khamula .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tymchenko, O., Havrysh, B., Khamula, O., Tymchenko, O.O., Vasiuta, S. (2021). Geometric Distortion Correction Technique of Text Images. In: Shakhovska, N., Medykovskyy, M.O. (eds) Advances in Intelligent Systems and Computing V. CSIT 2020. Advances in Intelligent Systems and Computing, vol 1293. Springer, Cham. https://doi.org/10.1007/978-3-030-63270-0_67

Download citation

Publish with us

Policies and ethics