Geometric Distortion Correction Technique of Text Images

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1293))

Included in the following conference series:

Conference on Computer Science and Information Technologies

460 Accesses

Abstract

The rapid distribution of digital cameras has caused several new problems related to text recognition. Based on experimental studies, it was revealed that existing OCR systems cannot cope with a complex perspective and geometric distortions that arise during photographing text documents. Therefore, it is necessary to apply text documents pre-processing, so the text lines were straight and horizontal. This article briefly considers the methods document pre-processing, and found that it depends on the type of distortion and is not universal. We proposed information technology and a new method involving the mathematical raising of straightened text lines on the image and heterogeneous distortion correction based on a page surface transformation model. This technology is more reliable than others as it is universal and corrects any type of geometric distortion, including a combination of several types of distortion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 103.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 129.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Rule-based perspective rectification for Chinese text in natural scene images

Article 17 February 2021

An Effective Approach to Text Detection and Recognition in Degraded Historical Documents

A Combined Method for Document Image Enhancement Using Image Smoothing, Gray-Level Reduction and Thresholding

References

Sharma, M.D.A., Kumar, R.: A comparative study of edge detectors in digital image processing. In: 2017 4th International Conference on Signal Processing, Computing and Control (ISPCC), Solan, pp. 246–250 (2017)
Google Scholar
Yadav, S., Bhanushali, P., Jain, S., Kaur, T.: Word matching and retrieval from images. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, pp. 318–323(2017)
Google Scholar
Plaza, J.. Plaza, A.P., Sanchez, S.: Parallel Hyperspectral Image and Signal Processing [Applications Corner]. In: IEEE Signal Processing Magazine, vol. 28, no. 3, pp. 119–126, May 2011
Google Scholar
Nykolaichuk, Y., Krulikovskyi, B., Gryga, V., Davletova, A.: Computational accelerators for analog-to-digital and digital processing of sensor signals in information measuring systems. In: 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET), Lviv-Slavske, Ukraine, pp. 148–151 (2018)
Google Scholar
Cheung, G., Magli, E., Tanaka, Y., Ng, M.K.: Graph spectral image processing. In: Proceedings of the IEEE, vol. 106, no. 5, pp. 907–930, May 2018
Google Scholar
Elsalamony, H.A.: Detecting distorted and benign blood cells using the Hough transform based on neural networks and decision trees. In: Emerging Trends in Image Processing, Computer Vision and Pattern Recognition, pp. 457–473 (2015)
Google Scholar
Mori, S.: Historical review of OCR research and development. In: Mori, S., Suen, C.Y., Yamamoto, K. (eds.) Proceedings of the IEEE, vol. 80, no. 7, pp. 1029–1058 (1992)
Google Scholar
Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of 7th International Conference on Document Analysis and Recognition, Scotland, pp. 71–75 (2003)
Google Scholar
Zhang, L., Yip, A.M., Brown, M.S., Tan, C.L.: A unified framework for document restoration using in painting and shape-from-shading. In: Pattern Recognition, vol. 42, no. 11, pp. 2961–2978 (2009)
Google Scholar
A model based book dewarping method using text line detection. In: Brown, B., Wu, M., Rongfeng, L., et al. (eds.) Proceedings of the Second International Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, pp. 63–70 (2007)
Google Scholar
Active contours network to straighten distorted text lines. In: Lavialle, O., Molines, X., Angella, F., et al. (eds.) Proceedings of International Conference on Image Processing. – Thessaloniki, Greece, pp. 748–751 (2001)
Google Scholar
Zhang, L., Zhang, Y., Tan, C.L.: An improved physically-based method for geometric restoration of distorted document images. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 4, pp. 728–734 (2008)
Google Scholar
Ulges, A.: Document capture using stereo vision. In: Ulges, A., Lampert, C.H., Breuel, T.M. (eds.) Proceedings of the ACM Symposium on Document Engineering, pp. 198–200. ACM (2004)
Google Scholar
Brown, M.S., Seales, W.B.: Document restoration using 3D shape: A general deskewing algorithm for arbitrarily warped documents. In: Proceedings of International Conference on Computer Vision, vol. 2, pp. 367–374, July 2001
Google Scholar
Liang, J., De Menthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)
Article Google Scholar
Wu, C., Agam, G.: Document image dewarping for text/graphics recognition. In: IAPR International Workshop on Structural, Syntactic and Statistical Pattern Recognition, Windsor, Canada, 2002, pp. 348–357 (2002)
Google Scholar
Mischke, L., Luther, W.: Document image de-warping based on detection of distorted text lines. In: Proceedings of International Conference on Image Analysis and Processing, Cagliari, Italy, pp. 1068–1075 (2005)
Google Scholar
Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of 8th International Conference on Document Analysis and Recognition, Seoul, Korea, 2005, pp. 1001–1005
Google Scholar
Zhang, Y., Liu, C., Ding, X., Zou, Y.: Arbitrary warped document image restoration based on segmentation and Thin-Plate Splines. In: Proceedings of 19-th International Conference on Pattern Recognition, Florida, USA, 2008, pp. 1–4 (2008)
Google Scholar
Zhang, Zh., Tan, C.L.: Correcting document image warping based on regression of curved text lines. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 589–563 (2003)
Google Scholar
Minaee, S., Wang, Y.: Text extraction from texture images using masked signal decomposition. In: 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Montreal, QC, 2017, pp. 1210–1214 (2017)
Google Scholar
Elagouni, K., Garcia, C., Mamalet, F., Sebillot, P.: Combining multi-scale character recognition and linguistic knowledge for natural scene text OCR. In: 2012 10th IAPR International Workshop on Document Analysis Systems, Gold Cost, QLD, pp. 120–124 (2012)
Google Scholar
Zhu, W., Liu, Y., Hao, L.: A novel OCR Approach based on document layout analysis and text block classification. In: 2016 12th International Conference on Computational Intelligence and Security (CIS), Wuxi, pp. 91–94 (2016)
Google Scholar
Mai, B.Q.L., Huynh, T.H., Doan, A.D.: A study about the reconstruction of remote, low resolution mobile captured text images for OCR. In: 2014 International Conference on Advanced Technologies for Communications (ATC 2014), Hanoi, pp. 286–291 (2014)
Google Scholar
Naganjaneyulu, G.V.S.S.K.R., Narasimhadhan, A.V., Venkatesh, K.: Performance evaluation of OCR on poor resolution text document images using different pre processing steps. In: 2014 IEEE Region 10 Conference, ENCON 2014, Bangkok, pp. 1–4 (2014)
Google Scholar
Image data sets. http://www.iupr.com/archived- 2009/datasets

Download references

Acknowledgments

The authors are appreciative to colleagues for their support and appropriate suggestions, which allowed to improve the materials of the article.

Author information

Authors and Affiliations

University of Warmia and Mazury, Olsztyn, Poland
Oleksandr Tymchenko
Ukrainian Academy of Printing, 79020, 19, Pid Holosko St, Lviv, 79020, Ukraine
Bohdana Havrysh, Orest Khamula, Oleksandr O. Tymchenko & Svitlana Vasiuta

Authors

Oleksandr Tymchenko
View author publications
You can also search for this author in PubMed Google Scholar
Bohdana Havrysh
View author publications
You can also search for this author in PubMed Google Scholar
Orest Khamula
View author publications
You can also search for this author in PubMed Google Scholar
Oleksandr O. Tymchenko
View author publications
You can also search for this author in PubMed Google Scholar
Svitlana Vasiuta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Orest Khamula .

Editor information

Editors and Affiliations

Department of Artificial Intelligence, Lviv Polytechnic National University, Lviv, Ukraine
Natalya Shakhovska
Institute of Computer Science and Information Technologies, Lviv Polytechnic National University, Lviv, Ukraine
Mykola O. Medykovskyy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tymchenko, O., Havrysh, B., Khamula, O., Tymchenko, O.O., Vasiuta, S. (2021). Geometric Distortion Correction Technique of Text Images. In: Shakhovska, N., Medykovskyy, M.O. (eds) Advances in Intelligent Systems and Computing V. CSIT 2020. Advances in Intelligent Systems and Computing, vol 1293. Springer, Cham. https://doi.org/10.1007/978-3-030-63270-0_67

Download citation

DOI: https://doi.org/10.1007/978-3-030-63270-0_67
Published: 23 December 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63269-4
Online ISBN: 978-3-030-63270-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Geometric Distortion Correction Technique of Text Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Rule-based perspective rectification for Chinese text in natural scene images

An Effective Approach to Text Detection and Recognition in Degraded Historical Documents

A Combined Method for Document Image Enhancement Using Image Smoothing, Gray-Level Reduction and Thresholding

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Geometric Distortion Correction Technique of Text Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Rule-based perspective rectification for Chinese text in natural scene images

An Effective Approach to Text Detection and Recognition in Degraded Historical Documents

A Combined Method for Document Image Enhancement Using Image Smoothing, Gray-Level Reduction and Thresholding

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation