Abstract
The rapid distribution of digital cameras has caused several new problems related to text recognition. Based on experimental studies, it was revealed that existing OCR systems cannot cope with a complex perspective and geometric distortions that arise during photographing text documents. Therefore, it is necessary to apply text documents pre-processing, so the text lines were straight and horizontal. This article briefly considers the methods document pre-processing, and found that it depends on the type of distortion and is not universal. We proposed information technology and a new method involving the mathematical raising of straightened text lines on the image and heterogeneous distortion correction based on a page surface transformation model. This technology is more reliable than others as it is universal and corrects any type of geometric distortion, including a combination of several types of distortion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sharma, M.D.A., Kumar, R.: A comparative study of edge detectors in digital image processing. In: 2017 4th International Conference on Signal Processing, Computing and Control (ISPCC), Solan, pp. 246–250 (2017)
Yadav, S., Bhanushali, P., Jain, S., Kaur, T.: Word matching and retrieval from images. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, pp. 318–323(2017)
Plaza, J.. Plaza, A.P., Sanchez, S.: Parallel Hyperspectral Image and Signal Processing [Applications Corner]. In: IEEE Signal Processing Magazine, vol. 28, no. 3, pp. 119–126, May 2011
Nykolaichuk, Y., Krulikovskyi, B., Gryga, V., Davletova, A.: Computational accelerators for analog-to-digital and digital processing of sensor signals in information measuring systems. In: 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET), Lviv-Slavske, Ukraine, pp. 148–151 (2018)
Cheung, G., Magli, E., Tanaka, Y., Ng, M.K.: Graph spectral image processing. In: Proceedings of the IEEE, vol. 106, no. 5, pp. 907–930, May 2018
Elsalamony, H.A.: Detecting distorted and benign blood cells using the Hough transform based on neural networks and decision trees. In: Emerging Trends in Image Processing, Computer Vision and Pattern Recognition, pp. 457–473 (2015)
Mori, S.: Historical review of OCR research and development. In: Mori, S., Suen, C.Y., Yamamoto, K. (eds.) Proceedings of the IEEE, vol. 80, no. 7, pp. 1029–1058 (1992)
Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of 7th International Conference on Document Analysis and Recognition, Scotland, pp. 71–75 (2003)
Zhang, L., Yip, A.M., Brown, M.S., Tan, C.L.: A unified framework for document restoration using in painting and shape-from-shading. In: Pattern Recognition, vol. 42, no. 11, pp. 2961–2978 (2009)
A model based book dewarping method using text line detection. In: Brown, B., Wu, M., Rongfeng, L., et al. (eds.) Proceedings of the Second International Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, pp. 63–70 (2007)
Active contours network to straighten distorted text lines. In: Lavialle, O., Molines, X., Angella, F., et al. (eds.) Proceedings of International Conference on Image Processing. – Thessaloniki, Greece, pp. 748–751 (2001)
Zhang, L., Zhang, Y., Tan, C.L.: An improved physically-based method for geometric restoration of distorted document images. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 4, pp. 728–734 (2008)
Ulges, A.: Document capture using stereo vision. In: Ulges, A., Lampert, C.H., Breuel, T.M. (eds.) Proceedings of the ACM Symposium on Document Engineering, pp. 198–200. ACM (2004)
Brown, M.S., Seales, W.B.: Document restoration using 3D shape: A general deskewing algorithm for arbitrarily warped documents. In: Proceedings of International Conference on Computer Vision, vol. 2, pp. 367–374, July 2001
Liang, J., De Menthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)
Wu, C., Agam, G.: Document image dewarping for text/graphics recognition. In: IAPR International Workshop on Structural, Syntactic and Statistical Pattern Recognition, Windsor, Canada, 2002, pp. 348–357 (2002)
Mischke, L., Luther, W.: Document image de-warping based on detection of distorted text lines. In: Proceedings of International Conference on Image Analysis and Processing, Cagliari, Italy, pp. 1068–1075 (2005)
Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of 8th International Conference on Document Analysis and Recognition, Seoul, Korea, 2005, pp. 1001–1005
Zhang, Y., Liu, C., Ding, X., Zou, Y.: Arbitrary warped document image restoration based on segmentation and Thin-Plate Splines. In: Proceedings of 19-th International Conference on Pattern Recognition, Florida, USA, 2008, pp. 1–4 (2008)
Zhang, Zh., Tan, C.L.: Correcting document image warping based on regression of curved text lines. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 589–563 (2003)
Minaee, S., Wang, Y.: Text extraction from texture images using masked signal decomposition. In: 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Montreal, QC, 2017, pp. 1210–1214 (2017)
Elagouni, K., Garcia, C., Mamalet, F., Sebillot, P.: Combining multi-scale character recognition and linguistic knowledge for natural scene text OCR. In: 2012 10th IAPR International Workshop on Document Analysis Systems, Gold Cost, QLD, pp. 120–124 (2012)
Zhu, W., Liu, Y., Hao, L.: A novel OCR Approach based on document layout analysis and text block classification. In: 2016 12th International Conference on Computational Intelligence and Security (CIS), Wuxi, pp. 91–94 (2016)
Mai, B.Q.L., Huynh, T.H., Doan, A.D.: A study about the reconstruction of remote, low resolution mobile captured text images for OCR. In: 2014 International Conference on Advanced Technologies for Communications (ATC 2014), Hanoi, pp. 286–291 (2014)
Naganjaneyulu, G.V.S.S.K.R., Narasimhadhan, A.V., Venkatesh, K.: Performance evaluation of OCR on poor resolution text document images using different pre processing steps. In: 2014 IEEE Region 10 Conference, ENCON 2014, Bangkok, pp. 1–4 (2014)
Image data sets. http://www.iupr.com/archived- 2009/datasets
Acknowledgments
The authors are appreciative to colleagues for their support and appropriate suggestions, which allowed to improve the materials of the article.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tymchenko, O., Havrysh, B., Khamula, O., Tymchenko, O.O., Vasiuta, S. (2021). Geometric Distortion Correction Technique of Text Images. In: Shakhovska, N., Medykovskyy, M.O. (eds) Advances in Intelligent Systems and Computing V. CSIT 2020. Advances in Intelligent Systems and Computing, vol 1293. Springer, Cham. https://doi.org/10.1007/978-3-030-63270-0_67
Download citation
DOI: https://doi.org/10.1007/978-3-030-63270-0_67
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63269-4
Online ISBN: 978-3-030-63270-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)