[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3568562.3568606acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article
Open access

Retrieving Keywords in Historical Vietnamese Stele Images Without Human Annotations

Published: 01 December 2022 Publication History

Abstract

Stone engravings on Vietnamese steles are an invaluable resource for historians to study the life of the villagers in the past. Thanks to pictures taken of stampings of the steles, they can be investigated today in the form of digital images. Automatic keyword spotting is a promising means to access the textual content of the images, allowing to retrieve steles that contain a certain query term. In this paper, we present a complete pipeline for retrieving Chu Nom characters in Vietnamese steles that operates fully automatically on the original images, without the need for preprocessing, segmentation, or human annotation. It combines a self-calibration approach to character detection using deep convolutional neural networks with a graph-based approach to keyword spotting that compares templates of the search term with detected characters based on structural properties.

References

[1]
H. Bunke and G. Allermann. 1983. Inexact Graph Matching for Structural Pattern Recognition. Pattern Recognition Letters 1, 4 (1983), 245–253.
[2]
Andreas Fischer, Marcus Liwicki, and Rolf Ingold (Eds.). 2020. Handwritten Historical Document Analysis, Recognition, and Retrieval — State of the Art and Future Trends. World Scientific.
[3]
A. Fischer, K. Riesen, and H. Bunke. 2010. Graph Similarity Features for HMM-Based Handwriting Recognition in Historical Documents. In Proc. Int. Conf. on Frontiers in Handwriting Recognition. 253–258.
[4]
A. Fischer, C. Y. Suen, V. Frinken, K. Riesen, and H. Bunke. 2015. Approximation of graph edit distance based on Hausdorff matching. Pat. Rec. 48, 2 (2015), 331–343.
[5]
Thai V. Hoang, Salvatore Tabbone, and Ngoc-Yen Pham. 2009. Extraction of Nom Text Regions from Stele Images Using Area Voronoi Diagram. In 10th International Conference on Document Analysis and Recognition. 921–925.
[6]
Shu Liu, Lu Qi, Haifang Qin, Jianping Shi, and Jiaya Jia. 2018. Path aggregation network for instance segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 8759–8768.
[7]
Paul Maergner, Vinaychandran Pondenkandath, Michele Alberti, Marcus Liwicki, Kaspar Riesen, Rolf Ingold, and Andreas Fischer. 2019. Combining graph edit distance and triplet networks for offline signature verification. Pattern Recognition Letters 125 (2019), 527–533.
[8]
Kha Cong Nguyen, Cuong Tuan Nguyen, and Masaki Nakagawa. 2020. Nom document digitalization by deep convolution neural networks. Pattern Recognition Letters 133 (2020), 8–16.
[9]
Philippe Papin. 2003. Aperçu sur le programme “Publication de l’inventaire et du corpus complet des inscriptions sur stèles du Viêt-Nam”. Bulletin de l’École française d’Extrême-Orient 90, 1(2003), 465–472.
[10]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779–788.
[11]
Pau Riba, Andreas Fischer, Josep Lladós, and Alicia Fornés. 2021. Learning graph edit distance by graph neural networks. Pattern Recognition 120(2021), 108132.
[12]
A. Sanfeliu and K. S. Fu. 1983. A distance measure between attributed relational graphs for pattern recognition. IEEE Trans. on Systems, Man, and Cybernetics 13, 3(1983), 353–363.
[13]
Anna Scius-Bertrand, Michael Jungo, Beat Wolf, Andreas Fischer, and Marc Bui. 2021. Annotation-Free Character Detection in Historical Vietnamese Stele Images. In Proc. 16th Int. Conf. on Document Analysis and Recognition (ICDAR). 432–447.
[14]
Anna Scius-Bertrand, Linda Studer, Andreas Fischer, and Marc Bui. 2022. Annotation-free keyword spotting in historical Vietnamese manuscripts using graph matching. In IAPR Joint International Workshops on Statistical Techniques in Pattern Recognition (SPR 2022) and Structural and Syntactic Pattern Recognition (SSPR 2022) : S+SSPR.
[15]
Anna Scius-Bertrand, Lars Voegtlin, Michele Alberti, Andreas Fischer, and Marc Bui. 2019. Layout analysis and text column segmentation for historical vietnamese steles. In Proceedings of the 5th International Workshop on Historical Document Imaging and Processing. 84–89.
[16]
Michael Stauffer, Andreas Fischer, and Kaspar Riesen. 2019. Graph-based Keyword Spotting. World Scientific.
[17]
Sebastian Sudholt and Gernot A Fink. 2016. Phocnet: A deep convolutional neural network for word spotting in handwritten documents. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE, 277–282.
[18]
Chien-Yao Wang, Hong-Yuan Mark Liao, Yueh-Hua Wu, Ping-Yang Chen, Jun-Wei Hsieh, and I-Hau Yeh. 2020. CSPNet: A new backbone that can enhance learning capability of CNN. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 390–391.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SoICT '22: Proceedings of the 11th International Symposium on Information and Communication Technology
December 2022
474 pages
ISBN:9781450397254
DOI:10.1145/3568562
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 December 2022

Check for updates

Author Tags

  1. Chu Nom
  2. Hausdorff edit distance
  3. Vietnamese stele images
  4. annotation-free
  5. historical documents
  6. keyword spotting

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • ERC Advanced Grant

Conference

SoICT 2022

Acceptance Rates

Overall Acceptance Rate 147 of 318 submissions, 46%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 203
    Total Downloads
  • Downloads (Last 12 months)127
  • Downloads (Last 6 weeks)33
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media