DOI: 10.1145/3638837.3638879
research-article

Table Detection Method Based on Faster-RCNN and Window Attention

Published: 07 March 2024

Abstract

As an important carrier of information, tables offer high data-storage density, conciseness, and intuitiveness, and are widely used in office work and daily life. Owing to the complexity of table structures and the diversity of presentation formats, the automated processing of large numbers of image-based tables has long been a challenge in document recognition. This paper addresses the table detection task and proposes a detection algorithm that uses an improved window self-attention network to extract features from table images. Built on a two-stage object detection framework, it introduces local feature extraction blocks and inverted residual feed-forward network blocks, and designs a feature pyramid network over the backbone, improving the model's ability to learn the spatial layout of documents and thereby its detection performance. The effectiveness of the proposed method is verified through comparative experiments on publicly available datasets.
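
The abstract names three architectural ingredients: window self-attention for feature extraction, local feature extraction blocks, and inverted residual feed-forward network blocks inside the backbone of a two-stage detector. The paper's exact layer layout is not given here, so the following PyTorch sketch is only a rough illustration of how such a block could be assembled; the depthwise-convolution local branch, all dimensions, and all layer names are assumptions, not the authors' implementation.

# A minimal sketch (not the authors' code) of a block combining window
# self-attention, a depthwise-convolution "local feature extraction" step,
# and an inverted-residual feed-forward network. Sizes are illustrative.
import torch
import torch.nn as nn

class WindowAttentionBlock(nn.Module):
    def __init__(self, dim=96, window=7, heads=3, expand=4):
        super().__init__()
        self.window, self.heads = window, heads
        # Local feature extraction: 3x3 depthwise convolution over the feature map.
        self.local = nn.Conv2d(dim, dim, 3, padding=1, groups=dim)
        self.norm1 = nn.LayerNorm(dim)
        self.qkv = nn.Linear(dim, dim * 3)
        self.proj = nn.Linear(dim, dim)
        self.norm2 = nn.LayerNorm(dim)
        # Inverted-residual feed-forward: expand channels, depthwise conv, project back.
        self.ffn = nn.Sequential(
            nn.Conv2d(dim, dim * expand, 1),
            nn.Conv2d(dim * expand, dim * expand, 3, padding=1, groups=dim * expand),
            nn.GELU(),
            nn.Conv2d(dim * expand, dim, 1),
        )

    def forward(self, x):                  # x: (B, C, H, W); H, W divisible by window
        B, C, H, W = x.shape
        x = x + self.local(x)              # inject local context before attention
        w = self.window
        # Partition the map into non-overlapping w x w windows and attend within each.
        t = x.view(B, C, H // w, w, W // w, w).permute(0, 2, 4, 3, 5, 1)
        t = t.reshape(-1, w * w, C)        # (num_windows*B, tokens, C)
        h = self.norm1(t)
        q, k, v = self.qkv(h).chunk(3, dim=-1)
        split = lambda z: z.reshape(z.shape[0], -1, self.heads, C // self.heads).transpose(1, 2)
        q, k, v = split(q), split(k), split(v)
        attn = (q @ k.transpose(-2, -1)) / (C // self.heads) ** 0.5
        out = (attn.softmax(dim=-1) @ v).transpose(1, 2).reshape(t.shape[0], -1, C)
        t = self.norm2(t + self.proj(out)) # residual around window attention
        # Restore the (B, C, H, W) layout and apply the inverted-residual FFN.
        t = t.view(B, H // w, W // w, w, w, C).permute(0, 5, 1, 3, 2, 4).reshape(B, C, H, W)
        return t + self.ffn(t)             # residual around the feed-forward path

feat = torch.randn(1, 96, 56, 56)          # a backbone feature map
print(WindowAttentionBlock()(feat).shape)  # torch.Size([1, 96, 56, 56])

In a detector of the kind described, stacks of such blocks would form the feature-extraction stages whose multi-scale outputs feed a feature pyramid network and then the two-stage detection head.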

Published In

ICNCC '23: Proceedings of the 2023 12th International Conference on Networks, Communication and Computing
December 2023
310 pages
ISBN: 9798400709265
DOI: 10.1145/3638837
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Inverted residual feed-forward network
  2. Self-attention
  3. Table detection

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICNCC 2023
