default search action
16th ICDAR 2021: Lausanne, Switzerland - Part II
- Josep Lladós, Daniel Lopresti, Seiichi Uchida:
16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part II. Lecture Notes in Computer Science 12822, Springer 2021, ISBN 978-3-030-86330-2
Document Analysis for Literature Search
- Rongyu Cao, Hongwei Li, Ganbin Zhou, Ping Luo:
Towards Document Panoptic Segmentation with Pinpoint Accuracy: Method and Evaluation. 3-18 - Ayush Kumar Shah, Abhisek Dey, Richard Zanibbi:
A Math Formula Extraction and Evaluation Framework for PDF Documents. 19-34 - Laura E. Brandt, William T. Freeman:
Toward Automatic Interpretation of 3D Plots. 35-50
Document Summarization and Translation
- Marta Esther Vicente, Robiert Sepúlveda-Torres, Cristina Barros, Estela Saquete, Elena Lloret:
Can Text Summarization Enhance the Headline Stance Detection Task? Benefits and Drawbacks. 53-67 - Justin Wood, Wei Wang, Corey W. Arnold:
The Biased Coin Flip Process for Nonparametric Topic Modeling. 68-83 - Sayali Kulkarni, Sheide Chammas, Wan Zhu, Fei Sha, Eugene Ie:
CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization. 84-98 - Tonghua Su, Shuchen Liu, Shengjie Zhou:
RTNet: An End-to-End Method for Handwritten Text Image Translation. 99-113
Multimedia Document Analysis
- Ziyi Zhu, Liangcai Gao, Yibo Li, Yilun Huang, Lin Du, Ning Lu, Xianfeng Wang:
NTable: A Dataset for Camera-Based Table Detection. 117-129 - Tianqi Ji, Jun Li, Jianhua Xu:
Label Selection Algorithm Based on Boolean Interpolative Decomposition with Sequential Backward Selection for Multi-label Classification. 130-144 - Quang Huy Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Masaki Nakagawa:
GSSF: A Generative Sequence Similarity Function Based on a Seq2Seq Model for Clustering Online Handwritten Mathematical Answers. 145-159 - Vaibhavi Gupta, Vinay Detani, Vivek Khokar, Chiranjoy Chattopadhyay:
C2VNet: A Deep Learning Framework Towards Comic Strip to Audio-Visual Scene Synthesis. 160-175 - Jie He, Xingjiao Wu, Wenxin Hu, Jing Yang:
LSTMVAEF: Vivid Layout via LSTM-Based Variational Autoencoder Framework. 176-189
Mobile Text Recognition
- Andrii Grygoriev, Illya Degtyarenko, Ivan Deriuga, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko:
HCRNN: A Novel Architecture for Fast Online Handwritten Stroke Classification. 193-208 - Daniil Matalov, Elena Limonova, Natalya Skoryukina, Vladimir V. Arlazarov:
RFDoc: Memory Efficient Local Descriptors for ID Documents Localization and Classification. 209-224 - Haibo Qin, Chun Yang, Xiaobin Zhu, Xu-Cheng Yin:
Dynamic Receptive Field Adaptation for Attention-Based Text Recognition. 225-239 - Ryota Yoshihashi, Tomohiro Tanaka, Kenji Doi, Takumi Fujino, Naoaki Yamashita:
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition. 240-257 - Yulia S. Chernyshova, Ekaterina Emelianova, Alexander Sheshkus, Vladimir V. Arlazarov:
MIDV-LAIT: A Challenging Dataset for Recognition of IDs with Perso-Arabic, Thai, and Indian Scripts. 258-272 - Konstantin B. Bulatov, Vladimir V. Arlazarov:
Determining Optimal Frame Processing Strategies for Real-Time Document Recognition Systems. 273-288
Document Analysis for Social Good
- Eugen Rusakov, Turna Somel, Gerfrid G. W. Müller, Gernot A. Fink:
Embedded Attributes for Cuneiform Sign Spotting. 291-305 - Adrià Molina, Pau Riba, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Date Estimation in the Wild of Scanned Historical Photos: An Image Retrieval Approach. 306-320 - Muhammad Osama Zeeshan, Imran Siddiqi, Momina Moetesum:
Two-Step Fine-Tuned Convolutional Neural Networks for Multi-label Classification of Children's Drawings. 321-334 - Tamal Chowdhury, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Ramachandra Raghavendra, Sukalpa Chanda:
DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region. 335-350 - Fatma Najar, Nizar Bouguila:
Sparse Document Analysis Using Beta-Liouville Naive Bayes with Vocabulary Knowledge. 351-363 - Sk Md Obaidullah, Mridul Ghosh, Himadri Mukherjee, Kaushik Roy, Umapada Pal:
Automatic Signature-Based Writer Identification in Mixed-Script Scenarios. 364-377
Indexing and Retrieval of Documents
- Pau Riba, Adrià Molina, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting. 381-395 - Trung Tan Ngo, Hung Tuan Nguyen, Masaki Nakagawa:
A-VLAD: An End-to-End Attention-Based Neural Network for Writer Identification in Historical Documents. 396-409 - Nhu-Van Nguyen, Christophe Rigaud, Arnaud Revel, Jean-Christophe Burie:
Manga-MMTL: Multimodal Multitask Transfer Learning for Manga Character Analysis. 410-425 - Enrique Vidal, Alejandro H. Toselli:
Probabilistic Indexing and Search for Hyphenated Words. 426-442
Physical and Logical Layout Analysis
- Sieben Bocklandt, Gust Verbruggen, Thomas Winters:
SandSlide: Automatic Slideshow Normalization. 445-461 - Alejandro H. Toselli, Si Wu, David A. Smith:
Digital Editions as Distant Supervision for Layout Analysis of Printed Books. 462-476 - Prema Satish Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla:
Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts. 477-491 - Oldrich Kodym, Michal Hradis:
Page Layout Analysis System for Unconstrained Historic Documents. 492-506 - José Ramón Prieto, Enrique Vidal:
Improved Graph Methods for Table Layout Understanding. 507-522 - Berat Kurar Barakat, Ahmad Droby, Raid Saabni, Jihad El-Sana:
Unsupervised Learning of Text Line Segmentation by Differentiating Coarse Patterns. 523-537
Recognition of Tables and Formulas
- Yibo Li, Yilun Huang, Ziyi Zhu, Lemeng Pan, Yongshuai Huang, Lin Du, Zhi Tang, Liangcai Gao:
Rethinking Table Structure Recognition Using Sequence Labeling Methods. 541-553 - Harsh Desai, Pratik Kayal, Mayank Singh:
TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables. 554-569 - Wenqi Zhao, Liangcai Gao, Zuoyu Yan, Shuai Peng, Lin Du, Ziyin Zhang:
Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer. 570-584 - Umar Khan, Sohaib Zahid, Muhammad Asad Ali, Adnan Ul-Hasan, Faisal Shafait:
TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition. 585-601 - Haisong Ding, Kai Chen, Qiang Huo:
An Encoder-Decoder Approach to Handwritten Mathematical Expression Recognition with Multi-head Attention and Stacked Decoder. 602-616 - Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen, Masaki Nakagawa:
Global Context for Improving Recognition of Online Handwritten Mathematical Expressions. 617-631 - Koji Ichikawa:
Image-Based Relation Classification Approach for Table Structure Recognition. 632-647 - Shuai Peng, Liangcai Gao, Ke Yuan, Zhi Tang:
Image to LaTeX with Graph Neural Network for Mathematical Formula Recognition. 648-663
NLP for Document Understanding
- Badal Agrawal, Mohit Mishra, Varun Parashar:
A Novel Method for Automated Suggestion of Similar Software Incidents Using 2-Stage Filtering: Findings on Primary Data. 667-682 - Lianxi Wang, Xiaotian Lin, Nankai Lin:
Research on Pseudo-label Technology for Multi-label News Classification. 683-698 - Ahmed Hamdi, Elodie Carel, Aurélie Joseph, Mickaël Coustaty, Antoine Doucet:
Information Extraction from Invoices. 699-714 - Apoorva Singh, Sriparna Saha:
Are You Really Complaining? A Multi-task Framework for Complaint Identification, Emotion, and Sentiment Classification. 715-731 - Rafal Powalski, Lukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michal Pietruszka, Gabriela Palka:
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer. 732-747 - Luisa März, Stefan Schweter, Nina Pörner, Benjamin Roth, Hinrich Schütze:
Data Centric Domain Adaptation for Historical Text with OCR Errors. 748-761 - Nafaa Haffar, Rami Ayadi, Emna Hkiri, Mounir Zrigui:
Temporal Ordering of Events via Deep Neural Networks. 762-777 - Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Document Collection Visual Question Answering. 778-792 - Jirí Martínek, Pavel Král, Ladislav Lenc:
Dialogue Act Recognition Using Visual Information. 793-807 - Oliver Tüselmann, Fabian Wolf, Gernot A. Fink:
Are End-to-End Systems Really Necessary for NER on Handwritten Document Images? 808-822 - Harsh Kohli:
Training Bi-Encoders for Word Sense Disambiguation. 823-837 - Freddy C. Chua, Nigel P. Duffy:
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction. 838-853 - Djedjiga Belhadj, Yolande Belaïd, Abdel Belaïd:
Consideration of the Word's Neighborhood in GATs for Information Extraction in Semi-structured Documents. 854-869
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.