default search action
27th ICPR 2024: Kolkata, India - Part XXXI
- Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal:
Pattern Recognition - 27th International Conference, ICPR 2024, Kolkata, India, December 1-5, 2024, Proceedings, Part XXXI. Lecture Notes in Computer Science 15331, Springer 2025, ISBN 978-3-031-78118-6 - Xiangyu Xie, Yuxuan Zhou, Liangcai Gao:
ODTr: Transformer Integrating OCR Auxiliary Map and Image Depth Information for Document Image Unwarping. 1-12 - Zhiwang Han, Nurbiya Yadikar, Xuebin Xu, Alimjan Aysa, Kurban Ubul:
Oracle Character Recognition Based on Attention Enhancement and Multi-level Feature Fusion. 13-28 - Xinyue Zhou, Guanting Li, Nanfeng Jiang, Da-Han Wang, Xu-Yao Zhang, Shunzhi Zhu:
DocHFormer: Document Image Dewarping via Harmonized Modeling of Hierarchical Priors. 29-44 - Fan Yang, Xinyue Zhou, Nanfeng Jiang, Da-Han Wang, Xu-Yao Zhang, Guantin Li, Wang Man, Yun Wu:
Document Image Shadow Removal via Frequency Information-Oriented Network. 45-60 - Jiseok Lee, Masaki Akiba, Brian Kenji Iwana:
Improving Online Handwriting Recognition with Transfer Learning Using Out-of-Domain and Different-Dimensional Sources. 61-75 - Zening Lin, Jiapeng Wang, Wenhui Liao, Weicong Dai, Longfei Xiong, Lianwen Jin:
ROISER: Towards Real World Semantic Entity Recognition from Visually-Rich Documents. 76-90 - Runbo Zhao, Jun Jie Ou Yang, Chen Gao, Xugong Qin, Gangyan Zeng, Xiaoxu Hu, Peng Zhang:
Perception-Enhanced Generative Transformer for Key Information Extraction from Documents. 91-106 - Md. Maruf Hasan, Shawly Ahsan, Mohammed Moshiul Hoque, M. Ali Akber Dewan:
MuLAD: Multimodal Aggression Detection from Social Media Memes Exploiting Visual and Textual Features. 107-123 - Wenbo Guan, Xiaoqian Li, Jiyu Lu, Jun Zhou:
[inline-graphic not available: see fulltext] : A Voting-Based Paradigm for Enhancing Retrieval Augmented Generation. 124-138 - Haocheng Lan, Jie Ou, Zhaokun Wang, Wenhong Tian:
Improving Chinese Emotion Classification Based on Bilingual Feature Fusion. 139-153 - Mikhail Kulyabin, Gleb Sokolov, Aleksandr Galaida, Andreas K. Maier, Tomás Arias-Vergara:
SNOBERT: A Benchmark for Clinical Notes Entity Linking in the SNOMED CT Clinical Terminology. 154-163 - P. P. Afeefa, Raju Hazari, Pranesh Das:
Enhancing Automated Short Answer Grading with Prompt-Driven Augmentation and Prompt Adaptive Oversampling. 164-182 - Zi-Hao Lin, Shun-Xin Xiao, Zirong Chen, Jian-Min Li, Da-Han Wang, Xu-Yao Zhang:
SANS: Spatial-Aware Neural Solver for Plane Geometry Problem. 183-196 - Kirtilekha Bhesra, Akshay Agarwal:
A Multi-modal Framework to Counter Hate Speeches. 197-207 - Zhaoxi Liu, Gang Zhou, Runlin He, Mengnan Zhang, Zhenhong Jia, Jing Ma:
TBIA-DBNet: A Two-Branch Image-Adaptive DBNet for Scene Text Detection in Real-World Foggy Scenes. 208-221 - Souhaila Djaffal, Yasmina Benmabrouk, Chawki Djeddi, Moisés Díaz Cabrera:
Breaking Boundaries: Enhancing Script Identification Using a Learnable MULLER Resizer. 222-236 - Shuo Xu, Zeming Zhuang, Mingjun Li, Feng Su:
Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention. 237-253 - Xin Che, Mohammad Akbari, Shaoxin Li, David (Ming Xuan) Yue, Yong Zhang, Lingyang Chu:
Primary Key Free Watermarking for Numerical Tabular Datasets in Machine Learning. 254-270 - Kecia Gomes de Moura, Rafael Menelau Oliveira E. Cruz, Robert Sabourin:
Offline Handwritten Signature Verification Using a Stream-Based Approach. 271-286 - Chao-Qun Lin, Da-Han Wang, Yanfei Su, De-Wu Ge, Xu-Yao Zhang:
OCR4HSV: A Multi-task Learning Approach for Handwritten Signature Verification. 287-302 - Song-Liang Pan, Da-Han Wang, Nanfeng Jiang, Xu-Yao Zhang, Shunzhi Zhu:
Learning Explicit Radical Representations for Zero-Shot Chinese Character Recognition. 303-317 - Mohamed Hjaiej, Imen Ben Cheikh, Heithem Abbes:
Deep Learning for Arabic Word Classification: Leveraging Transfer Learning and Grad-CAM for Morphological Analysis. 318-330 - Sunil Kumar Kopparapu, Ashish Panda:
A Cost Minimization Approach to Fix the Vocabulary Size in a Tokenizer for an End-to-End ASR System. 331-342 - Fengrun Zhang, Xiang Xie, Kai Guo:
ASD-Diffusion: Anomalous Sound Detection with Diffusion Models. 343-355 - Ravindrakumar M. Purohit, Arushi Srivastava, Hemant A. Patil:
FCHiFi-GAN: Aggrandizing Fast Convergence with Batchwise Normalization. 356-372 - Peishan Li, Yonghong Zhang, Junfei Wang, Guangyi Ma, Ziwei Yuan:
Adaptive Enhanced Reversible Flow Model for Remote Sensing Image Super Resolution. 373-388 - Qian Cao, Dongdong Zhang, Xiaolei Zhang:
Saliency-Based Neural Representation for Videos. 389-403 - Xinyuan Cheng, Dongdong Zhang, Xiaolei Zhang:
HNRC: Lightweight Image Compression with Hybrid Neural Representation. 404-418
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.