Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- short-paperMay 2024
Semantic interlinking of Immigration Data using LLMs for Knowledge Graph Construction
WWW '24: Companion Proceedings of the ACM Web Conference 2024Pages 605–608https://doi.org/10.1145/3589335.3651557The challenge of managing immigration data is exacerbated by its reliance on paper-based, evidence-driven records maintained by legal professionals, creating obstacles for efficient processing and analysis due to inherent trust issues with AI-based ...
- research-articleOctober 2023
TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content
- Avinash Anand,
- Raj Jaiswal,
- Pijush Bhuyan,
- Mohit Gupta,
- Siddhesh Bangar,
- Md. Modassir Imam,
- Rajiv Ratn Shah,
- Shin'ichi Satoh
MMIR '23: Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information RetrievalPages 11–18https://doi.org/10.1145/3606040.3617444The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive capabilities of various ...
- panelMarch 2023
SketchRec 2023: 3rd Workshop on Sketch Recognition
IUI '23 Companion: Companion Proceedings of the 28th International Conference on Intelligent User InterfacesPage 1https://doi.org/10.1145/3581754.3584184Sketch recognition is the interpretation of hand-drawn diagrams, and seeks to understand the users’ intent while allowing them to draw unconstrained diagrams. Research in sketch recognition has been on-going for approximately half a century, and has ...
- research-articleDecember 2022
Fast and Accurate Deep Learning Model for Stamps Detection for Embedded Devices
Pattern Recognition and Image Analysis (SPPRIA), Volume 32, Issue 4Pages 772–779https://doi.org/10.1134/S1054661822040046AbstractThe search for stamps on images is necessary to verify the authenticity of a document and extract valuable textual information contained in them. Despite the vast number of methods for detecting stamps, most of them are not universal and are ...
- letterJanuary 2021
Corpus processing service: A Knowledge Graph platform to perform deep data exploration on corpora
AbstractKnowledge Graphs have been fast emerging as the de facto standard to model and explore knowledge in weakly structured data. Large corpora of documents constitute a source of weakly structured data of particular interest for both the academic and ...
-
- research-articleApril 2020Honorable Mention
Textlets: Supporting Constraints and Consistency in Text Documents
CHI '20: Proceedings of the 2020 CHI Conference on Human Factors in Computing SystemsPages 1–13https://doi.org/10.1145/3313831.3376804Writing technical documents frequently requires following constraints and consistently using domain-specific terms. We interviewed 12 legal professionals and found that they all use a standard word processor, but must rely on their memory to manage ...
- short-paperAugust 2018
Understanding Documents with Hyperknowledge Specifications
DocEng '18: Proceedings of the ACM Symposium on Document Engineering 2018Article No.: 41, Pages 1–4https://doi.org/10.1145/3209280.3229118Finding concepts considering their meaning and semantic relations in a document corpus is an important and challenging task. In this paper, we present our contributions on how to understand unstructured data present in one or multiple documents. ...
- short-paperAugust 2018
Annotation Data Management with JeDIS
DocEng '18: Proceedings of the ACM Symposium on Document Engineering 2018Article No.: 42, Pages 1–4https://doi.org/10.1145/3209280.3229102This paper introduces the Jena Document Information System (JeDIS). The focus lies on its capability to partition annotation graphs into modules. Annotation modules are defined in terms of types from the annotation schema. Modules allow easy ...
- research-articleFebruary 2017
Lightweight Multilingual Entity Extraction and Linking
WSDM '17: Proceedings of the Tenth ACM International Conference on Web Search and Data MiningPages 365–374https://doi.org/10.1145/3018661.3018724Text analytics systems often rely heavily on detecting and linking entity mentions in documents to knowledge bases for downstream applications such as sentiment analysis, question answering and recommender systems. A major challenge for this task is to ...
- articleJanuary 2017
An algorithm for detection and phase estimation of protective elements periodic lattice on document image
Pattern Recognition and Image Analysis (SPPRIA), Volume 27, Issue 1Pages 53–65https://doi.org/10.1134/S1054661817010023Various periodic security elements, such as holograms, watermarks, and guilloches, are applied to documents in order to protect against counterfeiting. These elements can be detected and used to automatically check the authenticity of a document and to ...
- invited-talkSeptember 2015
Documents as Data, Data as Documents: What we learned about Semi-Structured Information for our Open World of Cloud & Devices
DocEng '15: Proceedings of the 2015 ACM Symposium on Document EngineeringPage 1https://doi.org/10.1145/2682571.2797070Many of us always believed in a unique vision unifying documents and data through semantically-rich semi-structured information. This vision is even more critical today in our open interconnected world of Clouds and Devices.
The last 20 years represents ...
- research-articleSeptember 2015
Towards Mobile OCR: How to Take a Good Picture of a Document Without Sight
DocEng '15: Proceedings of the 2015 ACM Symposium on Document EngineeringPages 75–84https://doi.org/10.1145/2682571.2797066The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be ...
- research-articleSeptember 2015
An Approach to Convert NCL Applications into Stereoscopic 3D
DocEng '15: Proceedings of the 2015 ACM Symposium on Document EngineeringPages 177–186https://doi.org/10.1145/2682571.2797064This paper presents and discusses the internal operation of NCLSC (NCL Stereo Converter): a tool to convert a 2D interactive multimedia application annotated with depth information to a stereoscopic-multimedia application. Stereoscopic-multimedia ...
- research-articleAugust 2013
Multilingual OCR research and applications: an overview
MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCRArticle No.: 1, Pages 1–8https://doi.org/10.1145/2505377.2509977This paper offers an overview of the current approaches to research in the field of off-line multilingual OCR. Typically, off-line OCR systems are designed for a particular script or language. However, the ideal approach to multilingual OCR would likely ...
- research-articleAugust 2013
A robust table registration method for batch table OCR processing
MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCRArticle No.: 12, Pages 1–5https://doi.org/10.1145/2505377.2505383A robust table registration method is proposed in this paper for a better understanding on structured information from scanned table images. Scanned images can be heavily degraded because of scanning effects, binarization or purely document itself. For ...
- research-articleMay 2013
RTS - an integrated analytic solution for managing regulation changes and their impact on business compliance
- Davide Pasetto,
- Hubertus Franke,
- Weihong Qian,
- Zhili Guo,
- Honglei Guo,
- Dongxu Duan,
- Yuan Ni,
- Yingxin Pan,
- Shenghua Bao,
- Feng Cao,
- Zhong Su
CF '13: Proceedings of the ACM International Conference on Computing FrontiersArticle No.: 24, Pages 1–8https://doi.org/10.1145/2482767.2482798Governance, Risk Management and Compliance are key success factors for corporations. Every company worldwide must ensure a proper compliance level with current and future laws and regulations, but managing the dynamic nature of the regulatory ...
- short-paperSeptember 2012
A software tool that helps teachers in handling, processing and understanding the results of massive exams
BCI '12: Proceedings of the Fifth Balkan Conference in InformaticsPages 259–262https://doi.org/10.1145/2371316.2371370During the last decade, at the University of Belgrade, School of Electrical Engineering, various tools have been developed and used for automation of preparation, grading and results processing of programming exams. Those exams consist of multiple-...
- ArticleSeptember 2012
Improving Requirements Quality in Digital Libraries: The Case of Scientific Proceedings
QUATIC '12: Proceedings of the 2012 Eighth International Conference on the Quality of Information and Communications TechnologyPages 211–216https://doi.org/10.1109/QUATIC.2012.82Proceedings of technical events, postgraduate theses, and technical reports in many different areas of knowledge witness the history of the development of that area; The experience gained in the development of digital libraries may be used in the ...
- ArticleJune 2012
Arabic bank check analysis and zone extraction
ICIAR'12: Proceedings of the 9th international conference on Image Analysis and Recognition - Volume Part IPages 141–148https://doi.org/10.1007/978-3-642-31295-3_17Bank check processing is an important application of document analysis as checks are one of the most widespread documents. In this paper we propose an efficient and effective top-down Arabic check analysis and zone extraction technique for courtesy ...
- ArticleMarch 2012
A Strategy for Automatically Extracting References from PDF Documents
DAS '12: Proceedings of the 2012 10th IAPR International Workshop on Document Analysis SystemsPages 435–439https://doi.org/10.1109/DAS.2012.12Every day the number of citations an author receives is becoming more important than the size of his list of publications. The automatic extraction of bibliographic references in scientific articles is still a difficult problem in Document Engineering, ...