DATeCH 2014: Madrid, Spain
- Apostolos Antonacopoulos, Klaus U. Schulz:
Digital Access to Textual Cultural Heritage 2014, DATeCH 2014, Madrid, Spain, May 19-20, 2014. ACM 2014, ISBN 978-1-4503-2588-2
Document analysis and OCR
- David Hebert, Thomas Palfray, Stéphane Nicolas, Pierrick Tranouez, Thierry Paquet:
Automatic article extraction in old newspapers digitized collections. 3-8
- Christian Clausner, Stefan Pletschacher, Apostolos Antonacopoulos:
Document representation refinement for precise region description. 9-13
- Sajid Saleem, Fabian Hollaus, Robert Sablatnig:
Recognition of degraded ancient characters based on dense SIFT. 15-20
Linguistic processing and encoding
- Florian Fink, Christoph Ringlstetter, Klaus U. Schulz:
Automated assignment of topics to OCRed historical texts. 23-28
- Petar Mitankin, Stefan Gerdjikov, Stoyan Mihov:
An approach to unsupervised historical text normalisation. 29-34
- Arianna Ciula, Øyvind Eide:
Reflections on cultural heritage and digital humanities: modelling in practice and theory. 35-41
Postcorrection
- John Evershed, Kent Fitch:
Correcting noisy OCR: context beats confusion. 45-51
- Günter Mühlberger, Johannes Zelger, David Sagmeister:
User-driven correction of OCR errors: combing crowdsourcing and information retrieval technology. 53-56
- Thorsten Vobl, Annette Gotscharek, Ulrich Reffle, Christoph Ringlstetter, Klaus U. Schulz:
PoCoTo - an open source system for efficient interactive postcorrection of OCRed historical texts. 57-61
Best practices and experiences
- Truyen Van Phan, Masaki Nakagawa:
Construction of a text digitization system for Nom historical documents. 65-70
- Uwe Springmann, Dietmar Najock, Hermann Morgenroth, Helmut Schmid, Annette Gotscharek, Florian Fink:
OCR of historical printings of Latin texts: problems, prospects, progress. 71-75
- Christine Roughan:
Digital editions and diplomatic diagrams. 77-82
- Gregory R. Crane, Bridget Almas, Alison Babeu, Lisa Cerrato, Anna Krohn, Frederik Baumgardt, Monica Berti, Greta Franzini, Simona Stoyanova:
Cataloging for a billion word library of Greek and Latin. 83-88
Enrichment
- Max Hadersbeck, Alois Pichler, Florian Fink, Øyvind Liland Gjesdal:
Wittgenstein's Nachlass: WiTTFind and Wittgenstein advanced search tools (WAST). 91-96
- Beatrice Alex, John Burns:
Estimating and rating the quality of optically character recognised text. 97-102
- Alicia Fornés, Josep Lladós, Joan Mas, Joana Maria Pujades, Anna Cabré:
A bimodal crowdsourcing platform for demographic historical manuscripts. 103-108
Posters
- Joan-Andreu Sánchez, Vicente Bosch, Verónica Romero, Katrien Depuydt, Jesse de Does:
Handwritten text recognition for historical documents in the transcriptorium project. 111-117
- Paula Estrella, Pablo Paliza:
OCR correction of documents generated during Argentina's national reorganization process. 119-123
- Vicente Bosch, Isabel Bordes-Cabrera, Paloma Cuenca Muñoz, Celio Hernández-Tornero, Luis A. Leiva, Moisés Pastor, Verónica Romero, Alejandro H. Toselli, Enrique Vidal:
Computer-assisted transcription of a historical botanical specimen book: organization and process overview. 125-130
- Basilis Gatos, Nikolaos Stamatopoulos, Georgios Louloudis, Stavros J. Perantonis:
H-DocPro: a document image processing platform for historical documents. 131-136
- Dimitrios Kokkinakis, Mats Malm, Jenny Bergenmar, Ann Ighe:
Semantics in storytelling in Swedish fiction. 137-142
- Adam Dudczak, Aleksandra Nowak, Tomasz Parkola:
Creation of custom recognition profiles for historical documents. 143-146
- A. Papandreou, Basilios Gatos, Georgios Louloudis:
An adaptive zoning technique for efficient word retrieval using dynamic time warping. 147-152
- Roberto Therón, Carlos Seguín, Laura de la Cruz, María Vaquero:
Highly interactive and natural user interfaces: enabling visual analysis in historical lexicography. 153-158
- Martin Reynaert:
On OCR ground truths and OCR post-correction gold standards, tools and formats. 159-166
- Bruce Robertson, Christoph Dalitz, Fabian Schmitt:
Automated page layout simplification of Patrologia Graeca. 167-172
- David Hebert, Thomas Palfray, Stéphane Nicolas, Pierrick Tranouez, Thierry Paquet:
PIVAJ: displaying and augmenting digitized newspapers on the web experimental feedback from the "Journal de Rouen" collection. 173-178
- Rafael C. Carrasco:
An open-source OCR evaluation tool. 179-184
- Iuliu Vasile Konya, Stefan Eickeler:
Logical structure recognition for heterogeneous periodical collections. 185-192
- Karel Kucera, Martin Stluka:
Data processing and lemmatization in digitized 19th-century Czech texts. 193-196
- Hervé Déjean:
Using ancestral layout models for document digitization. 197-202