[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3103010.3121042acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
short-paper

High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR

Published: 31 August 2017 Publication History

Abstract

This paper describes the results of an investigation into methods of preprocessing architectural plots to enable them to be processed very quickly via OCR, detecting the region containing the relevant metadata legend and obtaining it in machine-readable form for e.g. automated folding and filenaming applications. We show how a processing pipeline adapted to this type of content can vastly decrease processing time, maintaining acceptable accuracy. Initial results show a reduction in total processing time from 2--3 minutes to around 15 seconds for most documents encountered, with the folding orientation being correctly detected in 78% of cases and the legend region being completely detected in 60% of cases, high enough for the use-case at hand.

References

[1]
Christian Ah-Soon and Karl Tombre. 1997. Variations on the Analysis of Architectural Drawings ICDAR 1997: Proceedings of the Fourth International Conference on Document Analysis and Recognition.
[2]
S. Ahmed, M. Liwicki, M. Weber, and A. Dengel. 2011. Improved Automatic Analysis of Architectural Floor Plans ICDAR 2011: Proceedings of the 11th International Conference on Document Analysis and Recognition.
[3]
L. A. Fletcher and R. Kasturi. 1988. A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 10, 6 (1988).
[4]
J. Gllavata, R. Ewerth, and B. Freisleben. 2004. Text Detection in Images Based on Unsupervised Classification of High-Frequency Wavelet Coefficients. In ICPR 2004: Proceedings of the 17th International Conference on Pattern Recognition.
[5]
M. Goebel, T. Hassan, E. Oro, and G. Orsi. 2013. ICDAR 2013 Table Competition. In ICDAR 2013: Proceedings of the 12th International Conference on Document Analysis and Recognition.
[6]
R. W. Lienhart and Frank Stuber. 1996. Automatic text recognition in digital videos. In Image and Video Processing IV: SPIE Proceedings 2666.
[7]
G. Nagy, S. Seth, and M. Viswanathan. 1992. A prototype document image analysis system for technical journals. Computer, Vol. 25, 7 (1992).

Index Terms

  1. High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      DocEng '17: Proceedings of the 2017 ACM Symposium on Document Engineering
      August 2017
      242 pages
      ISBN:9781450346894
      DOI:10.1145/3103010
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      In-Cooperation

      • SIGDOC: ACM Special Interest Group on Systems Documentation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 31 August 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. document analysis
      2. image processing
      3. ocr preprocessing

      Qualifiers

      • Short-paper

      Conference

      DocEng '17
      Sponsor:
      DocEng '17: ACM Symposium on Document Engineering 2017
      September 4 - 7, 2017
      Valletta, Malta

      Acceptance Rates

      DocEng '17 Paper Acceptance Rate 13 of 71 submissions, 18%;
      Overall Acceptance Rate 194 of 564 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 106
        Total Downloads
      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 03 Jan 2025

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media