[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1572741.1572763acmconferencesArticle/Chapter ViewAbstractPublication PagesegConference Proceedingsconference-collections
research-article

Tools for the efficient generation of hand-drawn corpora based on context-free grammars

Published: 01 August 2009 Publication History

Abstract

In sketch recognition systems, ground-truth data sets serve to both train and test recognition algorithms. Unfortunately, generating data sets that are sufficiently large and varied is frequently a costly and time-consuming endeavour. In this paper, we present a novel technique for creating a large and varied ground-truthed corpus for hand drawn math recognition. Candidate math expressions for the corpus are generated via random walks through a context-free grammar, the expressions are transcribed by human writers, and an algorithm automatically generates ground-truth data for individual symbols and inter-symbol relationships within the math expressions. While the techniques we develop in this paper are illustrated through the creation of a ground-truthed corpus of mathematical expressions, they are applicable to any sketching domain that can be described by a formal grammar.

References

[1]
{BA69} Blackwell F. W., Anderson R. H.: An on-line symbolic mathematics system using hand-printed two-dimensional notation. In Proceedings of the 1969 24th national conference (New York, NY, USA, 1969), ACM, pp. 551--557.
[2]
{BCZ02} Blostein D., Cordy J. R., Zanibbi R.: Applying compiler techniques to diagram recognition. In ICPR '02: Proceedings of the 16th International Conference on Pattern Recognition (ICPR'02) Volume 3 (Washington, DC, USA, 2002), IEEE Computer Society, pp. 127--130.
[3]
{BSB08} Beusekom J. v., Shafait F., Breuel T. M.: Automated ocr ground truth generation. In Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on (Sept. 2008), pp. 111--117.
[4]
{HBAT07} Heroux P., Barbu E., Adam S., Trupin E.: Automatic ground-truth generation for document image analysis and understanding. In Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on (Sept. 2007), vol. 1, pp. 476--480.
[5]
{JZ04} Jr. J. J. L., Zeleznik R. C.: Mathpad2: a system for the creation and exploration of mathematical sketches. ACM Transactions on Graphics (Proceedings of SIGGRAPH 2004) 23, 3 (2004), 432--440.
[6]
{KBNJ06} Kumar A., Balasubramanian A., Namboodiri A., Jawahar C.: Model-Based Annotation of Online Handwritten Datasets. In Tenth International Workshop on Frontiers in Handwriting Recognition (Oct. 2006), Guy Lorette, (Ed.), Université de Rennes 1, Suvisoft.
[7]
{LaV06} LaViola Jr. J. J.: An initial evaluation of a pen-based tool for creating dynamic mathematical illustrations. In Third Eurographics Workshop on Sketch-Based Interfaces and Modeling (SBIM) (New York, NY, USA, 2006), ACM, pp. 157--164.
[8]
{LC82} Levy H. M., Clark D. W.: On the use of benchmarks for measuring system performance. SIGARCH Comput. Archit. News 10, 6 (1982), 5--8.
[9]
{LLM*08a} Labahn G., Lank E., MacLean S., Marzouk M., Tausky D.: Mathbrush: A system for doing math on pen-based devices. The Eighth IAPR Workshop on Document Analysis Systems (DAS) (Sep 16--19 2008).
[10]
{LLM*08b} Labahn G., Lank E., Marzouk M., Bunt A., MacLean S., Tausky D.: Mathbrush: A case study for interactive pen-based mathematics. Fifth Eurographics Workshop on Sketch-Based Interfaces and Modeling (SBIM) (June 11--13 2008).
[11]
{Mac09} MacLean S.: Parsing handwritten mathematics. Master's thesis, David R. Cheriton School of Computer Science, University of Waterloo, 2009.
[12]
{OP00} Okun O., Pietikainen M.: Automatic ground-truth generation for skew-tolerance evaluation of document layout analysis methods. In Pattern Recognition, 2000. Proceedings. 15th International Conference on (2000), vol. 4, pp. 376--379 vol. 4.
[13]
{SNA99} Smithies S., Novins K., Arvo J.: A handwriting-based equation editor. In Graphics Interface (1999), pp. 84--91.

Cited By

View all
  • (2016)A Rapid Prototyping Approach to Synthetic Data Generation for Improved 2D Gesture RecognitionProceedings of the 29th Annual Symposium on User Interface Software and Technology10.1145/2984511.2984525(873-885)Online publication date: 16-Oct-2016
  • (2013)A new approach for recognizing handwritten mathematics using relational grammars and fuzzy setsInternational Journal on Document Analysis and Recognition10.1007/s10032-012-0184-x16:2(139-163)Online publication date: 1-Jun-2013
  • (2012)Automated labeling of ink stroke dataProceedings of the International Symposium on Sketch-Based Interfaces and Modeling10.5555/2331067.2331078(67-75)Online publication date: 4-Jun-2012
  • Show More Cited By

Index Terms

  1. Tools for the efficient generation of hand-drawn corpora based on context-free grammars

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SBIM '09: Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
    August 2009
    168 pages
    ISBN:9781605586021
    DOI:10.1145/1572741
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 August 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Conference

    SBIM '09
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 20 of 36 submissions, 56%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)A Rapid Prototyping Approach to Synthetic Data Generation for Improved 2D Gesture RecognitionProceedings of the 29th Annual Symposium on User Interface Software and Technology10.1145/2984511.2984525(873-885)Online publication date: 16-Oct-2016
    • (2013)A new approach for recognizing handwritten mathematics using relational grammars and fuzzy setsInternational Journal on Document Analysis and Recognition10.1007/s10032-012-0184-x16:2(139-163)Online publication date: 1-Jun-2013
    • (2012)Automated labeling of ink stroke dataProceedings of the International Symposium on Sketch-Based Interfaces and Modeling10.5555/2331067.2331078(67-75)Online publication date: 4-Jun-2012
    • (2011)Is the iPad useful for sketch input?Proceedings of the Eighth Eurographics Symposium on Sketch-Based Interfaces and Modeling10.1145/2021164.2021166(7-14)Online publication date: 5-Aug-2011
    • (2011)Analyzing sketch content using in-air packet informationProceedings of the 16th international conference on Intelligent user interfaces10.1145/1943403.1943476(403-406)Online publication date: 13-Feb-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media