[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/ICDE.2008.4497602guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Automatically Extracting Form Labels

Published: 07 April 2008 Publication History

Abstract

We describe a machine-learning-based approach for extracting attribute labels from Web form interfaces. Having these labels is a requirement for several techniques that attempt to retrieve and integrate data that reside in online databases and that are hidden behind form interfaces, including schema matching and clustering, and hidden-Web crawlers. Whereas previous approaches to this problem have relied on heuristics and manually specified extraction rules, our technique makes use of learning classifiers to identify form labels. Our preliminary experiments show this approach is promising and has high accuracy.

Cited By

View all
  • (2023)Automated Selection of Web Form Text Field Values Based on Bayesian InferencesInternational Journal of Information Retrieval Research10.4018/IJIRR.31839913:1(1-13)Online publication date: 16-Feb-2023
  • (2009)Post processing wrapper generated tables for labeling anonymous datasetsProceedings of the eleventh international workshop on Web information and data management10.1145/1651587.1651602(63-66)Online publication date: 2-Nov-2009
  1. Automatically Extracting Form Labels

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ICDE '08: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
    April 2008
    1628 pages
    ISBN:9781424418367

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 07 April 2008

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Automated Selection of Web Form Text Field Values Based on Bayesian InferencesInternational Journal of Information Retrieval Research10.4018/IJIRR.31839913:1(1-13)Online publication date: 16-Feb-2023
    • (2009)Post processing wrapper generated tables for labeling anonymous datasetsProceedings of the eleventh international workshop on Web information and data management10.1145/1651587.1651602(63-66)Online publication date: 2-Nov-2009

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media