DOI: 10.1109/CICN.2011.97
Article

Information Extraction Using Web Usage Mining, Web Scrapping and Semantic Annotation

Published: 07 October 2011

Abstract

Extracting useful information from the web is the most significant concern for the realization of the semantic web. Several techniques can contribute to this, among which web usage mining, web scraping and semantic annotation play an important role. Web mining finds relevant results on the web and extracts meaningful information from the patterns discovered in data stored on servers. Web usage mining is a type of web mining that mines the access routes and behaviour of users visiting web sites. Web scraping is the process of extracting useful information from HTML pages; it may be implemented using Prolog Server Pages (PSP), a scripting language based on Prolog. Semantic annotation adds semantics and a formal structure to unstructured textual documents, an important aspect of semantic information extraction; it may be performed by a tool known as KIM (Knowledge Information Management). In this paper, we revisit, explore and discuss these information extraction techniques (web usage mining, web scraping and semantic annotation) for more efficient information extraction on the web, illustrated with examples.
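
To make the three techniques named in the abstract concrete, here is a minimal sketch in Python. It is not the paper's method: the paper refers to Prolog Server Pages and KIM, whereas this uses only the Python standard library as a stand-in. The log format (Common Log Format), the sample data, the term-to-class lookup table and all function names (`access_paths`, `scrape_links`, `annotate`) are assumptions made for illustration.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser

# Assumed input: Common Log Format lines,
#   host ident user [time] "METHOD path protocol" status size
LOG_LINE = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]+\] "(?:GET|POST) (\S+) [^"]*" (\d{3}) \S+$'
)


def access_paths(log_lines):
    """Web usage mining (toy): list successfully requested pages per client host."""
    paths = defaultdict(list)
    for line in log_lines:
        match = LOG_LINE.match(line.strip())
        # Keep only 2xx responses; failed requests are not part of the visit path.
        if match and match.group(3).startswith("2"):
            paths[match.group(1)].append(match.group(2))
    return dict(paths)


class LinkScraper(HTMLParser):
    """Web scraping (toy): collect (href, anchor text) pairs from an HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only record text that appears inside an <a href=...> element.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def scrape_links(html):
    scraper = LinkScraper()
    scraper.feed(html)
    scraper.close()
    return scraper.links


def annotate(text, gazetteer):
    """Semantic annotation (toy): tag known terms with a class label.

    `gazetteer` is a hypothetical term -> class lookup, not KIM's ontology.
    Returns sorted (start, end, term, label) tuples.
    """
    annotations = []
    for term, label in gazetteer.items():
        for m in re.finditer(r"\b" + re.escape(term) + r"\b", text):
            annotations.append((m.start(), m.end(), term, label))
    return sorted(annotations)


SAMPLE_LOG = [
    '10.0.0.1 - - [07/Oct/2011:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 512',
    '10.0.0.1 - - [07/Oct/2011:10:00:05 +0000] "GET /papers.html HTTP/1.1" 200 2048',
    '10.0.0.2 - - [07/Oct/2011:10:01:00 +0000] "GET /missing.html HTTP/1.1" 404 0',
]
SAMPLE_HTML = '<p><a href="/a.html">First</a> and <a href="/b.html">Second <b>page</b></a></p>'

if __name__ == "__main__":
    print(access_paths(SAMPLE_LOG))
    print(scrape_links(SAMPLE_HTML))
    print(annotate("Alice visited Paris.", {"Alice": "Person", "Paris": "Location"}))
```

Each function covers only the simplest form of its technique: a real usage-mining pipeline would also split requests into sessions, and a real annotator would resolve terms against an ontology rather than a flat lookup table.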

    Published In

    CICN '11: Proceedings of the 2011 International Conference on Computational Intelligence and Communication Networks
    October 2011
    771 pages
    ISBN: 9780769545875

    Publisher

    IEEE Computer Society

    United States

    Author Tags

    1. KIM
    2. Prolog
    3. Prolog Server Pages
    4. Semantic Web
    5. Text Grepping
    6. Web Log Analyzer
    7. Web Mining
    8. Web Scrapping
    9. Web Usage Mining
    10. knowledge management
    11. semantic annotation

    Cited By

    • (2021) CHIEv. ACM SIGAPP Applied Computing Review 21:1 (5-23). DOI: 10.1145/3477133.3477134. Online publication date: 20-Jul-2021
    • (2020) XIEv. Proceedings of the 35th Annual ACM Symposium on Applied Computing (2201-2210). DOI: 10.1145/3341105.3373885. Online publication date: 30-Mar-2020
    • (2019) Understanding the Mobile Game App Activity. Proceedings of the 2019 ACM Southeast Conference (206-209). DOI: 10.1145/3299815.3314460. Online publication date: 18-Apr-2019
    • (2014) Arabic web pages clustering and annotation using semantic class features. Journal of King Saud University - Computer and Information Sciences 26:4 (388-397). DOI: 10.1016/j.jksuci.2014.06.002. Online publication date: 1-Dec-2014
