Article

Information Extraction Using Web Usage Mining, Web Scrapping and Semantic Annotation

Authors:

Sanjay Kumar Malik,

SAM RizviAuthors Info & Claims

CICN '11: Proceedings of the 2011 International Conference on Computational Intelligence and Communication Networks

Pages 465 - 469

https://doi.org/10.1109/CICN.2011.97

Published: 07 October 2011 Publication History

Abstract

Extracting useful information from the web is the most significant issue of concern for the realization of semantic web. This may be achieved by several ways among which Web Usage Mining, Web Scrapping and Semantic Annotation plays an important role. Web mining enables to find out the relevant results from the web and is used to extract meaningful information from the discovery patterns kept back in the servers. Web usage mining is a type of web mining which mines the information of access routes/manners of users visiting the web sites. Web scraping, another technique, is a process of extracting useful information from HTML pages which may be implemented using a scripting language known as Prolog Server Pages(PSP) based on Prolog. Third, Semantic annotation is a technique which makes it possible to add semantics and a formal structure to unstructured textual documents, an important aspect in semantic information extraction which may be performed by a tool known as KIM(Knowledge Information Management). In this paper, we revisit, explore and discuss some information extraction techniques on web like web usage mining, web scrapping and semantic annotation for a better or efficient information extraction on the web illustrated with examples.

Cited By

View all

Leithner MSimos D(2021)CHIEvACM SIGAPP Applied Computing Review10.1145/3477133.347713421:1(5-23)Online publication date: 20-Jul-2021
https://dl.acm.org/doi/10.1145/3477133.3477134
Leithner MSimos DHung CCerny TShin DBechini A(2020)XIEvProceedings of the 35th Annual ACM Symposium on Applied Computing10.1145/3341105.3373885(2201-2210)Online publication date: 30-Mar-2020
https://dl.acm.org/doi/10.1145/3341105.3373885
Vyas JHan MLo DKim DGamess E(2019)Understanding the Mobile Game App ActivityProceedings of the 2019 ACM Southeast Conference10.1145/3299815.3314460(206-209)Online publication date: 18-Apr-2019
https://dl.acm.org/doi/10.1145/3299815.3314460
Show More Cited By

Information Extraction Using Web Usage Mining, Web Scrapping and Semantic Annotation
1. Information systems
  1. Information systems applications

Recommendations

Ontology and Web Usage Mining towards an Intelligent Web Focusing Web Logs
CICN '10: Proceedings of the 2010 International Conference on Computational Intelligence and Communication Networks

Today, Internet is a huge database which comprises of a large number of Web sites, search engines and other information. Due to the unstructured and semi structured data in the web pages, it is a challenging task for researchers to make a relevant and ...
Semantic Web Mining

Semantic Web Mining aims at combining the two fast-developing research areas Semantic Web and Web Mining. This survey analyzes the convergence of trends from both areas: More and more researchers are working on improving the results of Web Mining by ...
Interpretable Mining of Influential Patterns from Sparse Web
WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

Big data are everywhere. World Wide Web is an example of these big data. It has become a vast data production and consumption platform, at which threads of data evolve from multiple devices, by different human interactions, over worldwide locations, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

CICN '11: Proceedings of the 2011 International Conference on Computational Intelligence and Communication Networks

October 2011

771 pages

ISBN:9780769545875

Publisher

IEEE Computer Society

United States

Publication History

Published: 07 October 2011

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Leithner MSimos D(2021)CHIEvACM SIGAPP Applied Computing Review10.1145/3477133.347713421:1(5-23)Online publication date: 20-Jul-2021
https://dl.acm.org/doi/10.1145/3477133.3477134
Leithner MSimos DHung CCerny TShin DBechini A(2020)XIEvProceedings of the 35th Annual ACM Symposium on Applied Computing10.1145/3341105.3373885(2201-2210)Online publication date: 30-Mar-2020
https://dl.acm.org/doi/10.1145/3341105.3373885
Vyas JHan MLo DKim DGamess E(2019)Understanding the Mobile Game App ActivityProceedings of the 2019 ACM Southeast Conference10.1145/3299815.3314460(206-209)Online publication date: 18-Apr-2019
https://dl.acm.org/doi/10.1145/3299815.3314460
Alghamdi HSelamat AAbdul Karim N(2014)Arabic web pages clustering and annotation using semantic class featuresJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2014.06.00226:4(388-397)Online publication date: 1-Dec-2014
https://dl.acm.org/doi/10.1016/j.jksuci.2014.06.002

Abstract

Cited By

Recommendations

Ontology and Web Usage Mining towards an Intelligent Web Focusing Web Logs

Semantic Web Mining

Interpretable Mining of Influential Patterns from Sparse Web

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations