DOI: 10.1109/ICDE.2009.146
Article

Social Streams Blog Crawler

Published: 29 March 2009

Abstract

Weblogs and other forms of social media differ from traditional web content in many ways. One of the most important differences is the highly temporal nature of the content. To be effective, applications that leverage social media content must have access to this data with minimal latency between publication and acquisition. An effective weblog crawler should satisfy the following requirements: low latency, high scalability, high data quality, and appropriate network politeness. In this paper, we outline the weblog crawler implemented in the Social Streams project and summarize the challenges faced during development.
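
The abstract does not describe how the crawler enforces network politeness. As a purely illustrative sketch, and not the paper's implementation, one common approach is to enforce a minimum delay between successive requests to the same host. The class name `PolitenessScheduler`, its methods, and the 2-second default delay are all assumptions made for this example.

```python
from urllib.parse import urlparse


class PolitenessScheduler:
    """Tracks, per host, the earliest time the next request may be sent.

    Hypothetical helper for illustration; not taken from the paper.
    """

    def __init__(self, min_delay_seconds: float = 2.0):
        # The default delay is an assumed value, not one reported in the paper.
        self.min_delay_seconds = min_delay_seconds
        self._next_allowed = {}  # host -> earliest permitted fetch time

    def wait_time(self, url: str, now: float) -> float:
        """Return the seconds to wait before fetching url (0.0 if none)."""
        host = urlparse(url).netloc
        return max(0.0, self._next_allowed.get(host, 0.0) - now)

    def record_fetch(self, url: str, now: float) -> None:
        """Record a fetch at time now, pushing back the host's next slot."""
        host = urlparse(url).netloc
        self._next_allowed[host] = now + self.min_delay_seconds
```

A caller would pass the current time (for example from `time.monotonic()`), sleep for `wait_time(...)` seconds, fetch the page, and then call `record_fetch(...)`. Passing the clock in explicitly keeps the sketch easy to test; a low-latency crawler would also need to weigh this delay against how quickly it wants to pick up new posts.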

    Published In

    ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
    March 2009
    1772 pages
    ISBN: 9780769535456

    Publisher

    IEEE Computer Society

    United States

    Author Tags

    1. blogs
    2. crawling
    3. social media
    4. web
    5. weblogs

    Qualifiers

    • Article

    Cited By

    • (2013) Towards social data platform. Proceedings of the VLDB Endowment 6:14 (1966-1977). DOI: 10.14778/2556549.2556577. Online publication date: 1-Sep-2013
    • (2013) RetriBlog. Expert Systems with Applications: An International Journal 40:4 (1177-1195). DOI: 10.1016/j.eswa.2012.08.020. Online publication date: 1-Mar-2013
    • (2012) A framework for building web mining applications in the world of blogs. Expert Systems with Applications: An International Journal 39:5 (4813-4834). DOI: 10.1016/j.eswa.2011.09.135. Online publication date: 1-Apr-2012
    • (2009) Click-through prediction for news queries. Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (347-354). DOI: 10.1145/1571941.1572002. Online publication date: 19-Jul-2009
