[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2806416.2806642acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Towards Multi-level Provenance Reconstruction of Information Diffusion on Social Media

Published: 17 October 2015 Publication History

Abstract

In order to assess the trustworthiness of information on social media, a consumer needs to understand where this information comes from, and which processes were involved in its creation. The entities, agents and activities involved in the creation of a piece of information are referred to as its provenance, which was standardized by W3C PROV. However, current social media APIs cannot always capture the full lineage of every message, leaving the consumer with incomplete or missing provenance, which is crucial for judging the trust it carries. Therefore in this paper, we propose an approach to reconstruct the provenance of messages on social media on multiple levels. To obtain a fine-grained level of provenance, we use an approach from prior work to reconstruct information cascades with high certainty, and map them to PROV using the PROV-SAID extension for social media. To obtain a coarse-grained level of provenance, we adapt our similarity-based, fuzzy provenance reconstruction approach -- previously applied on news. We illustrate the power of the combination by providing the reconstructed provenance of a limited social media dataset gathered during the 2012 Olympics, for which we were able to reconstruct a significant amount of previously unidentified connections.

References

[1]
M. Al Hasan, S. Salem, and M. J. Zaki. Simclus: an effective algorithm for clustering with a lower bound on similarity. Knowledge and information systems, 28(3):665--685, 2011.
[2]
G. Barbier, Z. Feng, P. Gundecha, and H. Liu. Provenance data in social media. Synthesis Lectures on Data Mining and Knowledge Discovery, 4(1):1--84, 2013.
[3]
T. De Nies, S. Coppens, D. Van Deursen, E. Mannens, and R. Van de Walle. Automatic discovery of high-level provenance using semantic similarity. In Provenance and Annotation of Data and Processes, pages 97--110. Springer, 2012.
[4]
T. De Nies, S. Coppens, R. Verborgh, M. Vander Sande, E. Mannens, R. Van de Walle, D. Michaelides, and L. Moreau. Easy access to provenance: an essential step towards trust on the web. In COMPSACW, 2013.
[5]
A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle. RML: a generic language for integrated RDF mappings of heterogeneous data. In Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), Seoul, Korea, 2014.
[6]
Z. Feng, P. Gundecha, and H. Liu. Recovering information recipients in social media via provenance. In ASONAM, pages 706--711, 2013.
[7]
A. Guille, H. Hacid, C. Favre, and D. A. Zighed. Information diffusion in online social networks: A survey. ACM SIGMOD Record, 42(2):17--28, 2013.
[8]
P. Gundecha, Z. Feng, and H. Liu. Seeking provenance of information using social media. In CIKM, 2013.
[9]
P. Gundecha, S. Ranganath, Z. Feng, and H. Liu. A tool for collecting provenance data in social media. In KDD, pages 1462--1465, 2013.
[10]
J. Leskovec, L. A. Adamic, and B. A. Huberman. The dynamics of viral marketing. ACM TWEB, 1(1):5, 2007.
[11]
J. Leskovec, L. Backstrom, and J. Kleinberg. Meme-tracking and the dynamics of the news cycle. In KDD, pages 497--506, 2009.
[12]
L. Moreau, P. Missier (Eds.), and W3C Provenance Working Group. PROV-DM: The PROV Data Model. W3C, 2013.
[13]
M. P. Simmons, L. A. Adamic, and E. Adar. Memes Online: Extracted, subtracted, injected, and recollected. ICWSM, 11:17--21, 2011.
[14]
I. Taxidou, T. De Nies, R. Verborgh, P. M. Fischer, E. Mannens, and R. Van de Walle. Modeling information diffusion in social media as provenance with W3C PROV. In Proceedings of the 24th International Conference on WWW Companion, pages 819--824, 2015.
[15]
I. Taxidou and P. M. Fischer. Online analysis of information diffusion in twitter. In Proceedings of the 23rd International Conference on WWW Companion, pages 1313--1318, 2014.

Cited By

View all
  • (2024)Post-hoc Evaluation of Nodes Influence in Information Cascades: The Case of Coordinated AccountsACM Transactions on the Web10.1145/3700644Online publication date: 17-Oct-2024
  • (2022)Social data provenance framework based on zero-information loss graph databaseSocial Network Analysis and Mining10.1007/s13278-022-00889-612:1Online publication date: 3-Jul-2022
  • (2021)Provenance Framework for Twitter Data using Zero-Information Loss Graph DatabaseProceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)10.1145/3430984.3431014(74-82)Online publication date: 2-Jan-2021
  • Show More Cited By

Index Terms

  1. Towards Multi-level Provenance Reconstruction of Information Diffusion on Social Media

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
    October 2015
    1998 pages
    ISBN:9781450337946
    DOI:10.1145/2806416
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clustering
    2. information diffusion
    3. provenance
    4. reconstruction
    5. social media

    Qualifiers

    • Short-paper

    Conference

    CIKM'15
    Sponsor:

    Acceptance Rates

    CIKM '15 Paper Acceptance Rate 165 of 646 submissions, 26%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 14 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Post-hoc Evaluation of Nodes Influence in Information Cascades: The Case of Coordinated AccountsACM Transactions on the Web10.1145/3700644Online publication date: 17-Oct-2024
    • (2022)Social data provenance framework based on zero-information loss graph databaseSocial Network Analysis and Mining10.1007/s13278-022-00889-612:1Online publication date: 3-Jul-2022
    • (2021)Provenance Framework for Twitter Data using Zero-Information Loss Graph DatabaseProceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)10.1145/3430984.3431014(74-82)Online publication date: 2-Jan-2021
    • (2021)Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) DatabaseInternational Journal of Data Science and Analytics10.1007/s41060-021-00287-914:1(65-87)Online publication date: 9-Nov-2021
    • (2020)Interaction Strength Analysis to Model Retweet Cascade GraphsApplied Sciences10.3390/app1023839410:23(8394)Online publication date: 25-Nov-2020
    • (2020)Prediction of Social Influence for Provenance of Misinformation in Online Social Network Using Big Data ApproachThe Computer Journal10.1093/comjnl/bxaa13264:3(391-407)Online publication date: 22-Oct-2020
    • (2019)Provenance of Explicit and Implicit Interactions on Social Media with W3C PROV-DMBehavioral Analytics in Social and Ubiquitous Environments10.1007/978-3-030-34407-8_7(126-150)Online publication date: 18-Nov-2019
    • (2018)A Templating System to Generate ProvenanceIEEE Transactions on Software Engineering10.1109/TSE.2017.265974544:2(103-121)Online publication date: 1-Feb-2018
    • (2018)Web-scale provenance reconstruction of implicit information diffusion on social mediaDistributed and Parallel Databases10.1007/s10619-017-7211-336:1(47-79)Online publication date: 1-Mar-2018
    • (2017)On quantifying diffusion of health information on Twitter2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)10.1109/BHI.2017.7897311(485-488)Online publication date: 2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media