[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2783258.2783277acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

On the Discovery of Evolving Truth

Published: 10 August 2015 Publication History

Abstract

In the era of big data, information regarding the same objects can be collected from increasingly more sources. Unfortunately, there usually exist conflicts among the information coming from different sources. To tackle this challenge, truth discovery, i.e., to integrate multi-source noisy information by estimating the reliability of each source, has emerged as a hot topic. In many real world applications, however, the information may come sequentially, and as a consequence, the truth of objects as well as the reliability of sources may be dynamically evolving. Existing truth discovery methods, unfortunately, cannot handle such scenarios. To address this problem, we investigate the temporal relations among both object truths and source reliability, and propose an incremental truth discovery framework that can dynamically update object truths and source weights upon the arrival of new data. Theoretical analysis is provided to show that the proposed method is guaranteed to converge at a fast rate. The experiments on three real world applications and a set of synthetic data demonstrate the advantages of the proposed method over state-of-the-art truth discovery methods.

References

[1]
D. P. Bertsekas. Non-linear Programming. Athena Scientific, 2 edition, 1999.
[2]
C.-F. Chen. On asymptotic normality of limiting density functions with bayesian implications. Journal of the Royal Statistical Society. Series B (Methodological), pages 540--546, 1985.
[3]
X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: The role of source dependence. PVLDB, 2(1):550--561, 2009.
[4]
X. L. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun, and W. Zhang. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In Proc. of KDD, pages 601--610, 2014.
[5]
A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In Proc. of WSDM, pages 131--140, 2010.
[6]
Q. Li, Y. Li, J. Gao, L. Su, B. Zhao, D. Murat, W. Fan, and J. Han. A confidence-aware approach for truth discovery on long-tail data. PVLDB, 8(4):425--436, 2015.
[7]
Q. Li, Y. Li, J. Gao, B. Zhao, W. Fan, and J. Han. Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation. In Proc. of SIGMOD, pages 1187--1198, 2014.
[8]
X. Li, X. L. Dong, K. B. Lyons, W. Meng, and D. Srivastava. Truth finding on the deep web: Is the problem solved? PVLDB, 6(2):97--108, 2012.
[9]
Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery. arXiv preprint arXiv:1505.02463, 2015.
[10]
X. Liu, X. L. Dong, B. C. Ooi, and D. Srivastava. Online data fusion. PVLDB, 4(11):932--943, 2011.
[11]
S. Mukherjee, G. Weikum, and C. Danescu-Niculescu-Mizil. People on drugs: credibility of user statements in health communities. In Proc. of KDD, pages 65--74, 2014.
[12]
A. Pal, V. Rastogi, A. Machanavajjhala, and P. Bohannon. Information integration over time in unreliable and uncertain environments. In Proc. of WWW, pages 789--798, 2012.
[13]
J. Pasternack and D. Roth. Comprehensive trust metrics for information networks. In Army Science Conference, 2010.
[14]
J. Pasternack and D. Roth. Knowing what to believe (when you already know something). In Proc. of COLING, pages 877--885, 2010.
[15]
J. Pasternack and D. Roth. Latent credibility analysis. In Proc. of WWW, pages 1009--1020, 2013.
[16]
R. Pasupathy and S. Kim. The stochastic root-finding problem: Overview, solutions, and open questions. ACM Transactions on Modeling and Computer Simulation (TOMACS), 21(3):19, 2011.
[17]
R. Pochampally, A. D. Sarma, X. L. Dong, A. Meliou, and D. Srivastava. Fusing data with correlations. In Proc. of SIGMOD, pages 433--444, 2014.
[18]
L. Su, Q. Li, S. Hu, S. Wang, J. Gao, H. Liu, T. Abdelzaher, J. Han, X. Liu, Y. Gao, and L. Kaplan. Generalized decision aggregation in distributed sensing systems. In Proc. of RTSS, pages 1--10, 2014.
[19]
D. Wang, L. Kaplan, H. Le, and T. Abdelzaher. On truth discovery in social sensing: A maximum likelihood estimation approach. In Proc. of IPSN, pages 233--244, 2012.
[20]
S. Wang, D. Wang, L. Su, L. Kaplan, and T. Abdelzaher. Towards cyber-physical systems in social spaces: The data reliability challenge. In Proc. of RTSS, pages 74--85, 2014.
[21]
X. Yin, J. Han, and P. S. Yu. Truth discovery with multiple conflicting information providers on the web. In Proc. of KDD, pages 1048--1052, 2007.
[22]
X. Yin, J. Han, and P. S. Yu. Truth discovery with multiple conflicting information providers on the web. IEEE Transactions on Knowledge and Data Engineering, 20(6):796--808, 2008.
[23]
X. Yin and W. Tan. Semi-supervised truth discovery. In Proc. of WWW, pages 217--226, 2011.
[24]
B. Zhao and J. Han. A probabilistic model for estimating real-valued truth from conflicting sources. In Proc. of QDB, 2012.
[25]
B. Zhao, B. I. P. Rubinstein, J. Gemmell, and J. Han. A bayesian approach to discovering truth from conflicting sources for data integration. PVLDB, 5(6):550--561, 2012.
[26]
Z. Zhao, J. Cheng, and W. Ng. Truth discovery in data streams: A single-pass probabilistic approach. In Proc. of CIKM, pages 1589--1598, 2014.

Cited By

View all
  • (2024)High Precision ≠ High Cost: Temporal Data Fusion for Multiple Low-Precision SensorsProceedings of the ACM on Management of Data10.1145/36549462:3(1-27)Online publication date: 30-May-2024
  • (2024)A Novel Truth Discovery Approach for Health Recommendation SystemsIEEE Transactions on Consumer Electronics10.1109/TCE.2023.332286970:1(2435-2446)Online publication date: Feb-2024
  • (2024)ATWR-SMR: An Area-Constrained Truthful-Worker Recruitment-Based Sensing Map Recovery Scheme for Sparse MCS in Extreme-Environment Internet of ThingsIEEE Internet of Things Journal10.1109/JIOT.2023.331461511:3(3711-3724)Online publication date: 1-Feb-2024
  • Show More Cited By

Index Terms

  1. On the Discovery of Evolving Truth

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
    August 2015
    2378 pages
    ISBN:9781450336642
    DOI:10.1145/2783258
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 August 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dynamic data
    2. source reliability
    3. truth discovery

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    KDD '15
    Sponsor:

    Acceptance Rates

    KDD '15 Paper Acceptance Rate 160 of 819 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)40
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 26 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)High Precision ≠ High Cost: Temporal Data Fusion for Multiple Low-Precision SensorsProceedings of the ACM on Management of Data10.1145/36549462:3(1-27)Online publication date: 30-May-2024
    • (2024)A Novel Truth Discovery Approach for Health Recommendation SystemsIEEE Transactions on Consumer Electronics10.1109/TCE.2023.332286970:1(2435-2446)Online publication date: Feb-2024
    • (2024)ATWR-SMR: An Area-Constrained Truthful-Worker Recruitment-Based Sensing Map Recovery Scheme for Sparse MCS in Extreme-Environment Internet of ThingsIEEE Internet of Things Journal10.1109/JIOT.2023.331461511:3(3711-3724)Online publication date: 1-Feb-2024
    • (2024)Blockchain-Based Lightweight and Privacy-Preserving Quality Assurance Framework in Crowdsensing SystemsIEEE Internet of Things Journal10.1109/JIOT.2023.328834911:1(974-986)Online publication date: 1-Jan-2024
    • (2024)Truth Discovery of Source Dependency Perception in Dynamic ScenariosWeb and Big Data10.1007/978-981-97-2387-4_4(48-63)Online publication date: 28-Apr-2024
    • (2024)A Reliable and Privacy-Preserving Truth Discovery Scheme for Mobile Crowdsensing Based on Functional EncryptionFrontiers in Cyber Security10.1007/978-981-96-0151-6_24(365-382)Online publication date: 27-Dec-2024
    • (2023)A Lightweight, Effective, and Efficient Model for Label Aggregation in CrowdsourcingACM Transactions on Knowledge Discovery from Data10.1145/363010218:4(1-27)Online publication date: 26-Oct-2023
    • (2023)Matching Roles from Temporal Data: Why Joe Biden is not only President, but also Commander-in-ChiefProceedings of the ACM on Management of Data10.1145/35889191:1(1-26)Online publication date: 30-May-2023
    • (2023)TIRA: Truth Inference via Reliability Aggregation on Object-Source GraphIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.322530835:11(11967-11981)Online publication date: 1-Nov-2023
    • (2023)Reliable and Streaming Truth Discovery in Blockchain-based Crowdsourcing2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON)10.1109/SECON58729.2023.10287465(492-500)Online publication date: 11-Sep-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media