[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2463676.2465296acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

LinkBench: a database benchmark based on the Facebook social graph

Published: 22 June 2013 Publication History

Abstract

Database benchmarks are an important tool for database researchers and practitioners that ease the process of making informed comparisons between different database hardware, software and configurations. Large scale web services such as social networks are a major and growing database application area, but currently there are few benchmarks that accurately model web service workloads.
In this paper we present a new synthetic benchmark called LinkBench. LinkBench is based on traces from production databases that store "social graph" data at Facebook, a major social network. We characterize the data and query workload in many dimensions, and use the insights gained to construct a realistic synthetic benchmark. LinkBench provides a realistic and challenging test for persistent storage of social and web service data, filling a gap in the available tools for researchers, developers and administrators.

References

[1]
Oracle Corporation, "MySQL 5.6 reference manual," 2012, http://dev.mysql.com/doc/refman/5.6/en/.
[2]
B. Atikoglu, Y. Xu, E. Frachtenberg, S. Jiang, and M. Paleczny, "Workload analysis of a large-scale key-value store," in Proc. SIGMETRICS'12, 2012.
[3]
Facebook, Inc., "TAO: Facebook's distributed data store for the social graph," 2012, draft in preparation.
[4]
The Apache Software Foundation, "Apache HBase," 2012, http://hbase.apache.org.
[5]
D. Borthakur, J. Gray, J. S. Sarma, K. Muthukkaruppan, N. Spiegelberg, H. Kuang, K. Ranganathan, D. Molkov, A. Menon, S. Rash, R. Schmidt, and A. Aiyer, "Apache Hadoop goes realtime at Facebook," in Proc. SIGMOD'11, 2011.
[6]
B. Debnath, S. Sengupta, and J. Li, "FlashStore: high throughput persistent key-value store," Proc. VLDB Endow., vol. 3, no. 1-2, Sep. 2010.
[7]
H. Lim, B. Fan, D. G. Andersen, and M. Kaminsky, "SILT: a memory-efficient, high-performance key-value store," in Proc. SOSP'11, 2011.
[8]
WiredTiger, Inc., "WiredTiger reference guide 1.3.4," 2012, http://source.wiredtiger.com/1.3.4/.
[9]
TokuTek Inc., "TokuDB," 2012, http://www.tokutek.com/.
[10]
M. Seltzer, D. Krinsky, K. Smith, and X. Zhang, "The case for application-specific benchmarking," in Proc. HotOS '99, 1999.
[11]
Y. C. Tay, "Data generation for application-specific benchmarking," in Proc. VLDB 4, 2011.
[12]
Transaction Processing Performance Council, "TPC benchmark C standard spec. 5.11," Feb 2010, http://www.tpc.org/tpcc/spec/tpc-c_v5-11.pdf.
[13]
----, "TPC benchmark H (decision support) standard spec. 2.14.4," Apr 2012, http://www.tpc.org/tpch/spec/tpch2.14.4.pdf.
[14]
D. Dominguez-Sal, P. Urbón-Bayes, A. Giménez-Vañó, S. Gómez-Villamor, N. Martínez-Bazán, and J. Larriba-Pey, "Survey of graph database performance on the HPC Scalable Graph Analysis Benchmark," in Web-Age Information Management, 2010, vol. 6185.
[15]
D. Dominguez-Sal, N. Martinez-Bazan, V. Muntes-Mulero, P. Baleta, and J. Larriba-Pey, "A discussion on the design of graph database benchmarks," in Performance Evaluation, Measurement and Characterization of Complex Systems, 2011, vol. 6417.
[16]
M. J. Carey, D. J. DeWitt, C. Kant, and J. F. Naughton, "A status report on the OO7 OODBMS benchmarking effort," in Proc. OOPSLA'94, 1994.
[17]
B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, "Benchmarking cloud serving systems with YCSB," in Proc. SoCC'10, 2010.
[18]
S. Barahmand and S. Ghandeharizadeh, "BG: A benchmark to evaluate interactive social networking actions," in Proc. CIDR'13, 2013.
[19]
B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H.-A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni, "PNUTS: Yahoo!'s hosted data serving platform," Proc. VLDB Endow., vol. 1, no. 2, Aug. 2008.
[20]
A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee, "Measurement and analysis of online social networks," in Proc. IMC'07, 2007.
[21]
J. Ugander, B. Karrer, L. Backstrom, and C. Marlow, "The anatomy of the Facebook social graph," CoRR, vol. abs/1111.4503, 2011.
[22]
D. J. Watts and S. H. Strogatz, "Collective dynamics of 'small-world' networks." Nature, vol. 393, no. 6684, Jun. 1998.
[23]
M. Girvan and M. Newman, "Community structure in social and biological networks," Proc. Nat'l Acad. Sciences, vol. 99, no. 12, 2002.
[24]
M. Newman, "Power laws, Pareto distributions and Zipf's law," Contemporary physics, vol. 46, no. 5, 2005.
[25]
J. Gray, P. Sundaresan, S. Englert, K. Baclawski, and P. J. Weinberger, "Quickly generating billion-record synthetic databases," SIGMOD Rec., vol. 23, no. 2, 1994.

Cited By

View all
  • (2024)SymbiosisProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650701(51-70)Online publication date: 27-Feb-2024
  • (2024)Galaxybase: A High Performance Native Distributed Graph Database for HTAPProceedings of the VLDB Endowment10.14778/3685800.368581417:12(3893-3905)Online publication date: 8-Nov-2024
  • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
  • Show More Cited By

Index Terms

  1. LinkBench: a database benchmark based on the Facebook social graph

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
    June 2013
    1322 pages
    ISBN:9781450320375
    DOI:10.1145/2463676
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. database benchmarks
    2. database workload analysis
    3. hbase
    4. mysql
    5. social networks

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS'13
    Sponsor:

    Acceptance Rates

    SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)137
    • Downloads (Last 6 weeks)10
    Reflects downloads up to 31 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)SymbiosisProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650701(51-70)Online publication date: 27-Feb-2024
    • (2024)Galaxybase: A High Performance Native Distributed Graph Database for HTAPProceedings of the VLDB Endowment10.14778/3685800.368581417:12(3893-3905)Online publication date: 8-Nov-2024
    • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
    • (2024)Memento Filter: A Fast, Dynamic, and Robust Range FilterProceedings of the ACM on Management of Data10.1145/36988202:6(1-27)Online publication date: 20-Dec-2024
    • (2024)CAMAL: Optimizing LSM-trees via Active LearningProceedings of the ACM on Management of Data10.1145/36771382:4(1-26)Online publication date: 30-Sep-2024
    • (2024)A Unified Graph Framework for Storage-Compute Coupled Cluster and High-Density Computing ClusterProceedings of the International Workshop on Big Data in Emergent Distributed Environments10.1145/3663741.3664790(1-6)Online publication date: 9-Jun-2024
    • (2024)KVBench: A Key-Value Benchmarking SuiteProceedings of the Tenth International Workshop on Testing Database Systems10.1145/3662165.3662765(9-15)Online publication date: 9-Jun-2024
    • (2024)Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in a Colossal Configuration SpaceProceedings of the ACM on Management of Data10.1145/36549782:3(1-26)Online publication date: 30-May-2024
    • (2024)Limousine: Blending Learned and Classical Indexes to Self-Design Larger-than-Memory Cloud Storage EnginesProceedings of the ACM on Management of Data10.1145/36393022:1(1-28)Online publication date: 26-Mar-2024
    • (2024)Spruce: a Fast yet Space-saving Structure for Dynamic Graph StorageProceedings of the ACM on Management of Data10.1145/36392822:1(1-26)Online publication date: 26-Mar-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media