[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1807128.1807157acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

G-Store: a scalable data store for transactional multi key access in the cloud

Published: 10 June 2010 Publication History

Abstract

Cloud computing has emerged as a preferred platform for deploying scalable web-applications. With the growing scale of these applications and the data associated with them, scalable data management systems form a crucial part of the cloud infrastructure. Key-Value stores -- such as Bigtable, PNUTS, Dynamo, and their open source analogues-- have been the preferred data stores for applications in the cloud. In these systems, data is represented as Key-Value pairs, and atomic access is provided only at the granularity of single keys. While these properties work well for current applications, they are insufficient for the next generation web applications -- such as online gaming, social networks, collaborative editing, and many more -- which emphasize collaboration. Since collaboration by definition requires consistent access to groups of keys, scalable and consistent multi key access is critical for such applications. We propose the Key Group abstraction that defines a relationship between a group of keys and is the granule for on-demand transactional access. This abstraction allows the Key Grouping protocol to collocate control for the keys in the group to allow efficient access to the group of keys. Using the Key Grouping protocol, we design and implement G-Store which uses a key-value store as an underlying substrate to provide efficient, scalable, and transactional multi key access. Our implementation using a standard key-value store and experiments using a cluster of commodity machines show that G-Store preserves the desired properties of key-value stores, while providing multi key access functionality at a very low overhead.

References

[1]
D. Agrawal, A. El Abbadi, S. Antony, and S. Das. Data Management Challenges in Cloud Computing Infrastructures. In DNIS, 2010.
[2]
M. K. Aguilera, A. Merchant, M. Shah, A. Veitch, and C. Karamanolis. Sinfonia: a new paradigm for building scalable distributed systems. In SOSP, pages 159--174, 2007.
[3]
S. Amer-Yahia, V. Markl, A. Halevy, A. Doan, G. Alonso, D. Kossmann, and G. Weikum. Databases and Web 2.0 panel at VLDB 2007. SIGMOD Rec., 37(1):49--52, 2008.
[4]
M. Armbrust, A. Fox, D. A. Patterson, N. Lanham, B. Trushkowsky, J. Trutna, and H. Oh. SCADS: Scale Independent Storage for Social Computing Applications. In CIDR Perspectives, 2009.
[5]
H. Berenson, P. Bernstein, J. Gray, J. Melton, E. O'Neil, and P. O'Neil. A critique of ANSI SQL isolation levels. In SIGMOD, pages 1--10, 1995.
[6]
P. A. Bernstein, V. Hadzilacos, and N. Goodman. Concurrency Control and Recovery in Database Systems. Addison Wesley, Reading, Massachusetts, 1987.
[7]
M. Brantner, D. Florescu, D. Graf, D. Kossmann, and T. Kraska. Building a database on S3. In SIGMOD, pages 251--264, 2008.
[8]
F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A Distributed Storage System for Structured Data. In OSDI, pages 205--218, 2006.
[9]
B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H.-A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. PNUTS: Yahoo!'s hosted data serving platform. Proc. VLDB Endow., 1(2):1277--1288, 2008.
[10]
S. Das, S. Agarwal, D. Agrawal, and A. El Abbadi. ElasTraS: An Elastic, Scalable, and Self Managing Transactional Database for the Cloud. Technical Report 2010-04, CS, UCSB, 2010. http://www.cs.ucsb.edu/research/tech_reports/.
[11]
S. Das, D. Agrawal, and A. El Abbadi. ElasTraS: An Elastic Transactional Data Store in the Cloud. In USENIX HotCloud, 2009.
[12]
G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's highly available key-value store. In SOSP, pages 205--220, 2007.
[13]
D. J. DeWitt, R. H. Gerber, G. Graefe, M. L. Heytens, K. B. Kumar, and M. Muralikrishna. GAMMA - A High Performance Dataflow Database Machine. In VLDB, pages 228--237, 1986.
[14]
K. P. Eswaran, J. N. Gray, R. A. Lorie, and I. L. Traiger. The notions of consistency and predicate locks in a database system. Commun. ACM, 19(11):624--633, 1976.
[15]
S. Fushimi, M. Kitsuregawa, and H. Tanaka. An Overview of The System Software of A Parallel Relational Database Machine GRACE. In VLDB, pages 209--219, 1986.
[16]
J. Gray. Notes on data base operating systems. In Operating Systems, An Advanced Course, pages 393--481, London, UK, 1978. Springer-Verlag.
[17]
HBase: Bigtable-like structured storage for Hadoop HDFS, 2009. http://hadoop.apache.org/hbase/.
[18]
HDFS: A distributed file system that provides high throughput access to application data, 2009. http://hadoop.apache.org/hdfs/.
[19]
P. Helland. Life beyond Distributed Transactions: An Apostate's Opinion. In CIDR, pages 132--141, 2007.
[20]
A. Hirsch. Cool Facebook Application Game - Scrabulous - Facebook's Scrabble. http://www.makeuseof.com/tag/best-facebookapplication-game-scrabulous-facebooks-scrabble/, 2007.
[21]
R. Kallman, H. Kimura, J. Natkins, A. Pavlo, A. Rasin, S. B. Zdonik, E. P. C. Jones, S. Madden, M. Stonebraker, Y. Zhang, J. Hugg, and D. J. Abadi. H-store: a high-performance, distributed main memory transaction processing system. PVLDB, 1(2):1496--1499, 2008.
[22]
J. Karlsson and P. Zeyliger. Megastore - Scalable Data System for User-facing Apps (Invited Talk). In SIGMOD, 2008.
[23]
T. Kraska, M. Hentschel, G. Alonso, and D. Kossmann. Consistency Rationing in the Cloud: Pay only when it matters. PVLDB, 2(1):253--264, 2009.
[24]
H. T. Kung and J. T. Robinson. On optimistic methods for concurrency control. ACM Trans. Database Syst., 6(2):213--226, 1981.
[25]
B. G. Lindsay, L. M. Haas, C. Mohan, P. F. Wilms, and R. A. Yost. Computation and communication in R*: A distributed database manager. ACM Trans. Comput. Syst., 2(1):24--38, 1984.
[26]
D. B. Lomet, A. Fekete, G. Weikum, and M. J. Zwilling. Unbundling transaction services in the cloud. In CIDR Perspectives, 2009.
[27]
D. B. Lomet and M. F. Mokbel. Locking Key Ranges with Unbundled Transaction Services. PVLDB, 2(1):265--276, 2009.
[28]
D. Obasanjo. When databases lie: Consistency vs. availability in distributed systems. http://www.25hoursaday.com/weblog/2007/10/10/WhenDatabasesLieConsistencyVsAvailabilityInDistributedSystems.aspx, 2009.
[29]
R. Rawson and J. Gray. HBase at Hadoop World NYC. http://www.docstoc.com/docs/12426408/HBase-at-Hadoop-World-NYC/, 2009.
[30]
J. B. Rothnie Jr., P. A. Bernstein, S. Fox, N. Goodman, M. Hammer, T. A. Landers, C. L. Reeve, D. W. Shipman, and E. Wong. Introduction to a System for Distributed Databases (SDD-1). ACM Trans. Database Syst., 5(1):1--17, 1980.
[31]
T. von Eicken. Righscale Blog: Animoto's Facebook Scale-up. http://blog.rightscale.com/2008/04/23/animoto-facebook-scale-up/, April 2008.
[32]
G. Weikum and G. Vossen. Transactional information systems: theory, algorithms, and the practice of concurrency control and recovery. Morgan Kaufmann Publishers Inc., 2001.
[33]
F. Yang, J. Shanmugasundaram, and R. Yerneni. A scalable data platform for a large number of small applications. In CIDR, 2009.

Cited By

View all
  • (2024)Reducing Write Tail Latency of Distributed Key-Value Stores Using In-Network Chasing2024 IEEE/ACM 32nd International Symposium on Quality of Service (IWQoS)10.1109/IWQoS61813.2024.10682891(1-10)Online publication date: 19-Jun-2024
  • (2024)Die NoSQL-Toolbox: Die NoSQL-Landschaft im ÜberblickSchnelles und skalierbares Cloud-Datenmanagement10.1007/978-3-031-54388-3_8(189-205)Online publication date: 3-May-2024
  • (2024)Transaktionale Semantik für global verteilte AnwendungenSchnelles und skalierbares Cloud-Datenmanagement10.1007/978-3-031-54388-3_6(141-159)Online publication date: 3-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SoCC '10: Proceedings of the 1st ACM symposium on Cloud computing
June 2010
264 pages
ISBN:9781450300360
DOI:10.1145/1807128
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud computing
  2. consistency
  3. key-value stores
  4. multi key access

Qualifiers

  • Research-article

Conference

SOCC '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)34
  • Downloads (Last 6 weeks)2
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Reducing Write Tail Latency of Distributed Key-Value Stores Using In-Network Chasing2024 IEEE/ACM 32nd International Symposium on Quality of Service (IWQoS)10.1109/IWQoS61813.2024.10682891(1-10)Online publication date: 19-Jun-2024
  • (2024)Die NoSQL-Toolbox: Die NoSQL-Landschaft im ÜberblickSchnelles und skalierbares Cloud-Datenmanagement10.1007/978-3-031-54388-3_8(189-205)Online publication date: 3-May-2024
  • (2024)Transaktionale Semantik für global verteilte AnwendungenSchnelles und skalierbares Cloud-Datenmanagement10.1007/978-3-031-54388-3_6(141-159)Online publication date: 3-May-2024
  • (2024)EinführungSchnelles und skalierbares Cloud-Datenmanagement10.1007/978-3-031-54388-3_1(1-12)Online publication date: 3-May-2024
  • (2023)Detock: High Performance Multi-region Transactions at ScaleProceedings of the ACM on Management of Data10.1145/35892931:2(1-27)Online publication date: 20-Jun-2023
  • (2023)Towards a Unified Storage Scheme for Dual Data Models of Knowledge GraphsWeb and Big Data. APWeb-WAIM 2022 International Workshops10.1007/978-981-99-1354-1_4(34-44)Online publication date: 30-Mar-2023
  • (2023)Analysis of an Epoch Commit Protocol for Distributed Processing SystemsQuantitative Evaluation of Systems10.1007/978-3-031-43835-6_18(255-269)Online publication date: 15-Sep-2023
  • (2022)Enabling the Next Generation of Multi-Region Applications with CockroachDBProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526053(2312-2325)Online publication date: 10-Jun-2022
  • (2022)A Performance Study of Epoch-based Commit Protocols in Distributed OLTP Databases2022 41st International Symposium on Reliable Distributed Systems (SRDS)10.1109/SRDS55811.2022.00026(189-200)Online publication date: Sep-2022
  • (2022)FPIRPQ: Accelerating regular path queries on knowledge graphsWorld Wide Web10.1007/s11280-022-01103-526:2(661-681)Online publication date: 7-Oct-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media