DOI: 10.1145/2517349.2522730
research-article
Open access

SPANStore: cost-effective geo-replicated storage spanning multiple cloud services

Published: 03 November 2013

Abstract

By offering storage services in several geographically distributed data centers, cloud computing platforms enable applications to offer low latency access to user data. However, application developers are left to deal with the complexities associated with choosing the storage services at which any object is replicated and maintaining consistency across these replicas.
In this paper, we present SPANStore, a key-value store that exports a unified view of storage services in geographically distributed data centers. To minimize an application provider's cost, we combine three key principles. First, SPANStore spans multiple cloud providers to increase the geographical density of data centers and to minimize cost by exploiting pricing discrepancies across providers. Second, by estimating application workload at the right granularity, SPANStore judiciously trades off greater geo-distributed replication necessary to satisfy latency goals with the higher storage and data propagation costs this entails in order to satisfy fault tolerance and consistency requirements. Finally, SPANStore minimizes the use of compute resources to implement tasks such as two-phase locking and data propagation, which are necessary to offer a global view of the storage services that it builds upon. Our evaluation of SPANStore shows that it can lower costs by over 10x in several scenarios, in comparison with alternative solutions that either use a single storage provider or replicate every object to every data center from which it is accessed.
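
The trade-off between latency goals and storage cost described above can be illustrated with a toy sketch. This is not SPANStore's actual placement algorithm, which the abstract does not detail. It is a brute-force search, under stated assumptions, for the cheapest set of data centers that keeps every accessing location within a latency bound. The function name, data center names, prices, and latencies are all hypothetical, and the sketch ignores request costs, data propagation costs, fault tolerance, and consistency, all of which the abstract says SPANStore accounts for.

```python
from itertools import combinations


def cheapest_replica_set(access_dcs, candidates, latency_ms, price, bound_ms):
    """Return (cost, replicas) for the cheapest set of candidate data centers
    such that every accessing data center has at least one replica within
    bound_ms, or None if no such set exists.

    Hypothetical helper: exhaustive search over all non-empty subsets.
    """
    best = None
    for k in range(1, len(candidates) + 1):
        for replicas in combinations(candidates, k):
            # Every access location needs some replica within the bound.
            covered = all(
                any(latency_ms[(a, r)] <= bound_ms for r in replicas)
                for a in access_dcs
            )
            if not covered:
                continue
            cost = sum(price[r] for r in replicas)
            if best is None or cost < best[0]:
                best = (cost, replicas)
    return best


# Made-up inputs: two providers (A and B), storage price in arbitrary units.
access_dcs = ["us", "eu"]
candidates = ["A-us", "B-us", "A-eu"]
price = {"A-us": 10, "B-us": 6, "A-eu": 12}
latency_ms = {
    ("us", "A-us"): 20, ("us", "B-us"): 30, ("us", "A-eu"): 120,
    ("eu", "A-us"): 110, ("eu", "B-us"): 125, ("eu", "A-eu"): 15,
}

tight = cheapest_replica_set(access_dcs, candidates, latency_ms, price, 50)
loose = cheapest_replica_set(access_dcs, candidates, latency_ms, price, 130)
```

With these made-up numbers, a tight 50 ms bound forces one replica per region, and the search prefers the cheaper provider in the US region. A loose 130 ms bound lets a single cheap replica serve both regions. Exhaustive search is only workable for a handful of data centers; it is used here purely to keep the example short.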

Supplementary Material

MP4 File (d2-06-zhe-wu.mp4)

Published In

SOSP '13: Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
November 2013
498 pages
ISBN:9781450323888
DOI:10.1145/2517349
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Acceptance Rates

Overall Acceptance Rate 174 of 961 submissions, 18%
