[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3220199.3220220acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbdcConference Proceedingsconference-collections
research-article

SAT-based Important Data Reliability Enhancement Model for Big Data Storage

Published: 28 April 2018 Publication History

Abstract

Disk reliability is a serious problem in the big data foundation environment. Although the reliability of disk drives has greatly improved over the past few years, they are still the most vulnerable core components in the server. If they fail, the result can be catastrophic: it can take some days to recover data, sometimes data lost forever. These are unacceptable for some important data. XOR parity is a typical method to generate reliability syndrome, thus improving the reliability of the data. In practice, we find that the data is still likely to be lost. In most storage systems reliability improvements are achieved through the allocation of additional disks in Redundant Arrays of Independent Disks (RAID), which will increase the hardware costs, thus it will be very difficult for cost-constrained environments. Therefore, how to improve the data integrity without raising the hardware cost has aroused much interest of big data researchers. This challenge is when creating non-traditional RAID geometries, care must be taken to respect data dependence relationships to ensure that the new RAID strategy improves reliability, which is a NP-hard problem. In this paper, we present an approach for characterizing these challenges using high-dimension variants of the n-queens problem that enables performable solutions via the SAT solver MiniSAT, and use the greedy algorithm to analyze the queen's attack domain, as a basis for reliability syndrome generation. A large number of experiments show that the approach proposed in this paper is feasible in software-defined data centers and the performance of the algorithm can meet the current requirements of the big data environment.

References

[1]
Rozier E W D, Zhou P, Divine D. Building intelligence for software defined data centers: modeling usage patterns{C} //Proceedings of the 6th International Systems and Storage Conference. ACM, 2013: 20.
[2]
Bayram U, Rozier K Y, Rozier E W D. Characterizing data dependence constraints for dynamic reliability using N-queens attack domains{C}//International Conference on Quantitative Evaluation of Systems. Springer, Cham, 2015: 211--227.
[3]
Bayram U, Divine D, Zhou P, et al. Improving reliability with dynamic syndrome allocation in intelligent software defined data centers{C}//Dependable Systems and Networks (DSN), 2015 45th Annual IEEE/IFIP International Conference on. IEEE, 2015: 219--230.
[4]
Liu X, Fan L, Wang L, et al. Multiobjective reliable cloud storage with its particle swarm optimization algorithm{J}. Mathematical Problems in Engineering, 2016, 2016.
[5]
Chen P M, Lee E K, Gibson G A, et al. RAID: High-performance, reliable secondary storage{J}. ACM Computing Surveys (CSUR), 1994, 26(2): 145--185.
[6]
Corbett P, English B, Goel A, et al. Row-diagonal parity for double disk failure correction{C}//Proceedings of the 3rd USENIX Conference on File and Storage Technologies. USENIX Association Berkeley, CA, USA, 2004: 1--14.
[7]
Li T, Mehta A, Yang P. Security Analysis of Email systems{C} //Cyber Security and Cloud Computing (CSCloud), 2017 IEEE 4th International Conference on. IEEE, 2017: 91--96.
[8]
Schroeder B, Gibson G A. Disk failures in the real world: What does an mttf of 1, 000, 000 hours mean to you?{C} //FAST. 2007, 7(1): 1--16.
[9]
Sathiamoorthy M, Asteris M, Papailiopoulos D, et al. Xoring elephants: Novel erasure codes for big data{C}//Proceedings of the VLDB Endowment. VLDB Endowment, 2013, 6(5): 325--336.
[10]
Zwolenski M, Weatherill L. The digital universe: Rich data and the increasing value of the internet of things{J}. Australian Journal of Telecommunications and the Digital Economy, 2014, 2(3): 47.
[11]
Utard G, Vernois A. Data durability in peer to peer storage systems{C}//Cluster Computing and the Grid, 2004. CCGrid 2004. IEEE International Symposium on. IEEE, 2004: 90--97.
[12]
Rozier E W D, Zhou P, Divine D. Building intelligence for software defined data centers: modeling usage patterns{C}//Proceedings of the 6th International Systems and Storage Conference. ACM, 2013: 20.
[13]
Bayram U, Divine D, Zhou P, et al. Improving reliability with dynamic syndrome allocation in intelligent software defined data centers{C}//Dependable Systems and Networks (DSN), 2015 45th Annual IEEE/IFIP International Conference on. IEEE, 2015: 219--230.
[14]
McCarty C P. Queen squares{J}. The American Mathematical Monthly, 1978, 85(7): 578--580.
[15]
Bell J, Stevens B. A survey of known results and research areas for n-queens{J}. Discrete Mathematics, 2009, 309(1): 1--31.
[16]
Wu X, Xu Y, Yuen C, et al. A tag encoding scheme against pollution attack to linear network coding{J}. IEEE Transactions on Parallel and Distributed Systems, 2014, 25(1): 33--42.
[17]
Gong W, Zhou X. A survey of SAT solver{C}//AIP Conference Proceedings. AIP Publishing, 2017, 1836(1): 020059.
[18]
Schwarz S J T, Long D D E, Paris J F. Reliability of disk arrays with double parity{C}//Dependable Computing (PRDC), 2013 IEEE 19th Pacific Rim International Symposium on. IEEE, 2013: 108--117.
[19]
Zhu Y, Lee P P C, Hu Y, et al. On the speedup of single-disk failure recovery in xor-coded storage systems: Theory and practice{C}//Mass Storage Systems and Technologies (MSST), 2012 IEEE 28th Symposium on. IEEE, 2012: 1--12.
[20]
Huang C, Li J, Chen M. On optimizing XOR-based codes for fault-tolerant storage applications{C}//Information Theory Workshop, 2007. ITW'07. IEEE. IEEE, 2007: 218--223.
[21]
Keedwell A D, Dénes J. Latin squares and their applications{M}. Elsevier, 2015.
[22]
Pâris J F, Long D D E, Litwin W. Three-dimensional redundancy codes for archival storage{C}//Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on. IEEE, 2013: 328--332.
[23]
Xiang L, Xu Y, Lui J, et al. A hybrid approach to failed disk recovery using RAID-6 codes: Algorithms and performance evaluation{J}. ACM Transactions on Storage (TOS), 2011, 7(3): 11.

Index Terms

  1. SAT-based Important Data Reliability Enhancement Model for Big Data Storage

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICBDC '18: Proceedings of the 3rd International Conference on Big Data and Computing
    April 2018
    155 pages
    ISBN:9781450364263
    DOI:10.1145/3220199
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    • Shenzhen University: Shenzhen University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 April 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Boolean Satisfiability Problem
    2. NP-hard
    3. big data
    4. data reliability
    5. n-queens

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICBDC '18

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 77
      Total Downloads
    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 09 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media