[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3589610.3596274acmconferencesArticle/Chapter ViewAbstractPublication PagescpsweekConference Proceedingsconference-collections
research-article

Rep-RAID: An Integrated Approach to Optimizing Data Replication and Garbage Collection in RAID-Enabled SSDs

Published: 13 June 2023 Publication History

Abstract

Redundant Array of Independent Disks (RAID) technology has been recently introduced to flash memory based SSDs to enhance their data reliability. Although RAID increases reliability, it doubles the number of write operations and requires additional parity computation as every write operation on a data chunk leads to another update on the corresponding parity chunk. Data replication has been proposed to mitigate the overhead of write requests in RAID enabled SSDs, however, replication increases the cost of garbage collection (GC), which in turn limits the improvement of I/O performance compared to the baseline RAID implementation. This paper introduces Rep-RAID, an improved data replication management scheme accompanied with optimized GC for RAID-enabled SSDs. Guided by a mathematical model, Rep-RAID only replicates frequently updated data chunks. Furthermore, Rep-RAID reorganizes new data stripes during the GC process by utilizing replicated data to replace invalid data chunks caused by data replication in old stripes. As a result, it decreases I/O latency for both read and write requests and significantly reduces the GC overhead induced by data movement. Experimental results show that the proposed scheme can improve I/O performance by 16.7%, and reduce tail latency by up to 17.9% at the 99.99th percentile, when compared to the state-of-the-art RAID-enabled SSDs.

References

[1]
Bryan S. Kim, Jongmoo Choi and Sang Lyul Min. Design tradeofs for SSD reliability. In USENIX Conference on File and Storage Technologies (FAST), 2019 : 281-294.
[2]
Jaeho Kim, Kwanghyun Lim, Youngdon Jung, Sungjin Lee, Changwoo Min and Sam H. Noh. Alleviating garbage collection interference through spatial separation in all flash arrays. In USENIX Annual Technical Conference (ATC), 2019 : 799-812.
[3]
Mahesh Balakrishnan, Asim Kadav, Vijayan Prabhakaran, and Dahlia Malkhi. Diferential raid: Rethinking raid for ssd reliability. ACM Transactions on Storage, Vol. 6 ( 2 ): 1-22, 2010. https://doi.org/10.1145/1807060.1807061
[4]
Yu Cai, Saugata Ghose, Erich F. Haratsch, Yixin Luo, and Onur Mutlu. Error characterization, mitigation, and recovery in flash-memorybased solid-state drives. Proceedings of the IEEE, Vol. 105 ( 9 ): 1666-1704, 2017. https://doi.org/10.1109/JPROC. 2017.2713127
[5]
Bianca Schroeder, Raghav Lagisetty, and Arif Merchant. 2016. Flash reliability in production: The expected and the unexpected. In USENIX Conference on File and Storage Technologies (FAST). 67-80.
[6]
NAND Flash Media Management Through RAIN. Retrieved from https://www.micron.com/-/media/client/global/documents/products/technical-marketingbrief/brief_ssd_rain.pdf
[7]
P320h 2. 5-Inch PCIe NAND SSD Features. Retrieved from https://www.micron.com/-/media/client/global/documents/products/datasheet/ssd/p320h_2_5.pdf
[8]
Huawei Tecal ES3000. Retrieved from https://www.storagereview.com/review/huawei-tecal-es3000-application-accelerator-review
[9]
Soojun Im, and Dongkun Shin. Improving SSD reliability with RAID via elastic striping and anywhere parity. In Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2013. https://doi.org/10.1109/DSN. 2013. 6575359.
[10]
Jaeho Kim, Eunjae Lee, Jongmoo Choi, Donghee Lee, and Sam H. Noh. Chip-level raid with flexible stripe size and parity placement for enhanced ssd reliability. IEEE Transactions on Computers, Vol. 65 ( 4 ): 1116-1130, 2016. https://doi.org/10.1109/TC. 2014.2375179
[11]
Jun Li, Zhibing Sha, Zhigang Cai, François Trahay, and Jianwei Liao. Patch-Based Data Management for Dual-Copy Bufers in RAIDEnabled SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 11, pp. 3956-3967, 2020. https://doi.org/10.1109/TCAD. 2020.3012252
[12]
Zhibing Sha, Jun Li, Zhigang Cai, Min Huang, Jianwei Liao, and Francois Trahay. Degraded Mode-benefited I/O Scheduling to Ensure I/O Responsiveness in RAID-enabled SSDs. ACM Transactions on Design Automation of Electronic Systems, 2022. https://doi.org/10.1145/3522755
[13]
Jiguang Wan, Wei Wu, Ling Zhan, Qing Yang, Xiaoyang Qu, and Changsheng Xie. DEFT-Cache: A cost-efective and highly reliable SSD cache for RAID storage. In International Parallel and Distributed Processing Symposium (IPDPS), pp. 102-111, 2017. https://doi.org/10.1109/IPDPS. 2017.54
[14]
Yongkun Li, Biaobiao Shen, Yubiao Pan, Yinlong Xu, Zhipeng Li, and John C. S. Lui. Workload-aware elastic striping with hot data identiifcation for SSD RAID arrays. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 36 ( 5 ): 815-828, 2016. https://doi.org/10.1109/TCAD. 2016.2604292
[15]
Yangsup Lee, Sanghyuk Jung, and Yong Ho Song. FRA: a lfash-aware redundancy array of flash storage devices. In IEEE/ACM international conference on Hardware/software codesign and system synthesis (CODES+ISSS), pp. 163-172, 2009. https://doi.org/10.1145/1629435.1629459
[16]
Soojun Im and Dongkun Shin. Delayed partial parity scheme for reliable and high-performance flash memory SSD. In IEEE Symposium on Mass Storage Systems and Technologies (MSST), pp. 1-6, 2010. https://doi.org/10.1109/MSST. 2010.5496997
[17]
Yubiao Pan, Yongkun Li, Yinlong Xu, and Zhipeng Li. Groupingbased elastic striping with hotness awareness for improving SSD raid performance. In Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), pp. 160-171, 2015. https://doi.org/10.1109/DSN. 2015.51
[18]
Youmin Chen, Youyou Lu, Pei Chen, and Jiwu Shu. Eficient and Consistent NVMM Cache for SSD-Based File System. IEEE Transactions on Computers, Vol. 68 ( 8 ): 1147-1158, 2018. https://doi.org/10.1109/TC. 2018.2870137
[19]
Suzhen Wu, Weiwei Zhang, Bo Mao, and Hong Jiang. HotR: Alleviating Read/Write Interference with HotRead Data Replication for Flash Storage. In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2019 : 1367-1372. https://doi.org/10.23919/DATE. 2019.8715100
[20]
Shiqin Yan, Huaicheng Li, Mingzhe Hao, Michael Hao Tong, Swaminathan Sundararaman, Andrew A. Chien, and Haryadi S. Gunawi. Near-perfect elimination of garbage collection tail latencies in NAND SSDs. In USENIX Conference on File and Storage Technologies (FAST), pp. 15-28, 2017.
[21]
Myoungsoo Jung, Ramya Prabhakar, and Mahmut Taylan Kandemir. Taking garbage collection overheads of the critical path in SSDs. In International Conference on Distributed Systems Platforms and Open Distributed Processing, 2012 : 164-186.
[22]
Wonkyung Kang, Dongkun Shin, and Sungjoo Yoo. Reinforcement Learning-Assisted Garbage Collection to Mitigate Long-Tail Latency in SSD. In ACM Transactions on Embedded Computing Systems, 2017, 16 (5s): 1-20. https://doi.org/10.1145/3126537
[23]
Wonil Choi, Myoungsoo Jung, Mahmut Kandemir, and Chita Das. 2018. Parallelizing garbage collection with I/O to improve flash resource utilization. In International Symposium on High-Performance Parallel and Distributed Computing (HPDC). https://doi.org/10.1145/3208040.3208048
[24]
Tianyang Jiang, Guangyan Zhang, Zican Huang, Xiaosong Ma, Junyu Wei, Zhiyue Li, and Weimin Zheng. FusionRAID: Achieving Consistent Low Latency for Commodity SSD Arrays. In USENIX Conference on File and Storage Technologies (FAST), 2021 : 355-370.
[25]
Wen Pan and Tao Xie. A mirroring-assisted channel-RAID5 SSD for mobile applications. ACM Transactions on Embedded Computing Systems, 2018, 17 ( 4 ): 1-27. https://doi.org/10.1145/3209625
[26]
Wenhui Zhang, Qiang Cao, Hong Jiang, and Jie Yao. Improving overall performance of TLC SSD by exploiting dissimilarity of flash pages. IEEE Transactions on Parallel and Distributed Systems, 2019, 31 ( 2 ): 332-346. https://doi.org/10.1109/TPDS. 2019.2934458
[27]
Congming Gao, Liang Shi, Kai Liu, Chun Jason Xue, Jun Yang, and Youtao Zhang. Boosting the performance of SSDs via fully exploiting the plane level parallelism. IEEE Transactions on Parallel and Distributed Systems, 2020, 31 ( 9 ): 2185-2200. https://doi.org/10.1109/TPDS. 2020.2987894
[28]
Wonkyung Kang and Yoo Sungjoo. Dynamic management of key states for reinforcement learning-assisted garbage collection to reduce long tail latency in SSD. In Proceedings of the 55th Annual Design Automation Conference (DAC), pp. 1-6, 2018. https://doi.org/10.1145/3195970.3196034
[29]
Zhibing Sha, Jun Li, Lihao Song, Jiewen Tang, Min Huang, Zhigang Cai, Lianju Qian, Jianwei Liao, and Zhiming Liu. 2021. Low I/O Intensityaware Partial GC Scheduling to Reduce Long-tail Latency in SSDs. ACM Transactions on Architecture and Code Optimization, 18, 4, Article 46 ( December 2021 ), 25 pages. https://doi.org/10.1145/3460433
[30]
Sergey Hardock, Petrov Ilia, and Gottstein Robert et al. From in-place updates to in-place appends: Revisiting out-of-place updates on flash. In proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD), pp. 1571-1586, 2017. https://doi.org/10.1145/3035918.3035958
[31]
Soojun Im and Dongkun Shin. Delayed partial parity scheme for reliable and high-performance flash memory SSD. In IEEE Symposium on Mass Storage Systems and Technologies (MSST), 2010 : 1-6. https://doi.org/10.1109/MSST. 2010.5496997
[32]
Soojun Im and Dongkun Shin. Flash-aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Transactions on Computers, 2010, 60 ( 1 ): 80-92. https://doi.org/10.1109/TC. 2010.197
[33]
Suzhen Wu, Weidong Zhu, Guixin Liu, Hong Jiang, and Bo Mao. GC-aware request steering with improved performance and reliability for SSD-based RAIDs. In IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2018 : 296-305. https://doi.org/10.1109/IPDPS. 2018.00039
[34]
Chenlei Tang, Jiguang Wan, Yifeng Zhu, Zhiyuan Liu, Peng Xu, Fei Wu, and Changsheng Xie. RAFS: A RAID-Aware File System to Reduce the Parity Update Overhead for SSD RAID. In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2019 : 1373-1378. https://doi.org/10.23919/DATE. 2019.8714938
[35]
Youngjae Kim, Sarp Oral, Galen M. Shipman, Junghee Lee, David A. Dillow, and Feiyi Wang. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. IEEE Symposium on Mass Storage Systems and Technologies (MSST). IEEE, 2011 : 1-12. https://doi.org/10.1109/MSST. 2011.5937224
[36]
Jun Li, Xiaofei Xu, Xiaoning Peng, and Jianwei Liao. Pattern-based write scheduling and read balance-oriented wear-leveling for solid state drivers. In IEEE Symposium on Mass Storage Systems and Technologies (MSST), 2019 : 126-133. https://doi.org/10.1109/MSST. 2019. 00-10
[37]
Xiaofei Xu, Zhigang Cai, Jianwei Liao, and Yutaka Ishiakwa. Frequent access pattern-based prefetching inside of solid-state drives. In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2020 : 720-725. https://doi.org/10.23919/DATE48585. 2020.9116382
[38]
Congming Gao, Min Ye, Qiao Li, Chun Jason Xue, Youtao Zhang, Liang Shi, and Jun Yang. 2019. Constructing Large, Durable and Fast SSD System via Reprogramming 3D TLC Flash Memory. In Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). Association for Computing Machinery, New York, NY, USA, 493-505. https://doi.org/10.1145/3352460.3358323
[39]
Chun-Yi Liu, Yunju Lee, Myoungsoo Jung, Mahmut Taylan Kandemir, and Wonil Choi. 2021. Prolonging 3D NAND SSD lifetime via read latency relaxation. In ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). Association for Computing Machinery, New York, NY, USA, 730-742. https://doi.org/10.1145/3445814.3446733
[40]
Jinhua Cui, Junwei Liu, Jianhang Huang, and Laurence T. Yang. SmartHeating: On the Performance and Lifetime Improvement of Self-Healing SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2020, 40 ( 1 ): 52-65. https://doi.org/0.1109/TCAD. 2020.2990896
[41]
Wenhui Zhang, Qiang Cao, Hong Jiang, and Jie Yao. 2018. PA-SSD: A Page-Type Aware TLC SSD for Improved Write/Read Performance and Storage Eficiency. In International Conference on Supercomputing (ICS). Association for Computing Machinery, New York, NY, USA, 22-32, 2018. https://doi.org/10.1145/3205289.3205319
[42]
Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, and Chao Ren. Exploring and exploiting the multilevel parallelism inside SSDs for improved performance and endurance. IEEE Transactions on Computers, 2013, 62 ( 6 ): 1141-1155. https://doi.org/10.1109/TC. 2012.60
[43]
Dushyanth Narayanan, Eno Thereska, Austin Donnelly, Sameh Elnikety, and Antony Rowstron. 2009. Migrating server storage to SSDs: analysis of tradeofs. In European conference on Computer systems(EuroSys). Association for Computing Machinery, New York, NY, USA, 145-158. https://doi.org/10.1145/1519065.1519081

Index Terms

  1. Rep-RAID: An Integrated Approach to Optimizing Data Replication and Garbage Collection in RAID-Enabled SSDs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    LCTES 2023: Proceedings of the 24th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems
    June 2023
    147 pages
    ISBN:9798400701740
    DOI:10.1145/3589610
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 June 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Garbage Collection
    2. RAID-5
    3. Replication
    4. SSDs
    5. Stripe Reorganization

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    LCTES '23

    Acceptance Rates

    Overall Acceptance Rate 116 of 438 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 210
      Total Downloads
    • Downloads (Last 12 months)90
    • Downloads (Last 6 weeks)9
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media