Abstract
Data storage, especially big data storage, is a research hot spot in Internet of Things (IoT) system today. In traditional data storage methods, the fault-tolerant algorithm for data copies is adjusted with whole data file, which causes huge redundancy because there are less utilization and more cost of data storage when only a part of data blocks in the file are accessed. Therefore, an optimized cluster storage method for big data in IoT is proposed in this paper. First, weights of data blocks in each historical accessing period are calculated by temporal locality of data access, and the access frequencies of the data block in next period are predicted by the weights. Second, the hot spot of a data block is determined with a threshold which is calculated by previous data access. Meantime, in order to improve the data access efficiency and resource utilization, as well as to reduce the copy storage costs, copy of data block is dynamically adjusted and stored in different groups with high-performance and low-load nodes for data balance. Finally, experimental results show that the storage cost of proposed method is 70% less than that of traditional methods, which means that the proposed method effectively improves the data access speed, reduces storage space, and balances the storage load.
Similar content being viewed by others
References
Chen B, Ting KM, Washio T et al (2015) Half-space mass: a maximally robust and efficient data depth method. Mach Learn 100(2–3):677–699. https://doi.org/10.1007/s10994-015-5524-x
Cherubini G, Jelitto J, Venkatesan V (2016) Cognitive storage for big data. Computer 49(4):43–51. https://doi.org/10.1109/MC.2016.117
Cody BM, Bau D, Gonzalez-Nicolas A (2015) Stochastic injection-strategy optimization for the preliminary assessment of candidate geological storage sites. Hydrogeol J 23(6):1229–1245. https://doi.org/10.1007/s10040-015-1250-5
El-Rabiaey MA, Areed NFF, Obayya SSA (2016) Novel plasmonic data storage based on nematic liquid crystal layers. J Lightwave Technol 34(16):3726–3732. https://doi.org/10.1109/JLT.2016.2582838
Gaocheng L, Shuai L, Khan M et al (2018) Object tracking in vary lighting conditions for fog based intelligent surveillance of public spaces. IEEE Access 6:29283–29296
Hong W, Yang Y, Li Y et al (2015) Systematic optimization of long gradient chromatography mass spectrometry for deep analysis of brain proteome. J Proteome Res 14(2):829–838. https://doi.org/10.1021/pr500882h
Ishii T, Shimada K, Hoshizawa T et al (2015) Analysis of vibration effects on holographic data storage system. Jpn J Appl Phys 54(9S):09MA04. https://doi.org/10.7567/jjap.54.09ma04
Kaneko K, Kawamoto Y, Nishiyama H et al (2015) An efficient utilization of intermittent surface–satellite optical links by using mass storage device embedded in satellites. Perform Eval 87(C):37–46. https://doi.org/10.1016/j.peva.2015.01.004
Liu S, Fu W, He L et al (2017) Distribution of primary additional errors in fractal encoding method. Multimed Tools Appl 76(4):5787–5802
Liu S, Bai W, Liu C et al (2018) Parallel fractal compression method for big video data. Complexity 2018:2016976. https://doi.org/10.1155/2018/2016976
Shuai L, Xiaochun C, Caihe L et al (2013) Fractal property of generalized M-set with rational number exponent. Appl Math Comput 220:668–675
Shuai L, Xiaochun C, Weina F et al (2014) Numeric characteristics of generalized M-set with its asymptote. Appl Math Comput 243:767–774
Wang J, Huang CG, Liu SK et al (2016) Research of distributed storage method for agricultural scientific data. Comput Eng Appl 52(11):248–253
Wijetunge CD, Saeed I, Boughton BA et al (2015) Exims an improved data analysis pipeline based on a new peak picking method for exploring imaging mass spectrometry data. Bioinformatics 31(19):3198–3206. https://doi.org/10.1093/bioinformatics/btv356
Xu Z, Schrama E, Wal WVD (2015) Optimization of regional constraints for estimating the Greenland mass balance with grace level-2 data. Geophys J Int 202(1):381–393. https://doi.org/10.1093/gji/ggv146
Yang G, Liu S (2014) Distributed cooperative algorithm for k − M set with negative integer k by fractal symmetrical property. Int J Distrib Sensor Netw 12:3. https://doi.org/10.1155/2014/398583
Yang CT, Shih WC, Huang CL et al (2016) On construction of a distributed data storage system in cloud. Computing 98(1–2):93–118. https://doi.org/10.1007/s00607-014-0399-4
Zhang Y, Bhamber R, Riba-Garcia I et al (2015) Streaming visualisation of quantitative mass spectrometry data based on a novel raw signal decomposition method. Proteomics 15(8):1419–1427. https://doi.org/10.1002/pmic.201400428
Zhao D, Katsouras I, Asadi K et al (2016) Retention of intermediate polarization states in ferroelectric materials enabling memories for multi-bit data storage. Appl Phys Lett 108(23):1040–1090. https://doi.org/10.1063/1.4953199
Zheng P, Shuai L, Weina F (2017) A review of visual moving target tracking. Multimed Tools Appl 76(16):16989–17018
Zheng P, Shuai L, Arun KS et al (2018) Visual attention feature (VAF): a novel strategy for visual tracking based on cloud platform in intelligent surveillance systems. J Parallel Distrib Comput 120:182–194
Acknowledgements
This work is supported by Programs of National Natural Science Foundation of China (No: 61502254), Program for Young Talents of Science and Technology in Universities of Inner Mongolia Autonomous Region (No. NJYT-18-B10), Teaching Reform Project Fund of University of Electronic Science and Technology of China Zhongshan Institute (No: 418YKQN02), Scientific Research Fund of Guangdong Provincial Education Department under Grant (No: 416YCQ01), Teaching Reform Project Fund of Hunan province under Grant (No: 2016[400]), Scientific Research Fund of Hunan Provincial Education Department under Grant Nos: 16C0299, 17C0295), Natural Science Foundation of Hunan Province, China (No. 2018JJ2023). We want to thank Prof Zhang from University of Leicester for his careful checking of language to this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tu, L., Liu, S., Wang, Y. et al. An optimized cluster storage method for real-time big data in Internet of Things. J Supercomput 76, 5175–5191 (2020). https://doi.org/10.1007/s11227-019-02773-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-019-02773-1