Abstract
Mathematical models for predicting values in time series are powerfultools for the process of knowledge discovery and decision making in several areas. However, the choice of the predictive model and its configuration are not trivial tasks, requiring a long processing time to obtain the results due to the high complexity of the models and the uncertainty of the value of the best parameters. Calculations performed by these approaches use sampling from the dataset, which can present discrepancies and variations that can directly impact the final result. Therefore, this work presents a new approach based on the SARIMA model for the prediction of values in time series. The proposal aims at predictive calculation from multiple executions of SARIMA in parallel, configured with predefined order and seasonal order parameters and applied to values already known in a time series. Thus, from the results obtained in past observations, it is possible to determine the percentage of precision that each parameter obtained, and, in this way, to determine the parameters that are more likely to obtain more accurate values in future observations, thus, eliminating the need to use specific algorithms to estimate them. The proposed approach is capable of achieving results with greater precision and performance compared to the traditional SARIMA execution, achieving results with greater assertiveness, reaching up to 10.77% of better accuracy and with better processing times, without the need for validation and parameter adjustments required by the settings obtained by functions, such as ACF and PACF.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
https://www.kaggle.com/ramirobentes/flights-in-brazil. Accessed August 6, 2020.
- 2.
https://www.kaggle.com/inquisitivecrow/crime-data-in-brazil. Accessed August 6, 2020.
References
Box, G.E., Jenkins, G.M.: Time Series Analysis: Forecasting and Control Holden-day, p. 498, San Francisco (1970)
Box, G.E., Jenkins, G.M., Reinsel, G.C., Ljung, G.M.: Time Series Analysis: Forecasting and Control. Wiley, Hoboken (2015)
Brockwell, P.J., Davis, R.A., Fienberg, S.E.: Time Series: Theory and methods. Springer, New York (1991). https://doi.org/10.1007/978-1-4419-0320-4
Castellanos, M.: Semantic enrichment of interoperable databases. In: Proceedings RIDE-IMS 1993: Third International Workshop on Research Issues in Data Engineering: Interoperability in Multidatabase Systems, pp. 126–129, April 1993. https://doi.org/10.1109/RIDE.1993.281934
Ding, F., Meng, D., Dai, J., Li, Q., Alsaedi, A., Hayat, T.: Least squares based iterative parameter estimation algorithm for stochastic dynamical systems with ARMA noise using the model equivalence. Int. J. Control Autom. Syst. 16(2), 630–639 (2018)
Ding, F., Meng, D., Wang, Q.: The model equivalence based parameter estimation methods for Box-Jenkins systems. J. Franklin Inst. 352(12), 5473–5485 (2015)
He, T., Zheng, S.: Time series analysis and forcast based on active learning artificial neural network. In: 2009 Second International Symposium on Knowledge Acquisition and Modeling, vol. 1, pp. 84–87, November 2009. https://doi.org/10.1109/KAM.2009.303
Holley, K., Sivakumar, G., Kannan, K.: Enrichment patterns for big data. In: 2014 IEEE International Congress on Big Data, pp. 796–799, June 2014. https://doi.org/10.1109/BigData.Congress.2014.127
HongJu, X., Fei, W., FenMei, W., XiuZhen, W.: Some key problems of data management in army data engineering based on big data. In: 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), pp. 149–152, March 2017. https://doi.org/10.1109/ICBDA.2017.8078796
Hyndman, R.: Better ACF and PACF plots, but no optimal linear prediction. Electron. J. Stat. [E] 8(2), 2296–2300 (2014)
McMurry, T.L., Politis, D.N.: Banded and tapered estimates for autocovariance matrices and the linear process bootstrap. J. Time Ser. Anal. 31(6), 471–482 (2010)
Meerschaert, M.M.: Mathematical Modeling. Elsevier, Amsterdam (2007)
Meng, D., Ding, F.: Model equivalence-based identification algorithm for equation-error systems with colored noise. Algorithms 8(2), 280–291 (2015)
Mohammadi, K., Eslami, H., Kahawita, R.: Parameter estimation of an ARMA model for river flow forecasting using goal programming. J. Hydrol. 331(1–2), 293–299 (2006)
Olsson, M., Soder, L.: Modeling real-time balancing power market prices using combined SARIMA and Markov processes. IEEE Trans. Power Syst. 23(2), 443–450 (2008). https://doi.org/10.1109/TPWRS.2008.920046
Sakia, R.M.: The box-cox transformation technique: a review. J. Roy. Stat. Soc. Ser. D (Stat.) 41(2), 169–178 (1992)
Sheta, A.F., Faris, H., Aljarah, I.: Estimating ARMA model parameters of an industrial process using meta-heuristic search algorithms. Int. J. Eng. Technol. 7(3.10), 187–194 (2018)
Shumway, R.H., Stoffer, D.S.: Time Series Analysis and Its Applications: With R Examples. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-319-52452-8
Soares., F.A.L., Silveira., T.B., Freitas., H.C.: Hybrid approach based on SARIMA and artificial neural networks for knowledge discovery applied to crime rates prediction. In: Proceedings of the 22nd International Conference on Enterprise Information Systems: ICEIS, vol. 1, pp. 407–415. INSTICC, SciTePress (2020). https://doi.org/10.5220/0009412704070415
Susanti, S.P., Azizah, F.N.: Imputation of missing value using dynamic Bayesian network for multivariate time series data. In: 2017 International Conference on Data and Software Engineering (ICoDSE), pp. 1–5, November 2017. https://doi.org/10.1109/ICODSE.2017.8285864
Willmott, C.J.: Some comments on the evaluation of model performance. Bull. Am. Meteor. Soc. 63(11), 1309–1313 (1982)
Wold, H.: A study in the analysis of stationary time series. Ph.D. thesis, Almqvist & Wiksell (1938)
Yule, G.U.: Why do we sometimes get nonsense-correlations between time-series? - A study in sampling and the nature of time-series. J. Roy. Stat. Soc. 89(1), 1–63 (1926)
Acknowledgements
The present work was carried out with the support of the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brazil (CAPES) - Financing Code 001. The authors thank CNPq, FAPEMIG, PUC Minas and REVEX for the partial support in the execution of this work.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Batista da Silveira, T., Lara Soares, F.A., Cota de Freitas, H. (2021). Fast and Efficient Parallel Execution of SARIMA Prediction Model. In: Filipe, J., Śmiałek, M., Brodsky, A., Hammoudi, S. (eds) Enterprise Information Systems. ICEIS 2020. Lecture Notes in Business Information Processing, vol 417. Springer, Cham. https://doi.org/10.1007/978-3-030-75418-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-75418-1_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75417-4
Online ISBN: 978-3-030-75418-1
eBook Packages: Computer ScienceComputer Science (R0)