Abstract
Time-series forecasting plays an important role in many domains. Boosted by the advances in Deep Learning algorithms, it has for instance been used to predict wind power for eolic energy production, stock market fluctuations, or motor overheating. In some of these tasks, we are interested in predicting accurately some particular moments which often are underrepresented in the dataset, resulting in a problem known as imbalanced regression. In the literature, while recognized as a challenging problem, limited attention has been devoted on how to handle the problem in a practical setting. In this paper, we put forward a general approach to analyze time-series forecasting problems focusing on those underrepresented moments to reduce imbalances. Our approach has been developed based on a case study in a large industrial company, which we use to exemplify the approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ahmed, N.K., Atiya, A.F., Gayar, N.E., El-Shishiny, H.: An empirical comparison of machine learning models for time series forecasting. Econometric Rev. 29(5–6), 594–621 (2010)
Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv:1803.01271 [cs] (2018)
Branco, P., Torgo, L., Ribeiro, R.P.: A survey of predictive modeling on imbalanced domains. ACM Comput. Surveys 49(2), 1–50 (2016)
Branco, P., Torgo, L., Ribeiro, R.P.: Pre-processing approaches for imbalanced distributions in regression. Neurocomputing 343, 76–99 (2019). https://doi.org/10.1016/j.neucom.2018.11.100
Ding, D., Zhang, M., Pan, X., Yang, M., He, X.: Modeling extreme events in time series prediction. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1114–1122. KDD 2019. Association for Computing Machinery, New York, NY, USA (2019). https://doi.org/10.1145/3292500.3330896
Freedman, D., Diaconis, P.: On the histogram as a density estimator: L 2 theory. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 57(4), 453–476 (1981)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Laptev, N., Yosinski, J., Li, L., Smyl, S.: Time-series extreme event forecasting with neural networks at Uber. In: 2017 International Conference on Machine Learning Time Series Workshop (2017)
Liu, Y., et al.: Application of deep convolutional neural networks for detecting extreme weather in climate datasets. CoRR abs/1605.01156 (2016). http://arxiv.org/abs/1605.01156
Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 30, pp. 4765–4774. Curran Associates, Inc. (2017)
Magalhães, E.: Quality prediction in a mining process, https://www.kaggle.com/edumagalhaes/quality-prediction-in-a-mining-process. Accessed 19 July 2021
Pantiskas, L., Verstoep, K., Bal, H.: Interpretable multivariate time series forecasting with temporal attention convolutional neural networks. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1687–1694 (2020). https://doi.org/10.1109/SSCI47803.2020.9308570
Pantiskas, L., Silvestrin, L.P.: Open code of the experiment using the mining process dataset (2021). https://gitlab.com/lpsilvestrin/imbalanced-time-series-forecast. Accessed 20 July 2021
Rosenblatt, F.: The perceptron: a probabilistic model for information storage and organization in the brain. Psychol. Rev. 65(6), 386 (1958)
Salinas, D., Flunkert, V., Gasthaus, J., Januschowski, T.: DeepAR: probabilistic forecasting with autoregressive recurrent networks. Int. J. Forecasting 36(3), 1181–1191 (2020). https://doi.org/10.1016/j.ijforecast.2019.07.001. https://www.sciencedirect.com/science/article/pii/S0169207019301888
Silvestrin, L.P., Hoogendoorn, M., Koole, G.: A comparative study of state-of-the-art machine learning algorithms for predictive maintenance. In: 2019 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 760–767. IEEE (2019)
Snieder, E., Abogadil, K., Khan, U.T.: Resampling and ensemble techniques for improving ANN-based high streamflow forecast accuracy. Hydrol. Earth Syst. Sci. Discuss. 2020, 1–35 (2020)
Torres, J.F., Hadjout, D., Sebaa, A., Martínez-Álvarez, F., Troncoso, A.: Deep learning for time series forecasting: a survey. Big Data 9(1), 3–21 (2021). https://doi.org/10.1089/big.2020.0159
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems Systems (NIPS), pp. 5998–6008 (2017)
Acknowledgements
This work has been conducted as part of the Just in Time Maintenance project funded by the European Fund for Regional Development. We also thank Tata Steel Europe for providing the data and technical expertise required for our experiments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Silvestrin, L.P., Pantiskas, L., Hoogendoorn, M. (2022). A Framework for Imbalanced Time-Series Forecasting. In: Nicosia, G., et al. Machine Learning, Optimization, and Data Science. LOD 2021. Lecture Notes in Computer Science(), vol 13163. Springer, Cham. https://doi.org/10.1007/978-3-030-95467-3_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-95467-3_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95466-6
Online ISBN: 978-3-030-95467-3
eBook Packages: Computer ScienceComputer Science (R0)