Abstract
Generating accurate and reliable sales forecasts is crucial in the E-commerce business. The current state-of-the-art techniques are typically univariate methods, which produce forecasts considering only the historical sales data of a single product. However, in a situation where large quantities of related time series are available, conditioning the forecast of an individual time series on past behaviour of similar, related time series can be beneficial. Since the product assortment hierarchy in an E-commerce platform contains large numbers of related products, in which the sales demand patterns can be correlated, our attempt is to incorporate this cross-series information in a unified model. We achieve this by globally training a Long Short-Term Memory network (LSTM) that exploits the non-linear demand relationships available in an E-commerce product assortment hierarchy. Aside from the forecasting framework, we also propose a systematic pre-processing framework to overcome the challenges in the E-commerce business. We also introduce several product grouping strategies to supplement the LSTM learning schemes, in situations where sales patterns in a product portfolio are disparate. We empirically evaluate the proposed forecasting framework on a real-world online marketplace dataset from Walmart.com. Our method achieves competitive results on category level and super-departmental level datasets, outperforming state-of-the-art techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hyndman, R., et al.: Forecasting with Exponential Smoothing: The State Space Approach. Springer, Heidelberg (2008). doi: 10.1007/978-3-540-71918-2
Box, G.E.P., et al.: Time Series Analysis: Forecasting and Control. Wiley, Hoboken (2015)
Trapero, J.R., Kourentzes, N., Fildes, R.: On the identification of sales forecasting models in the presence of promotions. J. ORS 66, 299–307 (2015)
Borovykh, A., Bohte, S., Oosterlee, C.W.: Conditional time series forecasting with convolutional neural networks. arXiv [cs.AI] (2017)
Flunkert, V., Salinas, D., Gasthaus, J.: DeepAR: probabilistic forecasting with autoregressive recurrent networks. arXiv [cs.AI] (2017)
Wen, R. et al.: A multi-horizon quantile recurrent forecaster. arXiv [stat.ML] (2017)
Chapados, N.: Effective Bayesian modeling of groups of related count time series. In: Proceedings of the 31st ICML (2014)
Bandara, K., Bergmeir, C., Smyl, S.: Forecasting across time series databases using RNNs on groups of similar series: a clustering approach. arXiv [cs.LG] (2017)
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
Williams, R.J., Zipser, D.: Gradient-based learning algorithms for RNNs and their computational complexity (1995)
Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 2, 157–166 (1994)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., et al. (eds.) Proceedings of the 27th NIPS (2014)
Ben Taieb, S., et al.: A review and comparison of strategies for multi-step ahead time series forecasting based on the NN5 forecasting competition. Expert Syst. Appl. 39(8), 7067–7083 (2012)
Ord, K., Fildes, R.A., Kourentzes, N.: Principles of Business Forecasting, 2nd edn. Wessex Press Publishing Co., New York (2017)
Abadi, M., et al.: TensorFlow: Large-scale machine learning on heterogeneous distributed systems. arXiv [cs.DC] (2016)
Hutter, F., Hoos, H.H., Leyton-Brown, K.: Sequential model-based optimization for general algorithm configuration. In: Proceedings of the 5th International Conference on Learning and Intelligent Optimization, Italy, Rome, pp. 507–523 (2011)
Fernando, Bayesian-optimization: Bayesian Optimization of Hyper-parameters. Github (2017). https://bit.ly/2EssG1r
Snoek, J., Larochelle, H., Adams, R.P.: Practical Bayesian optimization of machine learning algorithms. In: Proceedings of the 25th NIPS (2012)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv [cs.LG] (2014)
Orabona, F., Tommasi, T.: Training deep networks without learning rates through coin betting. arXiv [cs.LG] (2017)
Hyndman, R.J., Khandakar, Y.: Automatic time series forecasting: the forecast package for R. J. Stat. Softw. 26(3), 1–22 (2008)
Hyndman, R.J., et al.: Time series features R package (2018). https://bit.ly/2GekHql
Kornelson, K.P., Vajjiravel, M., Prasad, R., Clark, P.D., Najm, T.: Method and system for developing extract transform load systems for data warehouses (2006)
Makridakis, S., Spiliotis, E., Assimakopoulos, V.: The M4 competition: results, findings, conclusion and way forward. Int. J. Forecast. 34, 802–808 (2018)
Weller, M., Crone, S.: Supply chain forecasting: best practices & benchmarking study. Technical report, Lancaster Centre for Forecasting (2012)
Taylor, S.J., Letham, B.: Forecasting at scale. Am. Stat. 72(1), 37–45 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Bandara, K., Shi, P., Bergmeir, C., Hewamalage, H., Tran, Q., Seaman, B. (2019). Sales Demand Forecast in E-commerce Using a Long Short-Term Memory Neural Network Methodology. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11955. Springer, Cham. https://doi.org/10.1007/978-3-030-36718-3_39
Download citation
DOI: https://doi.org/10.1007/978-3-030-36718-3_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36717-6
Online ISBN: 978-3-030-36718-3
eBook Packages: Computer ScienceComputer Science (R0)