Abstract
An investigation on how to produce a fast and accurate prediction of user behaviour on the Web is conducted. First, the problem of predicting user behaviour as a classification task is formulated and then the main problems of such real-time predictions are specified: the accuracy and time complexity of the prediction. Second, a method for comparison of online and batch (offline) algorithms used for user behaviour prediction is proposed. Last, the performance of these algorithms using the data from a popular question and answer platform, Stack Overflow, is empirically explored. It is demonstrated that a simple online learning algorithm outperforms state-of-the-art batch algorithms and performs as well as a deep learning algorithm, Deep Belief Networks. The proposed method for comparison of online and offline algorithms as well as the provided experimental evidence can be used for choosing a machine learning set-up for predicting user behaviour on the Web in scenarios where the accuracy and the time performance are of main concern.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
The source code is at https://github.com/Nik0l/UTemPr.
- 2.
- 3.
- 4.
- 5.
In our experiments, we tried \(L_1\) and \(L_2\) regularisation but we did not find any significant improvements in the results compared to the results without regularisation reported in this paper.
References
Choi, S., Kim, E., Oh, S.: Human behavior prediction for smart homes using deep learning. In: 2013 IEEE RO-MAN, pp. 173–179 (2013)
Nazerfard E., Cook, D.: Using Bayesian Networks for Daily Activity Prediction (2013)
Burlutskiy, N., Petridis, M., Fish, A., Ali, N.: Prediction of users’ response time in Q&A communities. In: ICMLA’15, International Conference on Machine Learning and Applications (2015)
Weerkamp, W., De Rijke, M.: Activity prediction: a twitter-based exploration. In: Proceedings of TAIA’12 (2012)
Zheng, B., Thompson, K., Lam, S.S., Yoon, S.W., Gnanasambandam, N.: Customers behavior prediction using artificial neural network. In: Industrial and Systems Engineering Research Conference (ISERC), pp. 700–709. Institute of Industrial Engineerings (2013)
Loumiotis, I., Adamopoulou, E., Demestichas, K., Theologou, M.: On trade-off between computational efficiency and prediction accuracy in bandwidth traffic estimation. Electron. Lett. 50(10), 754–756 (2014)
Liang, N.Y., Huang, G.B., Saratchandran, P., Sundararajan, N.: A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 17(6), 1411–1423 (2006)
Sadilek, A., Krumm, J.: Predicting long-term human mobility. In: AAAI, Far out (2012)
Zhu, Y., Zhong, E., Pan, S.J., Wang, X., Zhou, M., Yang, Q.: Predicting user activity level in social networks. In: Proceedings of the 22Nd ACM International Conference on Information and Knowledge Management, CIKM ’13, pp. 159–168, New York, NY, USA. ACM (2013)
Radinsky, K., Svore, K., Dumais, S., Teevan, J., Bocharov, A., Horvitz, E.: Modeling and predicting behavioral dynamics on the web. In: Proceedings of the 21st International Conference on World Wide Web, WWW ’12, pp. 599–608, New York, NY, USA. ACM (2012)
Dror, G., Maarek, Y., Szpektor, I.: Will my question be answered? predicting “question answerability” in community question-answering sites. In: Blockeel, H. (ed.) Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, vol. 8190, pp. 499–514. Springer, Berlin Heidelberg (2013)
Yang, L., Bao, S., Lin, Q., Wu, X., Han, D., Su, Z., Yu, Y.: Analyzing and predicting not-answered questions in community-based question answering services. In: Burgard, W. (ed.) AAAI. AAAI Press (2011)
Lim, T.S., Loh, W.Y., Shih, Y.S.: A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach. Learn. 40(3), 203–228 (2000)
Anderson, A., Huttenlocher, D., Kleinberg, J., Leskovec, J.: Discovering value from community activity on focused question answering sites: a case study of stack overflow. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 850–858, New York, USA. ACM (2012)
Asaduzzaman, M., Mashiyat, A.S., Roy, C.K., Schneider, K.A.: Answering questions about unanswered questions of stack overflow. In: Proceedings of the 10th Working Conference on Mining Software Repositories, pp. 97–100. Piscataway, NJ, USA (2013)
Bhat, V., Gokhale, A., Jadhav, R., Pudipeddi, J., Akoglu, L.: Min(e)d your tags: analysis of question response time in stack overflow. In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 328–335 (2014)
Lezina, C.G.E., Kuznetsov, A.M.: Predict Closed Questions on Stack Overflow (2012)
Dekel, O.: From online to batch learning with cutoff-averaging. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems 21, pp. 377–384. Curran Associates, Inc. (2009)
Hoi, S.C., Wang, J., Zhao, P.: Libol: a library for online learning algorithms. J. Mach. Learn. Res. 15, 495–499 (2014)
Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. J. Mach. Learn. Res. 7, 551–585 (2006)
Bianchini, M., Scarselli, F.: On the complexity of neural network classifiers: a comparison between shallow and deep achitectures. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1553–1565 (2014)
Chapelle, O.: Training a support vector machine in the primal. Neural Comput. 19(5), 1155–1178 (2007)
Minka, T.P.: A Comparison of Numerical Optimizers for Logistic Regression. Technical report (2003)
Su, J., Zhang, H.: A fast decision tree learning algorithm. In: Proceedings of the 21st National Conference on Artificial Intelligence—Volume 1, AAAI’06, pp. 500–505. AAAI Press (2006)
Bottou, L.: Proceedings of COMPSTAT’2010: 19th International Conference on Computational StatisticsParis France, August 22–27, 2010 Keynote, Invited and Contributed Papers, chapter Large-Scale Machine Learning with Stochastic Gradient Descent, pp. 177–186. Physica-Verlag HD, Heidelberg (2010)
Mohri, M., Rostamizadeh, A., Talwalkar, A.: Foundations of Machine Learning. The MIT Press (2012)
Cai, Y., Chakravarthy, S.: Answer quality prediction in Q&A social networks by leveraging temporal features. IJNGC 4(1) (2013)
Acknowledgments
The authors are grateful for illuminating discussions to Dr Yuri Kalnishkan’s team in the project “On-line Self-Tuning Learning Algorithms for Handling Historical Information” (funded by the Leverhulme Trust).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Burlutskiy, N., Petridis, M., Fish, A., Chernov, A., Ali, N. (2016). An Investigation on Online Versus Batch Learning in Predicting User Behaviour. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXXIII. SGAI 2016. Springer, Cham. https://doi.org/10.1007/978-3-319-47175-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-47175-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47174-7
Online ISBN: 978-3-319-47175-4
eBook Packages: Computer ScienceComputer Science (R0)