An Investigation on Online Versus Batch Learning in Predicting User Behaviour

Nikolay Burlutskiy³,
Miltos Petridis³,
Andrew Fish³,
Alexey Chernov³ &
…
Nour Ali³

Included in the following conference series:

International Conference on Innovative Techniques and Applications of Artificial Intelligence

1189 Accesses
6 Citations

Abstract

An investigation on how to produce a fast and accurate prediction of user behaviour on the Web is conducted. First, the problem of predicting user behaviour as a classification task is formulated and then the main problems of such real-time predictions are specified: the accuracy and time complexity of the prediction. Second, a method for comparison of online and batch (offline) algorithms used for user behaviour prediction is proposed. Last, the performance of these algorithms using the data from a popular question and answer platform, Stack Overflow, is empirically explored. It is demonstrated that a simple online learning algorithm outperforms state-of-the-art batch algorithms and performs as well as a deep learning algorithm, Deep Belief Networks. The proposed method for comparison of online and offline algorithms as well as the provided experimental evidence can be used for choosing a machine learning set-up for predicting user behaviour on the Web in scenarios where the accuracy and the time performance are of main concern.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 71.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 89.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Improving Incremental Recommenders with Online Bagging

Online Learning of Deep Hybrid Architectures for Semi-supervised Categorization

Online Learning for Conversational Agents

Notes

1.
The source code is at https://github.com/Nik0l/UTemPr.
2.
http://scikit-learn.org/.
3.
https://github.com/Lasagne/Lasagne.
4.
http://www.nltk.org/.
5.
In our experiments, we tried \(L_1\) and \(L_2\) regularisation but we did not find any significant improvements in the results compared to the results without regularisation reported in this paper.

References

Choi, S., Kim, E., Oh, S.: Human behavior prediction for smart homes using deep learning. In: 2013 IEEE RO-MAN, pp. 173–179 (2013)
Google Scholar
Nazerfard E., Cook, D.: Using Bayesian Networks for Daily Activity Prediction (2013)
Google Scholar
Burlutskiy, N., Petridis, M., Fish, A., Ali, N.: Prediction of users’ response time in Q&A communities. In: ICMLA’15, International Conference on Machine Learning and Applications (2015)
Google Scholar
Weerkamp, W., De Rijke, M.: Activity prediction: a twitter-based exploration. In: Proceedings of TAIA’12 (2012)
Google Scholar
Zheng, B., Thompson, K., Lam, S.S., Yoon, S.W., Gnanasambandam, N.: Customers behavior prediction using artificial neural network. In: Industrial and Systems Engineering Research Conference (ISERC), pp. 700–709. Institute of Industrial Engineerings (2013)
Google Scholar
Loumiotis, I., Adamopoulou, E., Demestichas, K., Theologou, M.: On trade-off between computational efficiency and prediction accuracy in bandwidth traffic estimation. Electron. Lett. 50(10), 754–756 (2014)
Article Google Scholar
Liang, N.Y., Huang, G.B., Saratchandran, P., Sundararajan, N.: A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 17(6), 1411–1423 (2006)
Article Google Scholar
Sadilek, A., Krumm, J.: Predicting long-term human mobility. In: AAAI, Far out (2012)
Google Scholar
Zhu, Y., Zhong, E., Pan, S.J., Wang, X., Zhou, M., Yang, Q.: Predicting user activity level in social networks. In: Proceedings of the 22Nd ACM International Conference on Information and Knowledge Management, CIKM ’13, pp. 159–168, New York, NY, USA. ACM (2013)
Google Scholar
Radinsky, K., Svore, K., Dumais, S., Teevan, J., Bocharov, A., Horvitz, E.: Modeling and predicting behavioral dynamics on the web. In: Proceedings of the 21st International Conference on World Wide Web, WWW ’12, pp. 599–608, New York, NY, USA. ACM (2012)
Google Scholar
Dror, G., Maarek, Y., Szpektor, I.: Will my question be answered? predicting “question answerability” in community question-answering sites. In: Blockeel, H. (ed.) Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, vol. 8190, pp. 499–514. Springer, Berlin Heidelberg (2013)
Chapter Google Scholar
Yang, L., Bao, S., Lin, Q., Wu, X., Han, D., Su, Z., Yu, Y.: Analyzing and predicting not-answered questions in community-based question answering services. In: Burgard, W. (ed.) AAAI. AAAI Press (2011)
Google Scholar
Lim, T.S., Loh, W.Y., Shih, Y.S.: A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach. Learn. 40(3), 203–228 (2000)
Article MATH Google Scholar
Anderson, A., Huttenlocher, D., Kleinberg, J., Leskovec, J.: Discovering value from community activity on focused question answering sites: a case study of stack overflow. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 850–858, New York, USA. ACM (2012)
Google Scholar
Asaduzzaman, M., Mashiyat, A.S., Roy, C.K., Schneider, K.A.: Answering questions about unanswered questions of stack overflow. In: Proceedings of the 10th Working Conference on Mining Software Repositories, pp. 97–100. Piscataway, NJ, USA (2013)
Google Scholar
Bhat, V., Gokhale, A., Jadhav, R., Pudipeddi, J., Akoglu, L.: Min(e)d your tags: analysis of question response time in stack overflow. In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 328–335 (2014)
Google Scholar
Lezina, C.G.E., Kuznetsov, A.M.: Predict Closed Questions on Stack Overflow (2012)
Google Scholar
Dekel, O.: From online to batch learning with cutoff-averaging. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems 21, pp. 377–384. Curran Associates, Inc. (2009)
Google Scholar
Hoi, S.C., Wang, J., Zhao, P.: Libol: a library for online learning algorithms. J. Mach. Learn. Res. 15, 495–499 (2014)
MATH Google Scholar
Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. J. Mach. Learn. Res. 7, 551–585 (2006)
MathSciNet MATH Google Scholar
Bianchini, M., Scarselli, F.: On the complexity of neural network classifiers: a comparison between shallow and deep achitectures. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1553–1565 (2014)
Article Google Scholar
Chapelle, O.: Training a support vector machine in the primal. Neural Comput. 19(5), 1155–1178 (2007)
Article MathSciNet MATH Google Scholar
Minka, T.P.: A Comparison of Numerical Optimizers for Logistic Regression. Technical report (2003)
Google Scholar
Su, J., Zhang, H.: A fast decision tree learning algorithm. In: Proceedings of the 21st National Conference on Artificial Intelligence—Volume 1, AAAI’06, pp. 500–505. AAAI Press (2006)
Google Scholar
Bottou, L.: Proceedings of COMPSTAT’2010: 19th International Conference on Computational StatisticsParis France, August 22–27, 2010 Keynote, Invited and Contributed Papers, chapter Large-Scale Machine Learning with Stochastic Gradient Descent, pp. 177–186. Physica-Verlag HD, Heidelberg (2010)
Google Scholar
Mohri, M., Rostamizadeh, A., Talwalkar, A.: Foundations of Machine Learning. The MIT Press (2012)
Google Scholar
Cai, Y., Chakravarthy, S.: Answer quality prediction in Q&A social networks by leveraging temporal features. IJNGC 4(1) (2013)
Google Scholar

Download references

Acknowledgments

The authors are grateful for illuminating discussions to Dr Yuri Kalnishkan’s team in the project “On-line Self-Tuning Learning Algorithms for Handling Historical Information” (funded by the Leverhulme Trust).

Author information

Authors and Affiliations

The University of Brighton, Brighton, UK
Nikolay Burlutskiy, Miltos Petridis, Andrew Fish, Alexey Chernov & Nour Ali

Authors

Nikolay Burlutskiy
View author publications
You can also search for this author in PubMed Google Scholar
Miltos Petridis
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Fish
View author publications
You can also search for this author in PubMed Google Scholar
Alexey Chernov
View author publications
You can also search for this author in PubMed Google Scholar
Nour Ali
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nikolay Burlutskiy .

Editor information

Editors and Affiliations

School of Computing, University of Portsmouth, Portsmouth, Hampshire, United Kingdom
Max Bramer
School of Computing, Engineering and Mathematics, University of Brighton, Brighton, East Sussex, United Kingdom
Miltos Petridis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Burlutskiy, N., Petridis, M., Fish, A., Chernov, A., Ali, N. (2016). An Investigation on Online Versus Batch Learning in Predicting User Behaviour. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXXIII. SGAI 2016. Springer, Cham. https://doi.org/10.1007/978-3-319-47175-4_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-47175-4_9
Published: 05 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47174-7
Online ISBN: 978-3-319-47175-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics