Abstract
Training machine learning models often requires large labelled datasets, which can be both expensive and time-consuming to obtain. Active learning aims to selectively choose which data is labelled in order to minimize the total number of labels required to train an effective model. This paper compares model-free and model-based approaches to active learning for regression, finding that model-free approaches, in addition to being less computationally intensive to implement, are more effective in improving the performance of linear regressions than model-based alternatives.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Settles, B.: Active Learning Literature Survey, vol. 52, pp. 55–66. University of Wisconsin, Madison (2010)
Settles, B., Craven, M., Friedland, L.: Active learning with real annotation costs. In: Proceedings of the NIPS Workshop on Cost-Sensitive Learning, pp. 1–10 (2008)
Margineantu, D.D.: Active cost-sensitive learning. IJCAI 5, 1622–1623 (2005)
Hu, R., Delany, S.J., Mac Namee, B.: Egal: Exploration guided active learning for tcbr. In: Case-Based Reasoning. Research and Development, pp. 156–170. Springer (2010)
Cai, W., Zhang, Y., Zhou, J.: Maximizing expected model change for active learning in regression. In: 2013 IEEE 13th International Conference on Data Mining (ICDM), pp. 51–60, IEEE (2013)
Castro, R.M., Willett, R., Nowak, R.: Faster rates in regression via active learning. In: Advances in Neural Information Processing Systems, pp. 179–186 (2005)
Sung, K.K., Niyogi, P.: Active learning for function approximation. In: Advances in Neural Information Processing Systems, pp. 593–600 (1995)
Yu, K., Bi, J., Tresp, V.: Active learning via transductive experimental design. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 1081–1088. ACM (2006)
Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 287–294. ACM (1992)
Burbidge, R., Rowland, J.J., King, R.D.: Active Learning for Regression Based on Query by Committee, pp. 209–218. Springer (2007) (state-of-the-art)
Settles, B., Craven, M., Ray, S.: Multiple-instance active learning. In: Advances in Neural Information Processing Systems, pp. 1289–1296 (2008)
Baram, Y., El-Yaniv, R., Luz, K.: Online choice of active learning algorithms. J. Mach. Learn. Res. 5, 255–291 (2004)
Osugi, T., Kim, D., Scott, S.: Balancing exploration and exploitation: a new algorithm for active machine learning. In: Fifth IEEE International Conference on Data Mining, pp. 8. IEEE (2005)
Boiy, E., Moens, M.-F.: A machine learning approach to sentiment analysis in multilingual web texts. Inf. Retriev. 12(5), 526–558 (2009)
Mac Namee, B., Hu, R., Delany, S.: Inside the selection box: visualising active learning selection strategies. In: The Challenges of Data Visualization Neural Information Processing Systems, (NIPS) 2010 Workshop (2010)
Hu, R.: Active learning for text classification. Thesis, Dublin Institute of Technology (2011)
Settles, B.: From theories to queries: active learning in practice. In: Active Learning and Experimental Design Workshop, pp. 1–18 (2011)
Xu, Z., Akella, R., Zhang, Y.: Incorporating Diversity and Density in Active Learning for Relevance Feedback. Springer (2007)
Hoi, S.C., Jin, R., Zhu, J., Lyu, M.R.: Batch mode active learning and its application to medical image classification. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 417–424. ACM (2006)
Guo, Y., Schuurmans, D.: Discriminative batch mode active learning. In: Advances in Neural Information Processing Systems, pp. 593–600 (2008)
Hu, R., Mac Namee, B., Delany, S.J. : Sweetening the dataset: using active learning to label unlabelled datasets. In: Proceedings of the 19th Irish Conference on Artificial Intelligence and Cognitive Science (AICS ’08) (2008)
Ray Chaudhuri, T., Hamey, L.G.: Minimisation of data collection by active learning. In: Proceedings. IEEE International Conference on Neural Networks, 1995, vol. 3, pp. 1338–1341, IEEE (1995)
Borchers, H.W.: pracma: Practical Numerical Math Functions (2015) (R package version 1.8.6)
Haertel, R., Felt, P., Ringger, E., Seppi, K.: Parallel active learning: eliminating wait time with minimal staleness. In: Proceedings of the NAACL HLT 2010 Workshop on Active Learning for Natural Language Processing, pp. 33–41. Association for Computational Linguistics (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
O’Neill, J., Jane Delany, S., MacNamee, B. (2017). Model-Free and Model-Based Active Learning for Regression. In: Angelov, P., Gegov, A., Jayne, C., Shen, Q. (eds) Advances in Computational Intelligence Systems. Advances in Intelligent Systems and Computing, vol 513. Springer, Cham. https://doi.org/10.1007/978-3-319-46562-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-46562-3_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46561-6
Online ISBN: 978-3-319-46562-3
eBook Packages: EngineeringEngineering (R0)