Abstract
House prices estimation has been the focus of both commercial and academic researches with various approaches being explored. Depending on the location, size, age, time and other factors, the value of houses may vary. This paper presents a modularized, process oriented, data enabled and machine learning based framework, designed to help the decision makers within the housing ecosystem to have more realistic estimation of the house prices. The development of the framework leverages the Design Science Research Methodology (DSRM) and the HM Land Registry Price Paid Data is ingested into the framework as the base transactions data. 1.1 million London based transaction records between January 2011 and December 2020 have been exploited for model design and evaluation. The proposed framework also leverages a range of neighborhood data including the location of rail stations, supermarkets and bus stops to explore the possible impact on house prices. Five machine learning algorithms have been exploited and three evaluation metrics have been presented and with a focus on RMSE. Results show that an increase in the variety of parameters enables improved accuracy which ultimately will enable decision making. The potential for future work based on this paper can explore the impact of the introduction of other groups of data on the accuracy of machine learning models designed for the estimation of house prices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Office of National Statistics. Employee earnings in the UK (2020). https://www.ons.gov.uk/employmentandlabourmarket/peopleinwork/earningsandworkinghours/bulletins/annualsurveyofhoursandearnings/2020. Accessed 30 May 2021
HM Land Registry 2020. UK House Price Index for March 2020. GOV.UK. https://www.gov.uk/government/news/uk-house-price-index-for-march-2020. Accessed 5 Jan 2021
Hong, J., Choi, H., Kim, W.S.: A house price valuation based on the random forest approach: the mass appraisal of residential property in South Korea. Int. J. Strateg. Prop. Manag. 24(3), 140–152 (2020)
Ferlan, N., Bastic, M., Psunder, I.: Influential factors on the market value of residential properties. Eng. Econ. 28(2), 135–144 (2017)
Ge, C., et al.: An integrated model for urban subregion house price forecasting: a multi-source data perspective. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 1054–1059 (2019)
Wang, C., Wu, H.: A new machine learning approach to house price estimation. New Trends Math. Sci. 6(4), 165–171 (2018)
Chi, B., et al.: Creating a new dataset to analyse house prices in England (2019). https://discovery.ucl.ac.uk/id/eprint/10082766/. Accessed 29 Sep 2021
GOV.UK. Statistical Dataset Price Paid Data. https://www.gov.uk/government/statistical-data-sets/price-paid-data-downloads. Accessed 20 May 2021
Office of National Statistics, Postcode products (2021). https://www.ons.gov.uk/methodology/geography/geographicalproducts/postcodeproducts. Accessed 30 May 2021
Park, B., Bae, J.K.: Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data. Online] Expert Systems with Application (2014). https://doi.org/10.1016/j.eswa.2014.11.040. Accessed 30 March 2021
Madhuri, C.R., Anuradha, G., Pujitha, M.V.: House price prediction using regression techniques: a comparative study. In: 2019 International Conference on Smart Structures and Systems (ICSSS) (2019)
Rico-Juan, J.R., Taltavull de La Paz, P.: Machine learning with explainability or spatial hedonics tools? An analysis of the asking prices in the housing market in Alicante, Spain. Expert Syst. Appl. (2021)
Hammad, R.K.M.: A Hybrid E-Learning Framework: Process-Based, Semantically-Enriched and Service-Oriented. PhD Thesis. University of West England, Bristol (2018)
Scikit-Learn. sklear.pipeline.Pipeline. https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html. Accessed 2 Oct 2021
Ke, G., et al.: Lightgbm: a highly efficient gradient boosting decision tree. Adv. Neural. Inf. Process. Syst. 30, 3146–3154 (2017)
Chen, T., Guestrin, C.: Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016a)
Truong, Q., Nguyen, M., Dang, H., Mei, B.: Housing price prediction via improved machine learning techniques. Procedia Comput. Sci. 174, 433–442 (2020)
Lu, S., Li, Z., Qin, Z., Yang, X., Goh, R.S.M.: A hybrid regression technique for house prices prediction. In: 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), pp. 319–323. IEEE (2017)
Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992)
Moody, J.: What does RSME really mean? (2019). https://towardsdatascience.com/what-does-rmse-really-mean-806b65f2e48e. Accessed 11 Sept 2021
Zhang, A.: Evaluating Machine Learning Models. O’Reilly Online Learning (2012). https://www.oreilly.com/library/view/evaluating-machine-learning/9781492048756/ch04.html. Accessed 28 Jun 2021
Koehrsen, W.: A Conceptual Explanation of Bayesian Hyperparameter Optimization for Machine Learning (2018). A Conceptual Explanation of Bayesian Hyperparameter Optimization for Machine Learning | by Will Koehrsen | Towards Data Science
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Awonaike, A., Ghorashi, S.A., Hammaad, R. (2022). A Machine Learning Framework for House Price Estimation. In: Abraham, A., Gandhi, N., Hanne, T., Hong, TP., Nogueira Rios, T., Ding, W. (eds) Intelligent Systems Design and Applications. ISDA 2021. Lecture Notes in Networks and Systems, vol 418. Springer, Cham. https://doi.org/10.1007/978-3-030-96308-8_90
Download citation
DOI: https://doi.org/10.1007/978-3-030-96308-8_90
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96307-1
Online ISBN: 978-3-030-96308-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)