Abstract
Music streaming services are getting increasingly popular as a result of the prevalent need for web and smart gadgets. Melophiles are drawn toward a range of musical genres and create a unique digital footprint. The emotional reactions of the music listeners may cause physiological changes as well as neurological benefits. However, any musician should avoid being influenced by an algorithmic recommendation presented in the research. The most prominent aspect for any musician is to stay true to their own musical style and to be novel, honest, and sincere. The main objective of this research work is to provide precise music predictions to the artists before they launch their music in the market. This work also aims to improve the accuracy of the prediction models using some feature-engineering-based techniques to manipulate the Spotify dataset and predict whether the song will be a hit or not. The proposed work leverages state-of-the-art machine learning models, including Linear Regression, Lasso Regression, Ridge Regression, Elastic Net, Random Forest, GBM, and neural networks to explore the predictive factors influencing the inclusion of a particular music track in the featured Spotify Hit-50 list. Furthermore, it offers descriptive statistical analysis of the features that can help in popularity prediction.
Similar content being viewed by others
Dataset Availability
The dataset is available on request.
References
Guan W. Analysis of music education management mode in colleges and universities in China. 2024;6:127–48. https://doi.org/10.25236/FER.2023.060525.
Akintoye OD. East African Journal of Law and Ethics challenges of protecting music intellectual property in the digital new age in Nigeria. 2023;6:15–28. https://doi.org/10.37284/eajle.6.1.1189.IEEE.
Tabata T, Wang TY. Life cycle assessment of co2 emissions of online music and videos streaming in Japan. Appl Sci (Switzerland). 2021;11:3992. https://doi.org/10.3390/app11093992.
Kaminskas M, Ricci F. Contextual music information retrieval and recommendation: state of the art and challenges. Comput Sci Rev. 2012;6:89–119. https://doi.org/10.1016/j.cosrev.2012.04.002.
Vitale JL. Music makes you smarter: a new paradigm for music education? Perceptions and perspectives from four groups of elementary education stakeholders. Can J Educ. 2011;34:317–43.
Engez A, Aarikka-Stenroos L. Stakeholder contributions to commercialization and market creation of a radical innovation: bridging the micro- and macro levels. J Bus Ind Mark. 2022;38:31–44. https://doi.org/10.1108/JBIM-03-2022-0136.
Duman D, Neto P, Mavrolampados A, Toiviainen P, Luck G. Music we move to: Spotify audio features and reasons for listening. PLoS ONE. 2022;17:1–18. https://doi.org/10.1371/journal.pone.0275228.
Yakura H, Nakano T, Goto M. An automated system recommending background music to listen to while working. User Model User Adap Interact. 2022;32:355–88. https://doi.org/10.1007/s11257-022-09325-y.
Yoon C, Gonzalez R, Bechara A, Berns GS, Dagher AA, Dubé L, et al. Decision neuroscience and consumer decision making. Mark Lett. 2012;23:473–85. https://doi.org/10.1007/s11002-012-9188-z.
Li X. Information retrieval method of professional music teaching based on hidden Markov model. In: Proceedings—2022 14th international conference on measuring technology and mechatronics automation, ICMTMA 2022. 2022. pp. 1072–5. https://doi.org/10.1109/ICMTMA54903.2022.00216.
Raimond Y, Sutton C, Sandler M. Automatic interlinking of music datasets on the semantic web. In: CEUR workshop proceedings, vol. 369. 2008.
Shulman B, Sharma A, Cosley D. Predictability of popularity: Gaps between prediction and understanding. In: Proceedings of the 10th international conference on web and social media, ICWSM 2016. 2016. pp. 348–57.
Malheiro R, Panda R, Gomes P, Paiva RP. Emotionally-relevant features for classification and regression of music lyrics. IEEE Trans Affect Comput. 2018;9:240–54. https://doi.org/10.1109/TAFFC.2016.2598569.
Choi K, Lee JH, Hu X, Downie JS. Music subject classification based on lyrics and user interpretations. Proc Assoc Inf Sci Technol. 2016;53:1–10. https://doi.org/10.1002/pra2.2016.14505301041.
Martin-Gutierrez D, Hernandez Penaloza G, Belmonte-Hernandez A, Alvarez GF. A multimodal end-to-end deep learning architecture for music popularity prediction. IEEE Access. 2020;8:39361–74. https://doi.org/10.1109/ACCESS.2020.2976033.
Diekroeger D. Can song lyrics predict genre? In: Proceedings of the 11th international symposium on computer music multidisciplinary research. 2015. pp. 457–71.
Zangerle E, Pichl M, Hupfauf B, Specht G. Can microblogs predict music charts? An analysis of the relationship between #nowplaying tweets and music charts. In: Proceedings of the 17th international society for music information retrieval conference, ISMIR 2016. 2016. pp. 365–71.
Kyauk E, Park E, Pham J. Predicting song popularity. 2016.
Nunes JC, Ordanini A, Valsesia F. The power of repetition: repetitive lyrics in a song increase processing fluency and drive market success. J Consum Psychol. 2015;25:187–99.
Ritchie D, North AC. Energy, popularity, and the circumplex: a computerized analysis of emotion in 143,353 musical pieces. Empir Stud Arts. 2017;36:127–61.
Somme SVD, Sogancioglu G, Paperno D. Popularity of music tracks based on lyrics. Utrecht University; 2021.
Sandag GA, Manueke AM. Predictive models for popularity of solo and group singers in Spotify using decision tree. In: 2020 2nd international conference on cybernetics and intelligent system, ICORIS 2020. 2020. pp. 2–6. https://doi.org/10.1109/ICORIS50180.2020.9320838.
Yee YK, Raheem M. Predicting music popularity using spotify and youtube features. Indian J Sci Technol. 2022;15:1786–99. https://doi.org/10.17485/ijst/v15i36.2332.
Capó M, Pérez A, Lozano JA. LASSO for streaming data with adaptative filtering. Stat Comput. 2023;33:1–14. https://doi.org/10.1007/s11222-022-10181-4.
De Vlaming R, Groenen PJF. The current and future use of ridge regression for prediction in quantitative genetics. Biomed Res Int. 2015. https://doi.org/10.1155/2015/143712.
Liang Y, Zhao J, Sampath Kumar D, Srinivasan D. Real-time and consistent sparse estimation of power system distribution factors using online adaptive elastic-net. Int J Electr Power Energy Syst. 2022;142: 108361. https://doi.org/10.1016/j.ijepes.2022.108361.
Minastireanu E-A, Mesnita G. Light GBM machine learning algorithm to online click fraud detection. J Inf Assur Cybersecur. 2019. https://doi.org/10.5171/2019.263928.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Diverse Applications in Computing, Analytics and Networks” guest edited by Archana Mantri and Sagar Juneja.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Arora, S., Rani, R. Soundtrack Success: Unveiling Song Popularity Patterns Using Machine Learning Implementation. SN COMPUT. SCI. 5, 278 (2024). https://doi.org/10.1007/s42979-024-02619-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-024-02619-5