[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Open Access iconOpen Access

ARTICLE

crossmark

Music Genre Classification Using DenseNet and Data Augmentation

by Dao Thi Le Thuy1, Trinh Van Loan2,*, Chu Ba Thanh3, Nguyen Hieu Cuong1

1 Faculty of Information Technology, University of Transport and Communications, Hanoi, 100000, Vietnam
2 School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi, 100000, Vietnam
3 Faculty of Information Technology, University of Technology and Education Hung Yen, 160000, Vietnam

* Corresponding Author: Trinh Van Loan. Email: email

Computer Systems Science and Engineering 2023, 47(1), 657-674. https://doi.org/10.32604/csse.2023.036858

Abstract

It can be said that the automatic classification of musical genres plays a very important role in the current digital technology world in which the creation, distribution, and enjoyment of musical works have undergone huge changes. As the number of music products increases daily and the music genres are extremely rich, storing, classifying, and searching these works manually becomes difficult, if not impossible. Automatic classification of musical genres will contribute to making this possible. The research presented in this paper proposes an appropriate deep learning model along with an effective data augmentation method to achieve high classification accuracy for music genre classification using Small Free Music Archive (FMA) data set. For Small FMA, it is more efficient to augment the data by generating an echo rather than pitch shifting. The research results show that the DenseNet121 model and data augmentation methods, such as noise addition and echo generation, have a classification accuracy of 98.97% for the Small FMA data set, while this data set lowered the sampling frequency to 16000 Hz. The classification accuracy of this study outperforms that of the majority of the previous results on the same Small FMA data set.

Keywords


Cite This Article

APA Style
Thuy, D.T.L., Loan, T.V., Thanh, C.B., Cuong, N.H. (2023). Music genre classification using densenet and data augmentation. Computer Systems Science and Engineering, 47(1), 657-674. https://doi.org/10.32604/csse.2023.036858
Vancouver Style
Thuy DTL, Loan TV, Thanh CB, Cuong NH. Music genre classification using densenet and data augmentation. Comput Syst Sci Eng. 2023;47(1):657-674 https://doi.org/10.32604/csse.2023.036858
IEEE Style
D. T. L. Thuy, T. V. Loan, C. B. Thanh, and N. H. Cuong, “Music Genre Classification Using DenseNet and Data Augmentation,” Comput. Syst. Sci. Eng., vol. 47, no. 1, pp. 657-674, 2023. https://doi.org/10.32604/csse.2023.036858



cc Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1295

    View

  • 534

    Download

  • 0

    Like

Share Link