[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3302425.3302450acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacaiConference Proceedingsconference-collections
research-article

Pathological Voice Classification Based on Wavelet Packet Multiscale Analysis

Published: 21 December 2018 Publication History

Abstract

Vocal fold paralysis is a common type of laryngeal diseases, and it is fundamental different from vocal fold non-paralysis (including vocal fold nodules, polyps, etc.). Based on the differences between the various of laryngeal diseases, a classification algorithm for pathological voice is proposed. In which the wavelet packet transformation and multi-scale analysis are used. Firstly, the sub-bands of different frequency signals are obtained using wavelet packet decomposition, and then the nonlinear features i.e. the Hurst parameter, 2-Rényi entropy, Box-counting dimension and attractor are extracted from different frequency bands. These features are used to evaluate the contribution of each frequency band in detecting and classifying pathological voices. The experimental data are derived from the Massachusetts Eye and Ear Infirmary (MEEI) database and the Saarbrucken Voice Database (SVD). The experimental results show that when using the support vector machine to classify the Hurst parameter and the 2-Rényi entropy combined features, the method used in this paper can achieve good classification results when classifying normal, paralysis and non-paralysis these three kinds of voices on those two databases. The average classification accuracies on these two databases are 98.37% and 92.83% respectively.

References

[1]
M. Markaki and Y. Stylianou, "Voice pathology detection and discrimination based on modulation spectral features," IEEE Trans. Audio, Speech Lang. Process., vol. 19, no. 7, pp. 1938--1948, 2011.
[2]
L. A. Forero M., M. Kohler, M. M. B. R. Vellasco, and E. Cataldo, "Analysis and Classification of Voice Pathologies Using Glottal Signal Parameters," J. Voice, vol. 30, no. 5, pp. 549--556, 2016.
[3]
G. Muhammad et al., "Voice pathology detection using interlaced derivative pattern on glottal source excitation," Biomed. Signal Process. Control, vol. 31, pp. 156--164, 2017.
[4]
M. Dahmani and M. Guerti, "Vocal folds pathologies classification using Naïve Bayes Networks," 2017 6th Int. Conf. Syst. Control. ICSC 2017, pp. 426--432, 2017.
[5]
P. Henríquez, J. B. Alonso, M. A. Ferrer, C. M. Travieso, J. I. Godino-Llorente, and F. Díaz-de-María, "Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics," IEEE Trans. Audio, Speech Lang. Process., vol. 17, no. 6, pp. 1186--1195, 2009.
[6]
G. Muhammad and M. Melhem, "Biomedical Signal Processing and Control Pathological voice detection and binary classification using MPEG-7 audio features," Biomed. Signal Process. Control, vol. 11, pp. 1--9, 2014.
[7]
X. Zhang, Z. Tao, and H. Zhao, "Pathological Voice Recognition by Deep Neural Network," 2017 4th Int. Conf. Syst. Informatics, no. ICSAI, pp. 464--468, 2017.
[8]
V. Majidnezhad and I. Kheidorov, "A novel Method for Feature Extraction in vocal fold pathology diagnosis," Res. J. Appl. Sci. Eng. Technol., vol. 5, no. 6, pp. 2245--2254, 2013.
[9]
E. S. Fonseca, R. C. Guido, A. C. Silvestre, and J. C. Pereira, "Discrete Wavelet Transform and Support Vector Machine Applied to Pathological Voice Signals Identification," in IEEE International Symposium on Multimedia, 2005, p. 5 pp.
[10]
X. Ye, X. Xia, J. Zhang, and Y. Chen, "Effects of trends and seasonalities on robustness of the Hurst parameter estimators," Iet Signal Process., vol. 6, no. 9, pp. 849--856, 2013.
[11]
M. A. Little, P. E. McSharry, S. J. Roberts, D. A. E. Costello, and I. M. Moroz, "Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection," Biomed. Eng. Online, vol. 6, pp. 1--19, 2007.
[12]
Massachusetts Eye and Ear Infirmary, Voice Disorders Database, Version 1.03. Kay Elemetrics Corp., Lincoln Park, NJ, 1994, CD-ROM.
[13]
W. J. Barry and M. Pützer. Saarbrucken voice database. Institute of Phonet- ics, University of Saarland. Accessed: Mar. 10, 2017.{Online}.Available:http://www.stimmdatenbank.coli.uni-saarland.de/.
[14]
I. Ullah, M. Hussain, E. ul H. Qazi, and H. Aboalsamh, "An automated system for epilepsy detection using EEG brain signals based on deep learning approach," Expert Syst. Appl., vol. 107, pp. 61--71, 2018.

Cited By

View all
  • (2020)Combined Sustained Vowels Improve the Performance of the Haar Wavelet for Pathological Voice Characterization2020 International Conference on Systems, Signals and Image Processing (IWSSIP)10.1109/IWSSIP48289.2020.9145258(381-386)Online publication date: Jul-2020

Index Terms

  1. Pathological Voice Classification Based on Wavelet Packet Multiscale Analysis

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ACAI '18: Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence
    December 2018
    460 pages
    ISBN:9781450366250
    DOI:10.1145/3302425
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • The Hong Kong Polytechnic: The Hong Kong Polytechnic University
    • City University of Hong Kong: City University of Hong Kong

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 December 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Support Vector Machine (SVM)
    2. nonlinear feature
    3. vocal fold paralysis
    4. wavelet packet transform

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ACAI 2018

    Acceptance Rates

    ACAI '18 Paper Acceptance Rate 76 of 192 submissions, 40%;
    Overall Acceptance Rate 173 of 395 submissions, 44%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Combined Sustained Vowels Improve the Performance of the Haar Wavelet for Pathological Voice Characterization2020 International Conference on Systems, Signals and Image Processing (IWSSIP)10.1109/IWSSIP48289.2020.9145258(381-386)Online publication date: Jul-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media