Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification

S. A. Moreno-Acevedo^10,12,
C. D. Rios-Urrego¹⁰,
J. C. Vásquez-Correa¹²,
J. Rusz¹¹,
E. Nöth¹³ &
…
J. R. Orozco-Arroyave^10,13

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14102))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

626 Accesses
1 Altmetric

Abstract

Speech traits have enabled the evaluation and monitoring of the neurological state of different disorders, including Parkinson’s Disease (PD) using classical and deep approaches. Considering that speech contains paralinguistic information, the native language of the speaker influences the performance of the trained models when classifying the presence of the disease. Although researchers have performed several studies using corpora from different acoustic and language conditions, there is no baseline for the accuracy of a system to classify PD in cross-language scenarios. This study evaluates the generalization capability of different classical and deep methods to discriminate between PD patients and healthy speakers. The experiments are performed in cross-language scenarios. In particular, an Active Learning (AL) strategy is considered to evaluate the influence of the training data selection to improve the model’s performance under cross-language settings. The results indicate that models based on Wav2Vec 2.0 yielded the best results in detecting the presence of the disease in such non-controlled cross-language scenarios. In addition, the AL selection outperformed the results compared to a random selection of training samples. The considered AL based-approach allows to achieve high accuracies using a careful selection of training data in an adaptively manner. This is particularly important when dealing with non-annotated and limited data, such as the case of pathological speech modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Is There Any Additional Information in a Neural Network Trained for Pathological Speech Classification?

On the Use of a Foundation Acoustic Model to Identify Highly Relevant Phonetic Information of Parkinson’s Speech

Automatic speech-based assessment to discriminate Parkinson’s disease from essential tremor with a cross-language approach

Article Open access 17 February 2024

Notes

1.
https://github.com/jcvasquezc/DisVoice/tree/master/disvoice/articulation.

References

Abdelwahab, M., Busso, C.: Active learning for speech emotion recognition using deep neural network. In: Proceedings of ACII, pp. 1–7. IEEE (2019)
Google Scholar
Baevski, A., et al.: wav2vec 2.0: a framework for self-supervised learning of speech representations. In: Advances in Neural Information Processing Systems, vol. 33, pp. 12449–12460 (2020)
Google Scholar
Bocklet, T., et al.: Detection of persons with Parkinson’s disease by acoustic, vocal, and prosodic analysis. In: Proceedings of ASRU, pp. 478–483 (2011)
Google Scholar
Bocklet, T., et al.: Automatic evaluation of Parkinson’s speech-acoustic, prosodic and voice related cues. In: Proceedings of INTERSPEECH, pp. 1149–1153 (2013)
Google Scholar
El Maachi, I., et al.: Deep 1d-convnet for accurate Parkinson disease detection and severity prediction from gait. Expert Syst. Appl. 143, 113075 (2020)
Google Scholar
Goetz, C.G., et al.: Movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23(15), 2129–2170 (2008)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38
Chapter Google Scholar
Jankovic, J.: Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79(4), 368–376 (2008)
Article Google Scholar
Karan, B., Sekhar, S., Orozco-Arroyave, J.R.: Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson’s disease prediction. Comput. Speech Lang. 69, 1–17 (2021)
Article Google Scholar
Kim, D., Kang, P.: Cross-modal distillation with audio-text fusion for fine-grained emotion classification using BERT and wav2vec 2.0. Neurocomputing 506, 168–183 (2022)
Article Google Scholar
Makiuchi, M.R., et al.: Multimodal emotion recognition with high-level speech and text features. In: Proceedings of ASRU, pp. 350–357. IEEE (2021)
Google Scholar
Malhotra, K., et al.: Active learning methods for low resource end-to-end speech recognition. In: Proceeding of INTERSPEECH, pp. 2215–2219 (2019)
Google Scholar
Mallela, J., et al.: Voice based classification of patients with amyotrophic lateral sclerosis, Parkinson’s disease and healthy controls with CNN-LSTM using transfer learning. In: Proceedings of ICASSP, pp. 6784–6788. IEEE (2020)
Google Scholar
Orozco-Arroyave, J.R.: Analysis of Speech of People with Parkinson’s Disease. Logos Verlag Berlin GmbH (2015)
Google Scholar
Orozco-Arroyave, J.R., et al.: New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease. In: Proceedings of LREC, pp. 342–347 (2014)
Google Scholar
Ozbolt, A.S., et al.: Things to consider when automatically detecting Parkinson’s disease using the phonation of sustained vowels: analysis of methodological issues. Appl. Sci. 12(3), 991 (2022)
Article Google Scholar
Quan, C., et al.: A deep learning based method for Parkinson’s disease detection using dynamic features of speech. IEEE Access 9, 10239–10252 (2021)
Article Google Scholar
Rios-Urrego, C.D., Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Nöth, E.: Is there any additional information in a neural network trained for pathological speech classification? In: Ekštein, K., Pártl, F., Konopík, M. (eds.) TSD 2021. LNCS (LNAI), vol. 12848, pp. 435–447. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-83527-9_37
Chapter Google Scholar
Rusz, J.: Detecting speech disorders in early Parkinson’s disease by acoustic analysis. Habilitation thesis, Czech Technical University in Prague (2018)
Google Scholar
Rusz, J., et al.: Objective acoustic quantification of phonatory dysfunction in Huntington’s disease. PLoS ONE 8(6), e65881 (2013)
Google Scholar
Rusz, J., et al.: Characteristics and occurrence of speech impairment in Huntington’s disease: possible influence of antipsychotic medication. J. Neural Transm. 121(12), 1529–1539 (2014)
Article Google Scholar
Sakar, B.E., et al.: Collection and analysis of a Parkinson speech dataset with multiple types of sound recordings. IEEE J. Biomed. Health Inform. 17(4), 828–834 (2013)
Article Google Scholar
Settles, B.: Uncertainty sampling, pp. 11–20 (2012)
Google Scholar
Spencer, K.A., Rogers, M.A.: Speech motor programming in hypokinetic and ataxic dysarthria. Brain Lang. 94(3), 347–366 (2005)
Article Google Scholar
Vásquez-Correa, J.C., et al.: Towards an automatic evaluation of the dysarthria level of patients with Parkinson’s disease. J. Commun. Disord. 76, 21–36 (2018)
Article Google Scholar
Vasquez-Correa, J.C., et al.: End-2-end modeling of speech and gait from patients with Parkinson’s disease: comparison between high quality vs. smartphone data. In: Proceedings of ICASSP, pp. 7298–7302. IEEE (2021)
Google Scholar
Vásquez-Correa, J.C., et al.: Transfer learning helps to improve the accuracy to classify patients with different speech disorders in different languages. Pattern Recogn. Lett. 150, 272–279 (2021)
Article Google Scholar

Download references

Author information

Authors and Affiliations

GITA Lab., Universidad de Antioquia UdeA, Medellín, Colombia
S. A. Moreno-Acevedo, C. D. Rios-Urrego & J. R. Orozco-Arroyave
Department of Circuit Theory, Czech Technical University in Prague, Prague, Czech Republic
J. Rusz
Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain
S. A. Moreno-Acevedo & J. C. Vásquez-Correa
LME Lab., Friedrich-Alexander Universität, Erlangen-Nürnberg, Germany
E. Nöth & J. R. Orozco-Arroyave

Authors

S. A. Moreno-Acevedo
View author publications
You can also search for this author in PubMed Google Scholar
C. D. Rios-Urrego
View author publications
You can also search for this author in PubMed Google Scholar
J. C. Vásquez-Correa
View author publications
You can also search for this author in PubMed Google Scholar
J. Rusz
View author publications
You can also search for this author in PubMed Google Scholar
E. Nöth
View author publications
You can also search for this author in PubMed Google Scholar
J. R. Orozco-Arroyave
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. A. Moreno-Acevedo .

Editor information

Editors and Affiliations

University of West Bohemia, Pilsen, Czech Republic
Kamil Ekštein
University of West Bohemia, Pilsen, Czech Republic
František Pártl
University of West Bohemia, Pilsen, Czech Republic
Miloslav Konopík

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Moreno-Acevedo, S.A., Rios-Urrego, C.D., Vásquez-Correa, J.C., Rusz, J., Nöth, E., Orozco-Arroyave, J.R. (2023). Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification. In: Ekštein, K., Pártl, F., Konopík, M. (eds) Text, Speech, and Dialogue. TSD 2023. Lecture Notes in Computer Science(), vol 14102. Springer, Cham. https://doi.org/10.1007/978-3-031-40498-6_31

Download citation

DOI: https://doi.org/10.1007/978-3-031-40498-6_31
Published: 23 August 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40497-9
Online ISBN: 978-3-031-40498-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics