Abstract
The work presented in this paper aims to show the effectiveness of dimensionality reduction in convolutional neural network (CNN) based vowel classification from covert/imagined speech. Imagined speech refers to phonological classes, words or sentences pronounced internally, i.e., the spontaneous imagination or active thought of speaking without any articulatory movement. It is acquired non-invasively by placing electroencephalogram (EEG) sensors over the scalp covering the cerebral cortex. Decoding such phonological classes therefore has many applications for people who are unable to speak due to locked-in syndrome or motor impairments. The present study develops a CNN-based vowel classification system by processing EEG signals representing imagined speech. In the proposed methodology, CNN features extracted from the spectrogram (time-frequency) representation of each EEG channel are subjected to dimensionality reduction using principal component analysis (PCA). The dimensionally reduced CNN features are then passed to linear discriminant analysis (LDA) for transformation and classification. A significant improvement in imagined vowel classification performance is confirmed for LDA-based classification of the dimensionality-reduced CNN feature vectors, whereas CNN-based classification alone performs only just above chance level. Variational mode decomposition (VMD) based preprocessing of the EEG channels prior to classification further improves vowel recognition. The performance is observed to be consistent across all 15 subjects in the open-access Coretto DB EEG database, in which each imagined utterance is acquired using six EEG channels.
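The following is a minimal sketch of the pipeline summarised in the abstract: per-channel spectrograms, CNN feature extraction, PCA-based dimensionality reduction, and LDA transformation and classification. The sampling rate, spectrogram settings, and the feature extractor (flattened spectrograms standing in for CNN penultimate-layer activations) are illustrative assumptions, not the authors' exact configuration.

```python
# Sketch of: spectrograms -> (CNN) features -> PCA -> LDA classification.
# Parameters and the feature extractor are assumptions, not the paper's setup.
import numpy as np
from scipy.signal import spectrogram
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

FS = 1024          # assumed sampling rate of the Coretto DB recordings
N_CHANNELS = 6     # six EEG channels per imagined utterance (from the abstract)

def channel_spectrogram(x, fs=FS, nperseg=128, noverlap=64):
    """Time-frequency (spectrogram) representation of one EEG channel."""
    _, _, Sxx = spectrogram(x, fs=fs, nperseg=nperseg, noverlap=noverlap)
    return np.log1p(Sxx)  # log compression is an assumption

def trial_features(trial, feature_extractor):
    """Concatenate features extracted from each channel's spectrogram.
    `feature_extractor` is a placeholder for the CNN; here it just flattens."""
    return np.concatenate([feature_extractor(channel_spectrogram(ch))
                           for ch in trial])

if __name__ == "__main__":
    # Synthetic stand-in data: 60 trials x 6 channels x 2 s of EEG.
    rng = np.random.default_rng(0)
    trials = rng.standard_normal((60, N_CHANNELS, 2 * FS))
    labels = np.repeat(np.arange(5), 12)    # five imagined vowels /a e i o u/

    # Flattened spectrograms as a stand-in for CNN feature vectors.
    X = np.stack([trial_features(t, np.ravel) for t in trials])

    # PCA for dimensionality reduction, then LDA for transformation/classification.
    clf = make_pipeline(PCA(n_components=20), LinearDiscriminantAnalysis())
    scores = cross_val_score(clf, X, labels, cv=5)
    print(f"5-fold accuracy: {scores.mean():.3f}")
```

In the same spirit, the VMD preprocessing mentioned in the abstract would be applied to each raw EEG channel before computing its spectrogram; the number of modes and which modes are retained would be design choices of the authors' method and are not reproduced here.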
References
Biswas, S., Sinha, R.: Lateralization of brain during EEG based covert speech classification. In: Proceedings of 15th IEEE India Council International Conference (INDICON), pp. 1–5 (2018)
Biswas, S., Sinha, R.: Wavelet filterbank-based EEG rhythm-specific spatial features for covert speech classification. IET Sig. Process. 16(1), 92–105 (2022)
Cooney, C., Folli, R., Coyle, D.: Optimizing layers improves CNN generalization and transfer learning for imagined speech decoding from EEG. In: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC), pp. 1311–1316 (2019)
Coretto, G.A.P., Gareis, I.E., Rufiner, H.L.: Open access database of EEG signals recorded during imagined speech. In: Proceedings of the 12th International Symposium on Medical Information Processing and Analysis, vol. 10160, p. 1016002 (2017)
Dragomiretskiy, K., Zosso, D.: Variational mode decomposition. IEEE Trans. Sig. Process. 62(3), 531–544 (2014)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, New York (2001)
Govind, D., Pravena, D., Ajay, S.G.: Improved epoch extraction using variational mode decomposition based spectral smoothing of zero frequency filtered emotive speech signals. In: Proceedings of the National Conference on Communications (NCC) (2018)
James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning with Applications in R. Springer, New York (2014)
Lawhern, V.J., Solon, A.J., Waytowich, N.R., Gordon, S.M., Hung, C.P., Lance, B.J.: EEGNet: a compact convolutional neural network for EEG-based brain-computer interfaces. J. Neural Eng. 15(5) (2018)
LeCun, Y., Huang, F.J., Bottou, L.: Learning methods for generic object recognition with invariance to pose and lighting. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (2004)
LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional neural networks and applications in vision. In: Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS) (2010)
Lee, D.Y., Lee, M., Lee, S.W.: Decoding imagined speech based on deep metric learning for intuitive BCI communication. IEEE Trans. Neural Syst. Rehab. Eng. 29, 1363–1374 (2021)
Nicolas-Alonso, L.F., Gomez-Gil, J.: Brain computer interfaces, a review. Sensors 12(2), 1211–1279 (2012)
Panachakel, J.T., Ramakrishnan, A.: Decoding covert speech from EEG - a comprehensive review. Front. Neurosci. (2021). https://doi.org/10.3389/fnins.2021.642251
Panachakel, J.T., Ramakrishnan, A., Ananthapadmanabha, T.: Decoding imagined speech using wavelet features and deep neural networks. In: Proceedings of the IEEE 16th India Council International Conference (INDICON), pp. 1–4 (2019)
Pankaj, D., Govind, D., Narayanankutty, K.A.: A novel method for removing Rician noise from MRI based on variational mode decomposition. Biomed. Sig. Process. Control 69 (2021)
Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. PTR Prentice Hall, Englewood Cliffs, NJ (1993)
Zhao, S., Rudzicz, F.: Classifying phonological categories in imagined and articulated speech. In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)
Acknowledgments
The present work is funded by the ongoing NLTM-BHASHINI consortium project titled “Speech technologies in Indian languages: speech quality control”, funded by MeitY, Govt. of India. The authors were motivated to work on the decoding of imagined speech by the Global Initiative for Academic Networks (GIAN) course on Cognitive Speech Processing conducted by Prof. H. L. Rufiner, Research Institute for Signals, Systems and Computational Intelligence, National University of Litoral (UNL), Santa Fe, Argentina, and Prof. S. R. Mahadeva Prasanna, Department of Electrical Engineering, IIT Dharwad, India. The GIAN course was organized in April 2022.
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Banerjee, O., Govind, D., Dubey, A.K., Gangashetty, S.V. (2022). Significance of Dimensionality Reduction in CNN-Based Vowel Classification from Imagined Speech Using Electroencephalogram Signals. In: Prasanna, S.R.M., Karpov, A., Samudravijaya, K., Agrawal, S.S. (eds) Speech and Computer. SPECOM 2022. Lecture Notes in Computer Science, vol. 13721. Springer, Cham. https://doi.org/10.1007/978-3-031-20980-2_5
DOI: https://doi.org/10.1007/978-3-031-20980-2_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20979-6
Online ISBN: 978-3-031-20980-2