Metastable Substructure Embedding and Robust Classification of Multichannel EEG Data Using Spectral Graph Kernels †
<p>The proposed workflow.</p> "> Figure 2
<p>Plots show the optimal clusters of eigenvectors for a synthetic dataset: (<b>a</b>) synthetic dataset with 5 clusters, (<b>b</b>) optimal clustering structure for top-2 eigenvectors, (<b>c</b>) silhouette scores, (<b>d</b>) SSE distance scores.</p> "> Figure 3
<p>Plots show the existence of connectivity substructures (clusters) for sample MIT CHB Epileptiform EEG data. (<b>a</b>) Heatmap of connectivity matrix (>0.5) for preictal-state sample; (<b>b</b>) inherent clustering structure for the preictal-state sample; (<b>c</b>) heatmap of connectivity matrix (>0.5) for ictal-state sample; (<b>d</b>) inherent clustering structure for the ictal-state sample.</p> "> Figure 3 Cont.
<p>Plots show the existence of connectivity substructures (clusters) for sample MIT CHB Epileptiform EEG data. (<b>a</b>) Heatmap of connectivity matrix (>0.5) for preictal-state sample; (<b>b</b>) inherent clustering structure for the preictal-state sample; (<b>c</b>) heatmap of connectivity matrix (>0.5) for ictal-state sample; (<b>d</b>) inherent clustering structure for the ictal-state sample.</p> "> Figure 4
<p>Plots show the existence of connectivity substructures (clusters) for sample cognitive load EEG-ERPs. (<b>a</b>) Heatmap of the correlation matrix (>0.5) for idle-state sample; (<b>b</b>) inherent clustering structure for the idle-state sample; (<b>c</b>) heatmap of the correlation matrix (>0.5) for d1B-state sample; (<b>d</b>) inherent clustering structure for the d1B-state sample.</p> "> Figure 5
<p>Plots (<b>a</b>–<b>c</b>) show the heatmap of the Gram matrices produced by the WL kernel, the optimal clusters of top-2 eigenvectors, and the corresponding silhouette scores for 30 subsequent preictal-state samples. Plots (<b>d</b>–<b>f</b>) show the plots for 30 subsequent ictal-state samples.</p> "> Figure 6
<p>Performance comparison regarding accuracy, AUC, precision, recall, and F1-score with and without cost-sensitive learning for different workflows comprising different combinations of connectivity metrics, graph kernels, and classifiers: Plot (<b>a</b>) shows the combinations with mPLV; Plot. (<b>b</b>) shows combinations with correlation coefficient.</p> ">
Abstract
:1. Introduction
2. Materials and Methods
- 1.
- EEG Preprocessing and Graph Construction
- 2.
- Feature Extraction Using Koopman Spectral Operators
- 3.
- Ensemble Classifiers
- 4.
- Cost-Sensitive Learning
- 5.
- Evaluation Metrics
Novel Contributions
Algorithm 1: Proposed Workflow |
|
Algorithm 2: Algorithm Outline for Graph Koopman Kernel Embedding |
|
Algorithm 3: DMD Algorithm for Koopman operator Approximation |
|
3. Results
3.1. Determining Optimal No. of Clusters
3.2. Connectivity Substructures
3.3. Metastable Substructures Estimation by Graph-Koopman-Kernels
3.4. Classifier Performance
4. Discussion
- Interpretation and Implications of Metacluster Substructures for Discriminating Spatial Temporal Patterns
- Seizure prediction: The appearance of expanding high-intensity subclusters in preictal states provides early indicators of impending seizures.
- Real-time EEG monitoring: Tracking changes in Gram matrix patterns can enable adaptive brain–computer interfaces (BCIs) to respond dynamically to user cognitive states.
- Cognitive workload analysis: Monitoring synchronisation dynamics can inform fatigue detection systems in high-performance work environments.
- Strengths of the Proposed Framework
- Efficacy of Graph Kernels in Capturing Metastable Connectivity Substructures
- Integration of Koopman Operators for Linearizing Nonlinear Dynamics By leveraging Koopman spectral operators, this framework provides a linear representation of EEG’s inherently nonlinear dynamics. This enables spectral decomposition into metastable states, offering interpretable features for classification. The eigenfunctions derived from the Koopman operator reveal subtle temporal transitions in brain connectivity that are otherwise challenging to detect with traditional methods. This study is one of the first to apply Koopman-based embeddings to EEG data in combination with graph kernels, marking a significant advancement in the field.
- Robust Classification with Cost-Sensitive Learning The integration of cost-sensitive learning (CSL) ensures that the framework effectively handles class imbalance, a common issue in EEG datasets where critical events such as seizures are underrepresented. The results demonstrate that incorporating CSL significantly improved the F1-scores for minority classes, as shown in Table 2. For example, the mPLV-WL-DT workflow achieved an F1-score of 91% with CSL compared to 88.9% without CSL, highlighting its efficacy in addressing imbalance while maintaining high classification accuracy.
- Explainability of Features The framework provides explainable insights into the metastable substructures of brain connectivity, which are visualised through eigenfunctions and clustering structures. This addresses a critical gap in the literature, as many existing methods focus solely on accuracy metrics without delving into the interpretability of the features derived. By visualizing the connectivity substructures (Figure 3) and eigenvector clusters (Figure 4), this study enhances the transparency of the classification process and contributes to a deeper understanding of EEG connectivity dynamics.
- Novelty of the Findings
- First-of-Its-Kind Application of Graph Koopman Kernel Embeddings While graph kernels and Koopman operators have been applied independently in other domains, their combined application for EEG classification is novel. This integration enables the robust estimation of latent metastable structures, bridging the gap between theoretical advancements in dynamical systems and practical EEG data analysis.
- Comprehensive Evaluation of Graph Kernels for EEG Connectivity Unlike prior studies that utilise graph kernels in a limited scope, this work systematically evaluates the performance of multiple kernels across different workflows. The demonstrated superiority of the WL kernel in terms of runtime and accuracy offers clear guidance for future studies aiming to analyse large-scale EEG datasets efficiently.
- Improved Classification Performance The mPLV-WL-DT workflow achieved the highest performance metrics (accuracy: 91.7%, F1-score: 91%) among all tested combinations, outperforming alternative methods such as Random Forest (RF) and Support Vector Classifier (SVC). These results underscore the effectiveness of the proposed framework in achieving reliable classification, even with challenging imbalanced datasets.
- Bridging the Gap Between Graph-Theoretical Models and EEG Clinical Applications By leveraging advanced graph-theoretical models to estimate connectivity substructures, the proposed framework contributes to both neuroscience research and clinical applications. The ability to classify EEG states with high precision and interpretability has implications for cognitive state monitoring, seizure prediction, and brain–computer interface (BCI) development.
5. Conclusions
- Implications for Future Research
- Dataset generalisation: While the framework was tested on epilepsy and cognitive workload datasets, its applicability to other EEG datasets remains an area for exploration. Future studies can extend this work to include datasets related to mental health, neurodegenerative diseases, or motor imagery tasks.
- Incorporation of deep learning models: Although ensemble classifiers such as Decision Trees and Random Forests performed well, integrating Koopman embeddings with deep learning architectures, such as Convolutional Neural Networks (CNNs) or Long Short-Term Memory (LSTM) networks, may further improve classification performance.
- Refinement of kernels for EEG dynamics: The development of specialised graph kernels tailored to EEG dynamics, potentially incorporating additional spatial, temporal, and spectral features, could enhance the framework’s robustness and generalisability.
- Real-time applications: The low computational complexity of the WL kernel makes this framework suitable for real-time applications, such as seizure prediction or cognitive load monitoring in adaptive learning environments.
- Limitations and Challenges
- Dependency on connectivity measures: The choice of connectivity measure (e.g., mPLV, correlation) influences the construction of graphs, and alternative measures could yield different results. Further studies are required to determine the optimal connectivity metric for specific EEG tasks.
- Scalability to large datasets: Although the WL kernel is computationally efficient, the scalability of the framework to larger EEG datasets with higher temporal resolution remains to be evaluated.
- Subject-specific variability: EEG signals are highly variable across subjects, which may affect the generalisability of the findings. Incorporating subject-specific normalisation or transfer learning techniques could address this issue.
- Generalizability: Investigation with more EEG datasets is required to establish generalizability of the workflow.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
MDPI | Multidisciplinary Digital Publishing Institute |
DOAJ | Directory of open-access journals |
GKKE | Graph Koopman kernel embedding |
EEG | Electroencephalography |
SNR | Signal-to-noise ratio |
DMD | Dynamic Mode Decomposition |
EMD | Empirical Mode Decomposition |
LSTM | Long Short-Term Memory |
MLP | Multilayer Perceptron |
SVD | Singular Value Decomposition |
CSL | Cost-sensitive learning |
CNN | Convolutional Neural Networks |
ROC | Receiver-operating characteristic curve |
kernel CCA | Kernel canonical correlation analysis |
mPLV | Mean-phase locking value |
WL | Weisfeiler–Lehman Kernel |
SD | Spectral decomposition kernel |
RW | Random Walk kernel |
DT | Decision Tree Classifier |
RF | Random Forest Classifier |
SVC | Support Vector Classifier |
SVM | Support Vector Machine |
AUC | Area under the curve |
fMRI | Functional magnetic resonance imaging |
SSE | Sum of squared error |
BCI | Brain–computer interface |
EDF | European Data Format |
References
- World Health Organization. Intersectoral Global Action Plan on Epilepsy and Other Neurological Disorders 2022–2031; World Health Organization: Geneva, Switzerland, 2023; ISBN 978-92-4-007662-4 (electronic version); ISBN 978-92-4-007663-1 (print version). [Google Scholar]
- Pathak, V.; Dhyani, P.; Mahanti, P.; Gaur, V. Tracking Slow Modulations of Effective Connectivity for Early Epilepsy Detection: A Review. In Knowledge Modelling and Big Data Analytics in Healthcare; CRC Press: Boca Raton, FL, USA, 2021; pp. 229–255, eBook ISBN 9781003142751. [Google Scholar]
- Capouskova, K.; Kringelbach, M.L.; Deco, G. Modes of cognition Evidence from metastable brain dynamics. Neuroimage 2022, 260, 119489. [Google Scholar] [CrossRef] [PubMed]
- Lusch, B.; Kutz, J.N.; Brunton, S.L. Deep learning for universal linear embeddings of nonlinear dynamics. Nat. Commun. 2016, 9, 4950. [Google Scholar] [CrossRef] [PubMed]
- Peng, Y.; Qiu, T.; Wei, L. An approach to extracting graph kernel features from functional brain networks and its applications to analysing noisy EEG signals. Biomed. Signal Process. Control 2023, 80, 104269. [Google Scholar] [CrossRef]
- Saichand, N.V.; Naik, S.G. Epileptic seizure detection using novel Multilayer LSTM Discriminant Network and dynamic mode Koopman decomposition. Biomed. Signal Process. Control 2021, 68, 102723. [Google Scholar] [CrossRef]
- Giannakis, D. Delay-coordinate maps, coherence, and approximate spectra of evolution operators. Res. Math. Sci. 2021, 8, 8. [Google Scholar] [CrossRef]
- Zhang, W.; Yu, Y.-C.; Li, J.-S. Dynamics reconstruction and classification via Koopman features. Data Min. Knowl. Discov. 2019, 33, 1710–1735. [Google Scholar] [CrossRef] [PubMed]
- Zhang, C.; Zuazua, E. A quantitative analysis of Koopman operator methods for system identification and predictions. Comptes Rendus Mécanique 2023, 351, 721–751. [Google Scholar] [CrossRef]
- Stam, C.J.; Reijneveld, J.C. Graph theoretical analysis of complex networks in the brain. Nonlinear Biomed. Phys. 2007, 5, 3. [Google Scholar] [CrossRef] [PubMed]
- Bullmore, E.; Sporns, O. Complex brain networks: Graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 2009, 10, 186–198. [Google Scholar] [CrossRef] [PubMed]
- Vecchio, F.; Miraglia, F.; Marra, C.; Quaranta, D.; Vita, M.G.; Bramanti, P.; Rossini, P.M. Human brain networks in cognitive decline: A graph theoretical analysis of cortical connectivity from EEG data. J. Alzheimer’s Dis. 2014, 41, 113–127. [Google Scholar] [CrossRef] [PubMed]
- Lehnertz, K.; Geier, C.; Rings, T.; Stahn, K. Capturing time-varying brain dynamics. EPJ Nonlinear Biomed. Phys. 2017, 5, 2. [Google Scholar] [CrossRef]
- Rubinov, M.; Sporns, O. Complex measures of brain connectivity: Uses and interpretations. Neuroimage 2010, 52, 1059–1069. [Google Scholar] [CrossRef]
- Jia, L.; Gaüzère, B.; Honeine, P. Graph kernels based on linear patterns: Theoretical and experimental comparisons. Expert Syst. Appl. 2022, 189, 116095. [Google Scholar] [CrossRef]
- Shervashidze, N.; Schweitzer, P.; Leeuwen, E.J. van, Mehlhorn, K.; Borgwardt, K.M. Weisfeiler-Lehman graph kernels. J. Mach. Learn. Res. 2011, 12, 2539–2561. [Google Scholar]
- Yehuda, Y.; Freedman, D.; Radinsky, K. Self-supervised classification of clinical multivariate time series using time series dynamics. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA, 6–10 August 2023; pp. 5416–5427. [Google Scholar] [CrossRef]
- Qian, S.; Chou, C.-A. A Koopman-operator-theoretical approach for anomaly recognition and detection of multi-variate EEG system. Biomed. Signal Process. Control 2021, 69, 102911. [Google Scholar] [CrossRef]
- Gallos, I.K.; Lehmberg, D.; Dietrich, F.; Siettos, C. Data-driven modelling of brain activity using neural networks, diffusion maps, and the Koopman operator. Chaos Interdiscip. J. Nonlinear Sci. 2024, 34, 013151. [Google Scholar] [CrossRef] [PubMed]
- Clainche, S.L.; Vega, J.M. Spatio-temporal Koopman decomposition. J. Nonlinear Sci. 2018, 28, 1793–1842. [Google Scholar] [CrossRef]
- Liang, Z.; Luo, Z.; Liu, K.; Qiu, J.; Liu, Q. Online learning koopman operator for closed-loop electrical neurostimulation in epilepsy. IEEE J. Biomed. Health Inform. 2023, 27, 492–503. [Google Scholar] [CrossRef] [PubMed]
- Muralinath, R.N.; Pathak, V.; Mahanti, P.K. Optimizing Spectral Graph Kernels for Time-Delay Embedding of Multichannel EEG Data. In Proceedings of the 2024 8th International Conference on Algorithms, Computing and Systems, Hong Kong, China, 11–13 October 2024; pp. 62–67. [Google Scholar]
- Kriege, N.M.; Morris, C.; Rey, A.; Sohler, C. A property testing framework for the theoretical expressivity of graph kernels. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, Stockholm Sweden, 13–19 July 2018; pp. 2348–2354. [Google Scholar] [CrossRef]
- Pathak, V.; Gaur, V.; Mahanti, P.K.; Chandra, S. Exploring the efficacy of explainable deep learning in identifying neuromarkers for precise prediction of epilepsy and causal connectivity analysis. In Proceedings of the 15th International Conference on Computer Modeling and Simulation, Dalian, China, 16–18 June 2023; pp. 126–132. [Google Scholar] [CrossRef]
- Li, Q.; Dietrich, F.; Bollt, E.M.; Kevrekidis, I.G. Extended dynamic mode decomposition with dictionary learning: A data-driven adaptive spectral decomposition of the Koopman operator. Chaos 2017, 27, 103111. [Google Scholar] [CrossRef]
- Takeishi, N.; Kawahara, Y.; Yairi, T. Learning Koopman invariant subspaces for dynamic mode decomposition. In Advances in Neural Information Processing Systems; Guyon, I., Von Luxburg, U., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: New York, NY, USA, 2017; pp. 1130–1140. ISBN 9781510860964. Available online: https://proceedings.neurips.cc/paper/2017/file/3a835d3215755c435ef4fe9965a3f2a0-Paper.pdf (accessed on 6 February 2025).
- Yeung, E.; Kundu, S.; Hodas, N. Learning deep neural network representations for Koopman operators of nonlinear dynamical systems. In Proceedings of the 2019 American Control Conference (ACC), Philadelphia, PA, USA, 10–12 July 2017. [Google Scholar]
- Klus, S.; Husic, B.E.; Mollenhauer, M.; Noé, F. Kernel methods for detecting coherent structures in dynamical data. Chaos Interdiscip. J. Nonlinear Sci. 2019, 29, 123112. [Google Scholar] [CrossRef] [PubMed]
- Melnyk, K.; Klus, S.; Montavon, G.; Conrad, T.O.F. GraphKKE: Graph Kernel Koopman embedding for human microbiome analysis. Appl. Netw. Sci. 2020, 5, 96. [Google Scholar] [CrossRef]
- Cabral, J.; Castaldo, F.; Vohryzek, J.; Litvak, V.; Bick, C.; Lambiotte, R.; Friston, K.; Kringelbach, M.L.; Deco, G. Metastable oscillatory modes emerge from synchronisation in the brain spacetime connectome. Commun. Phys. 2022, 5, 184. [Google Scholar] [CrossRef] [PubMed]
- Ortiz, M.; de la Ossa, L.; Juan, J.; Iáñez, E.; Torricelli, D.; Tornero, J.; Azorín, J.M. An EEG database for the cognitive assessment of motor imagery during walking with a lower-limb exoskeleton. Sci. Data 2023, 10, 343. [Google Scholar] [CrossRef] [PubMed]
Without Cost-Sensitive Learning | With Cost-Sensitive Learning | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Methodology + Kernel + Classifier | Accuracy | AUC | Precision | Recall | F1 | Accuracy | AUC | Precision | Recall | F1 |
mPLV-WL-DT | 0.9167 | 0.90 | 1 | 0.8 | 0.8889 | 0.9167 | 0.9 | 0.9271 | 0.9167 | 0.9148 |
mPLV-WL-RF | 0.8333 | 0.9714 | 0.8 | 0.8 | 0.8 | 0.8333 | 0.9571 | 0.8333 | 0.8333 | 0.8333 |
mPLV-WL-SVC | 0.75 | 0.7286 | 0.75 | 0.60 | 0.6667 | 0.4167 | 0.5 | 0.1736 | 0.4167 | 0.2451 |
mPLV-SD-DT | 0.5833 | 0.5857 | 0.5 | 0.6 | 0.5455 | 0.6667 | 0.6571 | 0.6667 | 0.6667 | 0.6667 |
mPLV-SD-RF | 0.4167 | 0.4286 | 0.3333 | 0.4 | 0.3636 | 0.3333 | 0.3286 | 0.3333 | 0.3333 | 0.3333 |
mPLV-SD-SVC | 0.5833 | 0.5571 | 0.5 | 0.4 | 0.4444 | 0.5833 | 0.5571 | 0.5729 | 0.5833 | 0.5741 |
mPLV-RW-DT | 0.9167 | 0.9286 | 0.8333 | 1 | 0.9091 | 0.8333 | 0.8286 | 0.8333 | 0.8333 | 0.8333 |
mPLV-RW-RF | 0.75 | 0.9143 | 0.75 | 0.6 | 0.6667 | 0.6667 | 0.8286 | 0.6667 | 0.6667 | 0.6667 |
mPLV-RW-SVC | 0.5833 | 0.5571 | 0.50 | 0.4 | 0.4444 | 0.5833 | 0.5571 | 0.5729 | 0.5833 | 0.5833 |
corrCoff-WL-DT | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.25 | 0.5 | 0.0625 | 0.25 | 0.1 |
corrCoff-WL-RF | 0.4162 | 0.5 | 0.4167 | 1 | 0.5882 | 0.25 | 0.5 | 0.0625 | 0.25 | 0.1 |
corrCoff-WL-SVC | 0.4162 | 0.5 | 0.4167 | 1 | 0.5882 | 0.25 | 0.5 | 0.0625 | 0.25 | 0.1 |
corrCoff-SD-DT | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.5 | 0.5 | 0.5556 | 0.5 | 0.5143 |
corrCoff-SD-RF | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.4167 | 0.4062 | 0.5556 | 0.4167 | 0.3963 |
corrCoff-SD-SVC | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.3333 | 0.5 | 0.1111 | 0.3333 | 0.1667 |
corrCoff-RW-DT | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.3333 | 0.5 | 0.1111 | 0.3333 | 0.1667 |
corrCoff-RW-RF | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.3333 | 0.5 | 0.1111 | 0.3333 | 0.1667 |
corrCoff-RW-SVC | 0.4167 | 0.5 | 0.4167 | 1 | 0.5882 | 0.3333 | 0.5 | 0.1111 | 0.3333 | 0.1667 |
Without Cost-Sensitive Learning | With Cost-Sensitive Learning | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Workflow | Acc | AUC | Prec | Rec | F1 | Acc | AUC | Prec | Rec | F1 |
mPLV-WL-DT | 91.7 | 90.0 | 100 | 80.0 | 88.9 | 91.7 | 90.0 | 92.7 | 91.7 | 91.0 |
mPLV-WL-RF | 83.3 | 97.0 | 80.0 | 80.0 | 80.0 | 83.3 | 95.7 | 83.3 | 83.3 | 83.0 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Muralinath, R.N.; Pathak, V.; Mahanti, P.K. Metastable Substructure Embedding and Robust Classification of Multichannel EEG Data Using Spectral Graph Kernels. Future Internet 2025, 17, 102. https://doi.org/10.3390/fi17030102
Muralinath RN, Pathak V, Mahanti PK. Metastable Substructure Embedding and Robust Classification of Multichannel EEG Data Using Spectral Graph Kernels. Future Internet. 2025; 17(3):102. https://doi.org/10.3390/fi17030102
Chicago/Turabian StyleMuralinath, Rashmi N., Vishwambhar Pathak, and Prabhat K. Mahanti. 2025. "Metastable Substructure Embedding and Robust Classification of Multichannel EEG Data Using Spectral Graph Kernels" Future Internet 17, no. 3: 102. https://doi.org/10.3390/fi17030102
APA StyleMuralinath, R. N., Pathak, V., & Mahanti, P. K. (2025). Metastable Substructure Embedding and Robust Classification of Multichannel EEG Data Using Spectral Graph Kernels. Future Internet, 17(3), 102. https://doi.org/10.3390/fi17030102