Automatic Detection of Atrial Fibrillation in ECG Using Co-Occurrence Patterns of Dynamic Symbol Assignment and Machine Learning
<p>The overall pipeline of the proposed approach. The interbeat (RR) intervals are applied to dynamic symbol assignment (DSA) to map electrocardiography (ECG) signals to a symbolic sequence. The thresholds <math display="inline"><semantics> <mrow> <msub> <mi>T</mi> <mrow> <mi>i</mi> </mrow> </msub> </mrow> </semantics></math> is used to maps the symbols in the RR interval. The pattern transition probability <math display="inline"><semantics> <mi>P</mi> </semantics></math> is computed from co-occurrence pattern transition matrix <math display="inline"><semantics> <mi>M</mi> </semantics></math> for symbols <math display="inline"><semantics> <mrow> <mfenced close="}" open="{"> <mrow> <mi>a</mi> <mo>,</mo> <mi>b</mi> <mo>,</mo> <mi>c</mi> </mrow> </mfenced> </mrow> </semantics></math>. Finally, the <math display="inline"><semantics> <mi>P</mi> </semantics></math> is transformed to a 1-dimensional array <math display="inline"><semantics> <mover accent="true"> <mi>P</mi> <mo stretchy="false">→</mo> </mover> </semantics></math> using row-based concatenation, and the DSA features are extracted. The DSA features are fed to the k-nearest neighbor (kNN), support vector machine (SVM), random forest (RF), rotation forest (RoF), and ensemble learning (EL) classifiers to differentiate normal and paroxysmal atrial fibrillation (PAF) segments. The dark-blue arrow refers to the flow from one process to the next. The grey arrow refers to the intermediate outcome of the process.</p> "> Figure 2
<p>The pipeline of the dynamic symbol assignment (DSA) approach; representative RR intervals as input to the DSA method (<b>a</b>), distance evaluation of the input data (<b>b</b>), distance approximation using dynamic threshold lists (<b>c</b>), and representation of symbolic sequence after symbolization (<b>d</b>).</p> "> Figure 3
<p>The electrocardiography (ECG) is composed of <math display="inline"><semantics> <mrow> <mi>N</mi> <mo>+</mo> <mn>1</mn> </mrow> </semantics></math> R-waves and <math display="inline"><semantics> <mi>N</mi> </semantics></math> RR intervals.</p> "> Figure 4
<p>Representative ECG measurements of SR (<b>a</b>,<b>c</b>), and PAF (<b>b</b>,<b>d</b>) segments obtained from AFPDB and AFTDB, respectively. The identification of R-Peaks in PAF segments (within the red-rectangle area) are not easily identifiable.</p> "> Figure 5
<p>Representative SR (<b>a</b>), and PAF (<b>b</b>) segments with their corresponding RR interval and discretized symbolic sequences.</p> "> Figure 6
<p>Representative heatmaps of transition patterns for SR (<b>a</b>), and PAF (<b>b</b>) segments using co-occurrence matrices.</p> "> Figure 7
<p>Boxplots representing the distribution of maximum co-occurrence values for varied <math display="inline"><semantics> <mrow> <mfenced close="|" open="|"> <mi mathvariant="double-struck">V</mi> </mfenced> </mrow> </semantics></math> with <math display="inline"><semantics> <mrow> <mi>W</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> and for varied <math display="inline"><semantics> <mrow> <mi>W</mi> </mrow> </semantics></math> with <math display="inline"><semantics> <mrow> <mfenced close="|" open="|"> <mi mathvariant="double-struck">V</mi> </mfenced> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> in AFPDB (<b>a</b>,<b>c</b>) and AFTDB (<b>b</b>,<b>d</b>).</p> "> Figure 8
<p>The average receiver operating characteristic (ROC) plots represent the performance of the classifiers for the DSA feature obtained using <math display="inline"><semantics> <mrow> <mfenced close="|" open="|"> <mi mathvariant="double-struck">V</mi> </mfenced> <mo>=</mo> <mn>3</mn> <mo>,</mo> <mo> </mo> <mn>5</mn> <mo>,</mo> <mo> </mo> <mn>7</mn> </mrow> </semantics></math> with <math display="inline"><semantics> <mrow> <mi>W</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> for AFPDB (<b>a</b>–<b>c</b>) and AFTDB (<b>d</b>–<b>f</b>).</p> "> Figure 9
<p>Comparison of P (<b>a</b>) and R (<b>b</b>) obtained for a varied length of time series by the DSA method and its classification using different classifiers.</p> "> Figure 10
<p>The wearable T-shirt (Pro-Kit, Hexoskin, Quebec, Canada) (<b>a</b>), the cECG chair (Smart Seat, Capical, Braunschweig, Germany) (<b>d</b>), the acquired one-minute ECG segments from these sensors (<b>b</b>,<b>e</b>), and the corresponding transition pattern evaluated using our method (<b>c</b>,<b>f</b>), respectively.</p> ">
Abstract
:1. Introduction
2. Materials and Methods
2.1. Dynamic Symbol Assignment (DSA)
2.1.1. Distance Approximation
2.1.2. Symbolization
2.2. Co-Occurrence Patterns
DSA Features
2.3. Supervised Learning Classification
2.3.1. SVM
2.3.2. kNN
2.3.3. RF
2.3.4. RoF
2.3.5. EL
3. Evaluation
3.1. Hypotheses
3.2. Experiments
- E1: The heatmaps represent the global transition patterns of SR and PAF segments. The pattern transition between SR and PAF segments is computed, and the obtained patterns are compared visually between the segments. H1 will be accepted if the heatmaps for SR and PAF segments are distinct and exhibit varying patterns.
- E2: The DSA approach is applied to the various annotated databases, and the performance of the classifiers to differentiate SR and PAF segments is compared using their F-measure (F). H2 will be accepted if the F is higher (>90.0%).
- E3: The average performance of DSA feature for different and is compared individually using two databases. Statistical analysis is performed for varied , , and also to obtain comparable results. The performance of the classifiers is compared using the receiver operating characteristics (ROC) curve for multiple databases. The average values of the true positive rate and false positive rate obtained from each cross-validation are used. H3 is accepted if the performance gains for optimal parameters of and . and the area under ROC curves (AUCs), are above 90.0% for multiple databases. Based on the higher values of F, the optimal parameters of and are fixed for further experiments (E4 and E5).
- E4: The average F and AUCs are computed on the DSA features for a varied length of the signals and the values are compared. Most of the authors have evaluated their methods on the same length of signals. Here, the DSA approach is evaluated for a varied signal length. H4 is accepted if varied signal lengths yield similar or improved performance.
- E5: The F is computed on the series of two different but comparable databases to validate the cross-data analysis, and the values are compared. The majority of previous studies evaluated their methods on the same dataset [21,30,31,32]. Here, the one-minute PAF segments are obtained from AFTDB, and one-minute SR segments are extracted from AFPDB. We applied our DSA method across datasets. H5 is accepted if the F is above 95.0%.
3.3. Validation
3.4. Databases
4. Results
- 1.
- R1: The representative ECG signals, their corresponding RR interval, and the discretized series with symbolic sequences are shown in Figure 5. The differences in the distance between the R-spikes of the signals are visible in SR and PAF segments. The discretized symbolic sequence for SR segments contains relatively constant symbols, while the symbolic sequences of PAF segments are irregular and represent a frequent transition in patterns (Figure 5). For example, the symbol sequences and are for SR and PAF segments, respectively.
- 2.
- R2: Comparing SR and PAF segments in both databases, the highest performance of F = 93.6% and F = 98.3% is obtained using and 3, respectively (Table 4). The RoF and EL classifiers have high performance for varied . The P and R are also found to be the highest for in both databases. The RoF classifier yields the highest ACC of 93.6% and 98.3% for AFPDB and AFTDB databases, respectively. With varied classifiers, the DSA method differentiates SR and PAF segments in both databases, which confirms our hypothesis H2;
- 3.
- R3: The DSA features for varied discriminate SR and PAF segments in both databases (Figure 7). The maximum difference in the median in each database is 0.526 for . The DSA feature for varied with in AFPDB and is shown in Figure 7c,d. The higher value of the DSA feature indicates the presence of a similar pattern in the segments. The median is higher for SR segments and ranges from 0.5 to 0.75 for both databases. The mean decreases with higher values of W. The percentage difference of the average DSA feature is greater than 25% for varied W in both AFPDB and AFTDB databases. The smallest and largest difference in the average of DSA feature is observed with W=5 and 3 obtaining 0.122 and 0.526, respectively.
- 4.
- R4: For the AFPDB database, kNN and RoF obtained the highest F of 96.2% and 96.0%, respectively (Table 6). However, the AUC is high for longer time series with 3-min and 4-min sequences (96.9% and 97.2% in Table 6). The gain in AUC for a longer sequence can be explained by the fact that longer signals characterize the dynamics of signals effectively. In terms of P and R, kNN and RoF are found to be consistently high for a varied length of the signals (Figure 9). Furthermore, the ACC is observed to be high and consistent for kNN, RF, and RoF in 3-min sequences. However, the pattern transition in varied lengths of the sequences is captured by our DSA method (Table 6), which confirms our hypothesis H4.
- 5.
- R5: The performance ranges from 84.1% to 99.8% for varied using different classifiers. For the cross-dataset, the top three symbol lengths and obtained the highest F of 99.8% (Table 7). RoF and EL yield the highest performance. Except for SVM and kNN, the performance of DSA with varied is higher than 90.0%. Similarly, the AUC also ranges from 85.0% to 99.8% for varied using different classifiers. The F is found to be consistent in for RoF and EL classifiers. The kNN classifier yields the lowest performance of classifiers. Thus, we can conclude that our method is effective in the cross-data analysis (H5 is true).
5. Discussion
5.1. Existing Challenges
5.2. Multi-Length ECGs
5.3. Cross Dataset Analysis
5.4. Application with Novel Sensing Devices
5.5. Comparison and Future Scope
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Morillo, C.A.; Banerjee, A.; Perel, P.; Wood, D.; Jouven, X. Atrial fibrillation: The current epidemic. J. Geriatr. Cardiol. 2017, 14, 195–203. [Google Scholar]
- Kashif, M.; Jonas, S.M.; Deserno, T.M. Deterioration of R-Wave Detection in Pathology and Noise: A Comprehensive Analysis Using Simultaneous Truth and Performance Level Estimation. IEEE Trans. Biomed. Eng. 2016, 64, 2163–2175. [Google Scholar] [CrossRef] [PubMed]
- Kaptoge, S.; Pennells, L.; De Bacquer, D.; Cooney, M.T.; Kavousi, M.; Stevens, G.; Riley, L.M.; Savin, S.; Khan, T.; Altay, S.; et al. World Health Organization cardiovascular disease risk charts: Revised models to estimate risk in 21 global regions. Lancet Glob. Health 2019, 7, e1332–e1345. [Google Scholar] [CrossRef] [Green Version]
- Deserno, T.M.; Marx, N. Computational electrocardiography: Revisiting Holter ECG monitoring. Methods Inf. Med. 2016, 55, 305–311. [Google Scholar]
- Chandra, B.S.; Sastry, C.S.; Jana, S.; Patidar, S. Atrial fibrillation detection using convolutional neural networks. In Proceedings of the 2017 Computing in Cardiology (CinC), Rennes, France, 24–27 September 2017; pp. 1–4. [Google Scholar]
- He, R.; Wang, K.; Zhao, N.; Liu, Y.; Yuan, Y.; Li, Q.; Zhang, H. Automatic Detection of Atrial Fibrillation Based on Continuous Wavelet Transform and 2D Convolutional Neural Networks. Front. Physiol. 2018, 9, 1206. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Rizwan, A.; Zoha, A.; Ben Mabrouk, I.; Sabbour, H.M.; Al-Sumaiti, A.S.; Alomainy, A.; Imran, M.A.; Abbasi, Q.H. A Review on the State of the Art in Atrial Fibrillation Detection Enabled by Machine Learning. IEEE Rev. Biomed. Eng. 2020, 14, 219–239. [Google Scholar] [CrossRef] [Green Version]
- Burguera, A. Fast QRS Detection and ECG Compression Based on Signal Structural Analysis. IEEE J. Biomed. Health Inform. 2018, 23, 123–131. [Google Scholar] [CrossRef]
- Spicher, N.; Kukuk, M. Delineation of Electrocardiograms Using Multiscale Parameter Estimation. IEEE J. Biomed. Health Inform. 2020, 24, 2216–2229. [Google Scholar] [CrossRef]
- Andersen, R.S.; Peimankar, A.; Puthusserypady, S. A deep learning approach for real-time detection of atrial fibrillation. Expert Syst. Appl. 2019, 115, 465–473. [Google Scholar] [CrossRef]
- Mahajan, R.; Viangteeravat, T.; Akbilgic, O. Improved detection of congestive heart failure via probabilistic symbolic pattern recognition and heart rate variability metrics. Int. J. Med. Inform. 2017, 108, 55–63. [Google Scholar] [CrossRef]
- Wang, J.; Warnecke, J.M.; Haghi, M.; Deserno, T.M. Unobtrusive Health Monitoring in Private Spaces: The Smart Vehicle. Sensors 2020, 20, 2442. [Google Scholar] [CrossRef] [PubMed]
- Castro, I.; Patel, A.; Deviaene, M.; Huysmans, D.; Borzee, P.; Buyse, B.; Testelmans, D.; Van Huffel, S.; Varon, C.; Torfs, T. Unobtrusive, through-clothing ECG and Bioimpedance Monitoring in Sleep Apnea Patients. TC 2020, 19, 3. [Google Scholar]
- Satija, U.; Ramkumar, B.; Manikandan, M.S. A Review of Signal Processing Techniques for Electrocardiogram Signal Quality Assessment. IEEE Rev. Biomed. Eng. 2018, 11, 36–52. [Google Scholar] [CrossRef]
- Yin, H.; Yang, S.; Zhu, X.; Ma, S.; Chen, L. Symbolic representation based on trend features for biomedical data classification. Technol. Health Care 2015, 23, S501–S510. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Niu, J.; Tang, Y.; Sun, Z.; Zhang, W. Inter-Patient ECG Classification with Symbolic Representations and Multi-Perspective Convolutional Neural Networks. IEEE J. Biomed. Health Inform. 2019, 24, 1321–1332. [Google Scholar] [CrossRef]
- Adam, M.; Oh, S.L.; Sudarshan, V.K.; Koh, J.E.; Hagiwara, Y.; Tan, J.H.; Tan, R.S.; Acharya, U.R. Automated characterization of cardiovascular diseases using relative wavelet nonlinear features extracted from ECG signals. Comput. Methods Programs Biomed. 2018, 161, 133–143. [Google Scholar] [CrossRef]
- Akbilgic, O.; Howe, J.A. Symbolic pattern recognition for sequential data. Seq. Anal. 2017, 36, 528–540. [Google Scholar] [CrossRef]
- Akbilgic, O.; Howe, J.A.; Davis, R.L. Categorizing atrial fibrillation via symbolic pattern recognition. J. Med. Stat. Inform. 2016, 4, 8. [Google Scholar] [CrossRef]
- Zhang, C.; Chen, Y.; Yin, A.; Wang, X. Anomaly detection in ECG based on trend symbolic aggregate approximation. Math. Biosci. Eng. 2019, 16, 2154–2167. [Google Scholar] [CrossRef]
- Mohebbi, M.; Ghassemian, H. Prediction of paroxysmal atrial fibrillation based on non-linear analysis and spectrum and bispectrum features of the heart rate variability signal. Comput. Methods Programs Biomed. 2012, 105, 40–49. [Google Scholar] [CrossRef]
- Moody, G.; Goldberger, A.; McClennen, S.; Swiryn, S. Predicting the onset of paroxysmal atrial fibrillation: The computers in cardiology challenge 2001. Comput. Cardiol. 2001, 28, 113–116. [Google Scholar]
- Moody, G. Spontaneous termination of atrial fibrillation: A challenge from physionet and computers in cardiology 2004. Comput. Cardiol. 2004, 31, 101–104. [Google Scholar]
- Goldberger, A.L.; Amaral, L.A.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.-K.; Stanley, H.E. Physiobank, physiotoolkit, and physionet: Components of a new research resource for complex physiologic signals. Circulation 2000, 101, E215–E220. [Google Scholar] [CrossRef] [Green Version]
- Le, N.Q.K.; Do, D.T.; Hung, T.N.K.; Lam, L.H.T.; Huynh, T.-T.; Nguyen, N.T.K. A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification. Int. J. Mol. Sci. 2020, 21, 9070. [Google Scholar] [CrossRef]
- Venugopal, G.; Navaneethakrishna, M.; Ramakrishnan, S. Extraction and analysis of multiple time window features associated with muscle fatigue conditions using sEMG signals. Expert Syst. Appl. 2014, 41, 2652–2659. [Google Scholar] [CrossRef]
- Karthick, P.; Ghosh, D.M.; Ramakrishnan, S. Surface electromyography based muscle fatigue detection using high-resolution time-frequency methods and machine learning algorithms. Comput. Methods Programs Biomed. 2018, 154, 45–56. [Google Scholar] [CrossRef] [PubMed]
- Wu, X.; Gao, Y.; Jiao, D. Multi-Label Classification Based on Random Forest Algorithm for Non-Intrusive Load Monitoring System. Processes 2019, 7, 337. [Google Scholar] [CrossRef] [Green Version]
- Huang, Q.; Chen, Y.; Liu, L.; Tao, D.; Li, X. On Combining Biclustering Mining and AdaBoost for Breast Tumor Classification. IEEE Trans. Knowl. Data Eng. 2020, 32, 728–738. [Google Scholar] [CrossRef]
- Zong, W.; Mukkamala, R.; Mark, R.G. A methodology for predicting paroxysmal atrial fibrillation based on ECG arrhythmia feature analysis. In Proceedings of the Computers in Cardiology 2001 Vol28 (Cat No01CH37287), Rotterdam, The Netherlands, 23–26 September 2001; pp. 125–128. [Google Scholar]
- Sutton, J.R.; Mahajan, R.; Akbilgic, O.; Kamaleswaran, R. Physonline: An open source machine learning pipeline for real-time analysis of streaming physiological waveform. IEEE J. Biomed. Health Inform. 2019, 23, 59–65. [Google Scholar] [CrossRef]
- Pourbabaee, B.; Roshtkhari, M.J.; Khorasani, K. Deep convolutional neural networks and learning ECG features for screening paroxysmal atrial fibrillation patients. IEEE Trans. Syst. Man Cybern. Syst. 2018, 48, 2095–2104. [Google Scholar] [CrossRef]
- Le, N.Q.K.; Do, D.T.; Chiu, F.-Y.; Yapp, E.K.Y.; Yeh, H.-Y.; Chen, C.-Y. XGBoost Improves Classification of MGMT Promoter Methylation Status in IDH1 Wildtype Glioblastoma. J. Pers. Med. 2020, 10, 128. [Google Scholar] [CrossRef] [PubMed]
- Ganapathy, N.; Veeranki, Y.R.; Swaminathan, R. Convolutional neural network based emotion classification using electrodermal activity signals and time-frequency features. Expert Syst. Appl. 2020, 159, 113571. [Google Scholar] [CrossRef]
- Zhang, Y.; Guo, Y.; Yang, P.; Chen, W.; Lo, B. Epilepsy Seizure Prediction on EEG Using Common Spatial Pattern and Convolutional Neural Network. IEEE J. Biomed. Health Inform. 2019, 24, 465–474. [Google Scholar] [CrossRef] [PubMed]
- Habib, A.; Karmakar, C.; Yearwood, J. Choosing a sampling frequency for ECG QRS detection using convolutional networks. arXiv 2020, arXiv:2007.02052. [Google Scholar]
- Castro, I.D.; Varon, C.; Torfs, T.; Van Huffel, S.; Puers, R.; Van Hoof, C. Evaluation of a Multichannel Non-Contact ECG System and Signal Quality Algorithms for Sleep Apnea Detection and Monitoring. Sensors 2018, 18, 577. [Google Scholar] [CrossRef] [Green Version]
- Lan, Z.; Sourina, O.; Wang, L.; Scherer, R.; Muller-Putz, G.R. Domain Adaptation Techniques for EEG-Based Emotion Recognition: A Comparative Study on Two Public Datasets. IEEE Trans. Cogn. Dev. Syst. 2019, 11, 85–94. [Google Scholar] [CrossRef]
- Li, X.; Song, D.; Zhang, P.; Zhang, Y.; Hou, Y.; Hu, B. Exploring EEG features in cross-subject emotion recognition. Front. Neurosci. 2018, 12, 162. [Google Scholar] [CrossRef] [Green Version]
- Park, J.; Lee, S.; Jeon, M. Atrial fibrillation detection by heart rate variability in Poincare plot. Biomed. Eng. Online 2009, 8, 38. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Natarajan, A.; Angarita, G.; Gaiser, E.; Malison, R.; Ganesan, D.; Marlin, B.M. Domain adaptation methods for improving lab-to-field generalization of cocaine detection using wearable ECG. In Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Heidelberg, Germany, 12–16 September 2016; pp. 875–885. [Google Scholar]
- Montes, J.; Young, J.C.; Tandy, R.; Navalta, J.W. Reliability and Validation of the Hexoskin Wearable Bio-Collection Device During Walking Conditions. Int. J. Exerc. Sci. 2018, 11, 806. [Google Scholar]
- Wang, J.; Spicher, N.; Warnecke, J.; Haghi, M.; Schwartze, J.; Deserno, T. Unobtrusive Health Monitoring in Private Spaces: The Smart Home. Sensors 2021, 21, 864. [Google Scholar] [CrossRef]
- Carre Technologies Inc. (Hexoskin), Montreal, Canada. 2008. Available online: https://www.hexoskin.com/ (accessed on 1 May 2021).
- Capical GmbH, Braunschweig, Germany. 2010. Available online: http://www.capical.de (accessed on 1 May 2021).
- Ganapathy, N.; Swaminathan, R.; Deserno, T.M. Adaptive learning and cross training improves R-wave detection in ECG. Comput. Methods Programs Biomed. 2021, 200, 105931. [Google Scholar] [CrossRef] [PubMed]
Symbol Pattern | Pattern Transition Probabilities | ||
---|---|---|---|
0.04 | 0.07 | 0 | |
0.04 | 0.04 | 0.07 | |
0 | 0 | 0 | |
0.04 | 0.07 | 0 | |
0.04 | 0.14 | 0.07 | |
0 | 0.14 | 0.04 | |
0 | 0 | 0 | |
0.04 | 0.07 | 0.04 | |
0 | 0 | 0.07 |
Database | Hyper-Parameters Range | Optimal |
---|---|---|
SVM | cost = (0.001, 0.01, 0.1, 1) | 0.01 |
gamma = (0.01, 0.1, 1) | 0.1 | |
kernel = (Linear, Polynomial, Radial basis function) | Linear | |
kNN | c = (1, 2, 3…,10) | 5 |
weights = (Uniform, Distance) | Uniform | |
metric = (Euclidean, Manhattan, Minkowski) | Euclidean | |
RF | max_depth = (10, 20, 30, …, 50, None) | None |
max_features = (‘auto’, ‘sqrt’) | sqrt | |
min_samples_leaf = (1, 2, 3) | 1 | |
min_samples_split = (2, 4, 6, 8, 10, 12) | 10 | |
n_estimators = (100, 200, 300, …, 500) | 100 | |
RoF | max_features = (‘auto’, ‘sqrt’) | sqrt |
classifers = (‘RF’, ‘J48′, ’Decision tree’) | RF | |
maxGroup = (1, 2, 3…, 10) | 3 | |
minGroup = (1, 2, 3…, 10) | 3 | |
projectionFilter = (‘PCA’, ’random’) | PCA | |
EL | number_of_classifers = (1,2,3…,10) | 3 |
classifiers_used = (SVM, kNN, RF, RoF) | SVM, RF, RoF |
Database | Leads | Subjects | Records | Sampling Rate (Hz) | Length (min) | Quantization Bit | AF Details (min) | Total Length (h) |
---|---|---|---|---|---|---|---|---|
PAF Prediction Challenge—2001 | 2 | 48 | 50 | 128 | 25, 5 | 16 | 5 | 24 |
AF Termination Challenge—2004 | 2 | 30 | 30 | 128 | 1 | 16 | 1 | 20–24 |
Classifier | AFPDB | AFTDB | |||||||
---|---|---|---|---|---|---|---|---|---|
ACC | P | R | F | ACC | P | R | F | ||
3 | SVM | 80.0 | 82.8 | 80.0 | 81.4 | 86.7 | 89.4 | 86.7 | 88.0 |
kNN | 81.8 | 84.8 | 81.8 | 83.3 | 81.7 | 86.7 | 81.7 | 84.1 | |
RF | 91.8 | 91.9 | 91.8 | 91.8 | 93.5 | 96.9 | 96.7 | 96.8 | |
RoF | 92.7 | 92.8 | 92.7 | 92.7 | 98.3 | 98.4 | 98.3 | 98.3 | |
EL | 91.8 | 91.9 | 91.8 | 91.8 | 98.3 | 98.4 | 98.3 | 98.3 | |
4 | SVM | 77.3 | 81.0 | 77.3 | 79.1 | 85.0 | 88.4 | 85.0 | 86.7 |
kNN | 52.7 | 61.4 | 52.7 | 56.7 | 50.0 | 50.0 | 50.0 | 50.0 | |
RF | 81.8 | 84.0 | 81.8 | 82.9 | 93.3 | 94.1 | 93.3 | 93.7 | |
RoF | 93.6 | 93.7 | 93.6 | 93.6 | 98.3 | 98.4 | 98.3 | 98.3 | |
EL | 89.1 | 89.1 | 89.1 | 89.1 | 93.3 | 93.3 | 93.3 | 93.3 | |
5 | SVM | 77.3 | 81.9 | 77.3 | 79.5 | 85.0 | 88.4 | 85.0 | 86.7 |
kNN | 50.0 | 50.0 | 50.0 | 50.0 | 50.0 | 50.0 | 50.0 | 50.0 | |
RF | 72.7 | 82.4 | 72.7 | 77.2 | 48.3 | 83.5 | 75.0 | 79.0 | |
RoF | 88.2 | 89.2 | 88.2 | 88.7 | 85.0 | 87.1 | 85.0 | 86.0 | |
EL | 91.8 | 91.9 | 91.8 | 91.8 | 95.0 | 95.0 | 95.0 | 95.0 |
Classifier | AFPDB | AFTDB | |||||||
---|---|---|---|---|---|---|---|---|---|
3 | 5 | 7 | 9 | 3 | 5 | 7 | 9 | ||
3 | SVM | 86.9 | 82.1 | 81.4 | 80.6 | 84.1 | 89.1 | 88.0 | 88.0 |
kNN | 89.2 | 89.0 | 83.3 | 72.5 | 98.3 | 84.1 | 84.1 | 62.8 | |
RF | 90.9 | 91.8 | 91.8 | 92.0 | 93.7 | 96.7 | 96.8 | 88.9 | |
RoF | 90.0 | 91.8 | 92.7 | 94.6 | 98.3 | 99.8 | 98.3 | 98.3 | |
EL | 88.2 | 91.0 | 91.8 | 93.6 | 95.2 | 98.3 | 98.3 | 95.0 | |
4 | SVM | 85.2 | 79.9 | 79.1 | 79.1 | 81.5 | 80.2 | 86.7 | 86.7 |
kNN | 90.1 | 62.8 | 56.7 | 50.0 | 92.3 | 50.0 | 50.0 | 62.8 | |
RF | 91.8 | 91.1 | 82.9 | 84.8 | 95.0 | 88.1 | 93.7 | 85.4 | |
RoF | 90.9 | 92.7 | 93.6 | 93.6 | 96.8 | 99.8 | 98.3 | 96.8 | |
EL | 88.2 | 92.8 | 89.1 | 91.8 | 91.7 | 99.8 | 93.3 | 93.4 | |
5 | SVM | 82.9 | 79.1 | 79.5 | 60.7 | 78.9 | 77.7 | 86.7 | 47.1 |
kNN | 73.7 | 60.7 | 50.0 | 50.0 | 63.9 | 50.0 | 50.0 | 62.8 | |
RF | 87.5 | 77.9 | 77.2 | 67.8 | 95.0 | 73.9 | 79.0 | 70.4 | |
RoF | 90.0 | 87.9 | 88.7 | 77.9 | 99.4 | 84.1 | 86.0 | 80.3 | |
EL | 92.7 | 93.6 | 91.8 | 85.5 | 98.3 | 95.0 | 95.0 | 81.7 |
Classifier | Time Segments (min) | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | |||||||||||
ACC | F | AUCs | ACC | F | AUCs | ACC | F | AUCs | ACC | F | AUCs | ACC | F | AUCs | |
SVM | 85.7 | 86.7 | 86.0 | 88.0 | 88.4 | 88.0 | 88.0 | 88.4 | 88.0 | 90.0 | 90.0 | 90.0 | 86.4 | 86.9 | 86.4 |
kNN | 92.0 | 92.1 | 96.2 | 94.0 | 94.2 | 96.1 | 96.0 | 96.1 | 96.9 | 96.0 | 96.2 | 97.2 | 89.0 | 89.1 | 93.9 |
RF | 94.0 | 94.0 | 97.6 | 94.0 | 94.0 | 96.1 | 96.0 | 96.1 | 95.8 | 92.0 | 92.1 | 96.1 | 90.1 | 90.1 | 95.8 |
RoF | 92.0 | 92.0 | 97.1 | 96.0 | 96.1 | 95.5 | 96.0 | 96.0 | 96.0 | 94.0 | 94.1 | 95.7 | 90.0 | 90.0 | 96.2 |
EL | 92.0 | 92.1 | 96.0 | 90.0 | 90.0 | 95.4 | 94.0 | 94.1 | 92.0 | 94.0 | 94.1 | 94.1 | 88.2 | 88.3 | 92.2 |
Classifier | Symbol Lengths | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
3 | 5 | 7 | 9 | |||||||||
ACC | F | AUCs | ACC | F | AUCs | ACC | F | AUCs | ACC | F | AUCs | |
SVM | 85.0 | 86.7 | 85.0 | 85.0 | 86.7 | 85.0 | 85.0 | 86.7 | 85.0 | 85.00 | 86.7 | 85.0 |
kNN | 95.0 | 95.1 | 99.3 | 88.3 | 89.4 | 94.0 | 82.7 | 84.1 | 95.4 | 55.0 | 63.9 | 65.7 |
RF | 93.3 | 93.3 | 99.3 | 96.7 | 96.7 | 99.4 | 95.0 | 95.2 | 99.8 | 98.3 | 98.3 | 99.8 |
RoF | 96.7 | 96.7 | 99.6 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 |
EL | 98.3 | 98.3 | 96.9 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 99.8 | 96.7 | 96.8 | 99.8 |
Existing Methods | Database | Classifiers | Validation | Length (min) | P | R | F | ACC |
---|---|---|---|---|---|---|---|---|
Mohebbi and Ghassemian [21] | AFPDB | SVM | Split | - | 96.3 | 93.1 | - | - |
Zong et al. [30] | AFPDB | ARMA, FL | Split | 30 | - | - | - | 80.88 |
Sutton et al. [31] | AFPDB | LD, LR, DT, RF | 5-fold | 1 | 100 | 73.6 | - | 82.0 |
Pourbabaee et al. [32] | AFPDB | CNN, kNN, SVM, MLP | Split | 5 | - | - | - | 91.0 |
Park et al. [40] | Combined AFPDB and AFTDB | SVM | 4-fold | 1 | 91.4 | 92.9 | - | - |
Our Method | AFPDB | SVM, kNN, RF, RoF, EL | 5-fold | 1, 5 | 94.6 | 94.5 | 94.6 | 94.0 |
AFTDB | 1 | 99.8 | 99.8 | 99.8 | 99.8 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ganapathy, N.; Baumgärtel, D.; Deserno, T.M. Automatic Detection of Atrial Fibrillation in ECG Using Co-Occurrence Patterns of Dynamic Symbol Assignment and Machine Learning. Sensors 2021, 21, 3542. https://doi.org/10.3390/s21103542
Ganapathy N, Baumgärtel D, Deserno TM. Automatic Detection of Atrial Fibrillation in ECG Using Co-Occurrence Patterns of Dynamic Symbol Assignment and Machine Learning. Sensors. 2021; 21(10):3542. https://doi.org/10.3390/s21103542
Chicago/Turabian StyleGanapathy, Nagarajan, Diana Baumgärtel, and Thomas M. Deserno. 2021. "Automatic Detection of Atrial Fibrillation in ECG Using Co-Occurrence Patterns of Dynamic Symbol Assignment and Machine Learning" Sensors 21, no. 10: 3542. https://doi.org/10.3390/s21103542
APA StyleGanapathy, N., Baumgärtel, D., & Deserno, T. M. (2021). Automatic Detection of Atrial Fibrillation in ECG Using Co-Occurrence Patterns of Dynamic Symbol Assignment and Machine Learning. Sensors, 21(10), 3542. https://doi.org/10.3390/s21103542