Sequence-Information Recognition Method Based on Integrated mDTW
<p>Schematic diagram of the core framework process.</p> "> Figure 2
<p>Process of dynamic warping, including forward and backward steps.</p> "> Figure 3
<p>The definition of useless DTWs.</p> "> Figure 4
<p>Detailed presentation of the core parts of the framework.</p> "> Figure 5
<p>Point-matching results, black line indicates wrong, green indicates right. Red and blue lines are two handwriting characters.</p> ">
Abstract
:1. Introduction
Our Contribution
2. Related Works
2.1. Related Works of DTW
2.2. Related Works of Sequence Data
3. Methodology
3.1. Multidimensional DTW
- Step 2: initialize G as a zero matrix
- Step 4: repeat Step 3 m times until all elements in G are updated
- Step 5: start from , jump to a temporal location via .
- Step 6: Repeat Step5 until terminal (red arrows in Figure 2)
3.2. Sequence Clustering
- Step 1 Separate the whole database into groups by their characters. Thus, each group has handwriting samples with the same character.
- Step 2 Choose a cluster number and then divide each group to clusters by minimizing Equation (7)
- step3 Repeat step2 times to obtain all characters clustered. After that, we obtain a center set with sequence samples.
Algorithm 1: Hierarchical cluster for a Chinese handwriting sample group |
Input: A group of training sequences |
0: Initialize a cluster index matrix , set and others as 0 |
1: Compute a DTW distance matrix via Equation (8). |
2: for iter in |
3: find the location |
4: compute a temporal matrix by Equation (9) and replace D by |
5: merge the cluster to the |
6: end for |
7: for iter in n |
8: find the sequence sample in row of C by Equation (10) and mark it as |
9: end for |
Output: n sequences as centers |
3.3. Joint Modeling with Sequence Processing Networks
4. Experiments
4.1. Non-Rigid Sparse Point Matching
4.2. Handwriting Character Recognition Systems
Methods | Top-1 | Top-3 | Top-10 |
---|---|---|---|
Benchmark 2013 [18] | 95.3% | 97.3% | 98.8% |
VO-3 2013 [28] | 95.8% | 97.8% | 99.0% |
UWarwick 2013 [28] | 97.4% | 98.4% | 99.3% |
Runner-up 2013 [28] | 96.9% | 98.2% | 99.2% |
DropSample 2016 [26] | 97.2% | 98.4% | 99.4% |
Ensemble-9 2016 [26] | 97.5% | 98.5% | 99.5% |
NET4-subseq30 2017 [27] | 97.9% | 98.9% | 99.5% |
NET123456 2017 [27] | 98.1% | 99.0% | 99.5% |
RNN:Single 2019 [19] | 97.2% | 98.4% | 99.4% |
RNN:Ensemble 2019 [19] | 97.6% | 98.6% | 99.5% |
MSCS+ASA+SCL 2022 [29] | 96.7% | 97.6% | 98.9% |
SqueezeNext+CCBAM10 2023 [29] | 97.4% | 98.6% | 99.2% |
DLHCR 2024 [29] | 97.8% | 98.7% | 99.4% |
mDTW-Cluster | 68.7% | 88.8% | 94.6% |
mDTW+cGRU | 97.4% | 99.1% | 99.6% |
mDTW+Transformer | 98.5% | 99.3% | 99.8% |
4.3. Human Activity Recognition Experiments
4.4. Ablation Study
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Cai, X.; Xu, T.; Yi, J.; Huang, J.; Rajasekaran, S. DTWNet: A Dynamic Time Warping Network. Adv. Neural Inf. Process. Syst. 2019, 32, 11640–11650. [Google Scholar]
- Ismail Fawaz, H.; Forestier, G.; Weber, J.; Idoumghar, L.; Muller, P.A. Deep learning for time series classification: A review. Data Min. Knowl. Discov. 2019, 33, 917–963. [Google Scholar] [CrossRef]
- Qu, Y.; Yang, M.; Zhang, J.; Xie, W.; Qiang, B.; Chen, J. An outline of multi-sensor fusion methods for mobile agents indoor navigation. Sensors 2021, 21, 1605. [Google Scholar] [CrossRef] [PubMed]
- Luo, Z.; Qi, R.; Li, Q.; Zheng, J.; Shao, S. ABODE-Net: An Attention-based Deep Learning Model for Non-intrusive Building Occupancy Detection Using Smart Meter Data. In Proceedings of the International Conference on Smart Computing and Communication, New York, NY, USA, 18–20 November 2022; pp. 152–164. [Google Scholar]
- Song, C.; Lu, M.; Wang, Y.; Lu, W. A dynamic time warping loss-based closed-loop CNN for seismic impedance inversion. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5925313. [Google Scholar] [CrossRef]
- Middlehurst, M.; Schäfer, P.; Bagnall, A. Bake off redux: A review and experimental evaluation of recent time series classification algorithms. Data Min. Knowl. Discov. 2024, 38, 1958–2031. [Google Scholar] [CrossRef]
- Shen, J.; Bao, S.D.; Yang, L.C.; Li, Y. The PLR-DTW method for ECG based biometric identification. In Proceedings of the 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Boston, MA, USA, 30 August–3 September 2011; pp. 5248–5251. [Google Scholar]
- Schirrmeister, R.T.; Springenberg, J.T.; Fiederer, L.D.J.; Glasstetter, M.; Eggensperger, K.; Tangermann, M.; Hutter, F.; Burgard, W.; Ball, T. Deep learning with convolutional neural networks for EEG decoding and visualization. Hum. Brain Mapp. 2017, 38, 5391–5420. [Google Scholar] [CrossRef]
- Lerogeron, H.; Picot-Clémente, R.; Rakotomamonjy, A.; Heutte, L. Approximating dynamic time warping with a convolutional neural network on EEG data. Pattern Recognit. Lett. 2023, 171, 162–169. [Google Scholar] [CrossRef]
- Kate, R.J. Using dynamic time warping distances as features for improved time series classification. Data Min. Knowl. Discov. 2016, 30, 283–312. [Google Scholar] [CrossRef]
- Zhang, H.; Dong, Y.; Li, J.; Xu, D. An efficient method for time series similarity search using binary code representation and hamming distance. Intell. Data Anal. 2021, 25, 439–461. [Google Scholar] [CrossRef]
- Gold, O.; Sharir, M. Dynamic Time Warping and Geometric Edit Distance: Breaking the Quadratic Barrier. ACM Trans. Algorithms 2016, 14, 50. [Google Scholar] [CrossRef]
- Ibrahim, M.Z.; Mulvaney, D. Geometry based lip reading system using multi dimension dynamic time warping. In Proceedings of the 2012 Visual Communications and Image Processing, San Diego, CA, USA, 27–30 November 2012; pp. 1–6. [Google Scholar]
- Gupta, N.; Gupta, S.K.; Pathak, R.K.; Jain, V.; Rashidi, P.; Suri, J.S. Human activity recognition in artificial intelligence framework: A narrative review. Artif. Intell. Rev. 2022, 55, 4755–4808. [Google Scholar] [CrossRef] [PubMed]
- Ramanujam, E.; Perumal, T.; Padmavathi, S. Human activity recognition with smartphone and wearable sensors using deep learning techniques: A review. IEEE Sensors J. 2021, 21, 13029–13040. [Google Scholar] [CrossRef]
- Lockhart, J.W.; Pulickal, T.; Weiss, G.M. Applications of mobile activity recognition. In Proceedings of the 2012 ACM Conference on Ubiquitous Computing, Pittsburgh, PA, USA, 5–8 September 2012; pp. 1054–1058. [Google Scholar]
- Vaizman, Y.; Ellis, K.; Lanckriet, G. Recognizing detailed human context in the wild from smartphones and smartwatches. IEEE Pervasive Comput. 2017, 16, 62–74. [Google Scholar] [CrossRef]
- Liu, C.L.; Yin, F.; Wang, D.H.; Wang, Q.F. Online and offline handwritten Chinese character recognition: Benchmarking on new databases. Pattern Recognit. 2013, 46, 155–162. [Google Scholar] [CrossRef]
- Ren, H.; Wang, W.; Liu, C. Recognizing online handwritten Chinese characters using RNNs with new computing architectures. Pattern Recognit. 2019, 93, 179–192. [Google Scholar] [CrossRef]
- Zhang, X.Y.; Bengio, Y.; Liu, C.L. Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark. Pattern Recognit. 2017, 61, 348–360. [Google Scholar] [CrossRef]
- Li, Z.; Xiao, Y.; Wu, Q.; Jin, M.; Lu, H. Deep template matching for offline handwritten Chinese character recognition. J. Eng. 2020, 2020, 120–124. [Google Scholar] [CrossRef]
- Lian, W.; Zhang, L.; Yang, M. An Efficient Globally Optimal Algorithm for Asymmetric Point Matching. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1281–1293. [Google Scholar] [CrossRef]
- Zhao, B.; Yang, M.; Pan, H.; Zhu, Q.; Tao, J. Nonrigid point matching of Chinese characters for robot writing. In Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO), Macau, China, 5–8 December 2017; pp. 762–767. [Google Scholar]
- Chen, Z.; Jiang, P.; Huang, R. Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models. arXiv 2024, arXiv:2408.08568. [Google Scholar]
- Lee, N.; Min, J.; Lee, J.; Kim, S.; Lee, K.; Park, J.; Cho, M. 3D Geometric Shape Assembly via Efficient Point Cloud Matching. arXiv 2024, arXiv:2407.10542. [Google Scholar]
- Yang, W.; Jin, L.; Tao, D.; Xie, Z.; Feng, Z. DropSample: A new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten Chinese character recognition. Pattern Recognit. 2016, 58, 190–203. [Google Scholar] [CrossRef]
- Zhang, X.Y.; Yin, F.; Zhang, Y.M.; Liu, C.L.; Bengio, Y. Drawing and recognizing chinese characters with recurrent neural network. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 849–862. [Google Scholar] [CrossRef] [PubMed]
- Yin, F.; Wang, Q.F.; Zhang, X.Y.; Liu, C.L. ICDAR 2013 Chinese handwriting recognition competition. In Proceedings of the 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA, 25–28 August 2013; pp. 1464–1470. [Google Scholar]
- Kriuk, B.; Kriuk, F. Deep Learning-Driven Approach for Handwritten Chinese Character Classification. arXiv 2024, arXiv:2401.17098. [Google Scholar]
- Wang, X.; Zhang, L.; Huang, W.; Wang, S.; Wu, H.; He, J.; Song, A. Deep convolutional networks with tunable speed–accuracy tradeoff for human activity recognition using wearables. IEEE Trans. Instrum. Meas. 2021, 71, 2503912. [Google Scholar] [CrossRef]
- Jiang, W.; Yin, Z. Human activity recognition using wearable sensors by deep convolutional neural networks. In Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 1307–1310. [Google Scholar]
- Ronao, C.A.; Cho, S.B. Human activity recognition with smartphone sensors using deep learning neural networks. Expert Syst. Appl. 2016, 59, 235–244. [Google Scholar] [CrossRef]
- Hammerla, N.Y.; Halloran, S.; Plötz, T. Deep, convolutional, and recurrent models for human activity recognition using wearables. arXiv 2016, arXiv:1604.08880. [Google Scholar]
- Ordóñez, F.J.; Roggen, D. Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition. Sensors 2016, 16, 115. [Google Scholar] [CrossRef]
- Ignatov, A. Real-time human activity recognition from accelerometer data using Convolutional Neural Networks. Appl. Soft Comput. 2018, 62, 915–922. [Google Scholar] [CrossRef]
- Hu, C.; Chen, Y.; Hu, L.; Peng, X. A novel random forests based class incremental learning method for activity recognition. Pattern Recognit. 2018, 78, 277–290. [Google Scholar] [CrossRef]
- Zeng, M.; Gao, H.; Yu, T.; Mengshoel, O.J.; Langseth, H.; Lane, I.; Liu, X. Understanding and improving recurrent networks for human activity recognition by continuous attention. In Proceedings of the 2018 ACM International Symposium on Wearable Computers, Singapore, 8–12 October 2018; pp. 56–63. [Google Scholar]
- Ma, H.; Li, W.; Zhang, X.; Gao, S.; Lu, S. AttnSense: Multi-level Attention Mechanism For Multimodal Human Activity Recognition. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19, Macao, China, 10–16 August 2019; pp. 3109–3115. [Google Scholar] [CrossRef]
- Teng, Q.; Wang, K.; Zhang, L.; He, J. The layer-wise training convolutional neural networks using local loss for sensor-based human activity recognition. IEEE Sensors J. 2020, 20, 7265–7274. [Google Scholar] [CrossRef]
- Tang, Y.; Teng, Q.; Zhang, L.; Min, F.; He, J. Layer-wise training convolutional neural networks with smaller filters for human activity recognition using wearable sensors. IEEE Sensors J. 2020, 21, 581–592. [Google Scholar] [CrossRef]
- Leng, Z.; Kwon, H.; Plötz, T. Generating virtual on-body accelerometer data from virtual textual descriptions for human activity recognition. In Proceedings of the 2023 ACM International Symposium on Wearable Computers, Cancun, Mexico, 8–12 October 2023; pp. 39–43. [Google Scholar]
- Saha, B.; Samanta, R.; Ghosh, S.K.; Roy, R.B. TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification. arXiv 2024, arXiv:2408.16535. [Google Scholar]
Methods | UCI-HAR | WISDM | PAMAP2 | UniMib | OPPOR |
---|---|---|---|---|---|
DCNN 2015 [31] | 95.1% | - | - | - | - |
tFFT+Convnet 2016 [32] | 95.7% | - | - | - | - |
b-LSTM-S 2016 [33] | - | - | - | - | 92.7% |
DeepConvLSTM 2016 [34] | - | - | - | - | 93.0% |
CondConv 2018 [35] | - | 98.7% | 94.0% | 77.31% | 81.1% |
CIRF 2018 [36] | - | - | - | - | 89.1% |
LSTM+CT+CSA 2018 [37] | - | - | 89.96% | - | - |
AttnSense 2019 [38] | - | - | 89.3% | - | - |
The Layer-wise CNN 2020 [39] | 96.9% | 98.8% | 92.9% | 78.0% | 81.0% |
Lego CNN 2021 [40] | 91.4% | 97.5% | 93.5% | 74.4% | 88.1% |
Real+Virtual 2023 [41] | - | - | 69.9% | - | - |
TinyTNAS 2024 [42] | 96.7% | 96.5% | 93.7% | 77.9% | 93.2% |
mDTW+Transformer | 88.7% | 98.5% | 94.2% | 78.5% | 93.5% |
Simple | Normal | Complex | |
---|---|---|---|
Methods | HCCR | UCI-HAR | WISDM | PAMAP2 | UniMib | OPPOR |
---|---|---|---|---|---|---|
cGRU | 95.6% | 84.1% | 94.8% | 92.9% | 75.3% | 88.1% |
LSTM | 95.1% | 86.3% | 94.5% | 93.7% | 75.8% | 89.6% |
Transformer | 96.6% | 86.2% | 95.5% | 93.2% | 76.4% | 90.4% |
mDTW+cGRU | 97.4% | 87.6% | 97.1% | 94.1% | 77.1% | 92.1% |
mDTW+LSTM | 97.2% | 88.1% | 97.9% | 94.1% | 77.7% | 92.7% |
mDTW+Transformer | 98.5% | 88.7% | 98.5% | 94.2% | 78.5% | 93.5% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sun, B.; Chen, C. Sequence-Information Recognition Method Based on Integrated mDTW. Appl. Sci. 2024, 14, 8716. https://doi.org/10.3390/app14198716
Sun B, Chen C. Sequence-Information Recognition Method Based on Integrated mDTW. Applied Sciences. 2024; 14(19):8716. https://doi.org/10.3390/app14198716
Chicago/Turabian StyleSun, Boliang, and Chao Chen. 2024. "Sequence-Information Recognition Method Based on Integrated mDTW" Applied Sciences 14, no. 19: 8716. https://doi.org/10.3390/app14198716
APA StyleSun, B., & Chen, C. (2024). Sequence-Information Recognition Method Based on Integrated mDTW. Applied Sciences, 14(19), 8716. https://doi.org/10.3390/app14198716