[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1143844.1143974acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Fast time series classification using numerosity reduction

Published: 25 June 2006 Publication History

Abstract

Many algorithms have been proposed for the problem of time series classification. However, it is clear that one-nearest-neighbor with Dynamic Time Warping (DTW) distance is exceptionally difficult to beat. This approach has one weakness, however; it is computationally too demanding for many realtime applications. One way to mitigate this problem is to speed up the DTW calculations. Nonetheless, there is a limit to how much this can help. In this work, we propose an additional technique, numerosity reduction, to speed up one-nearest-neighbor DTW. While the idea of numerosity reduction for nearest-neighbor classifiers has a long history, we show here that we can leverage off an original observation about the relationship between dataset size and DTW constraints to produce an extremely compact dataset with little or no loss in accuracy. We test our ideas with a comprehensive set of experiments, and show that it can efficiently produce extremely fast accurate classifiers.

References

[1]
Chen, L. & Kamel, M. S. (2005). Design of Multiple Classifier Systems for Time Series Data. Multiple Classifier Systems, pp. 216--225.]]
[2]
Chen, L., Özsu, M. T., & Oria, V. (2005). Using Multi-Scale Histograms to Answer Pattern Existence and Shape Match Queries. SSDBM '05.]]
[3]
Dasarathy, B. V. (1991). Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. IEEE Computer Society Press, pp. 388--397.]]
[4]
Eads, D., Glocer, K., Perkins, S., & Theiler, J. (2005). Grammar-guided feature extraction for time series classification. NIPS '05.]]
[5]
Fu, A. W., Keogh, E., Lau, L. Y. H., & Ratanamahatana, C. A. (2005). Scaling and Time Warping in Time Series Querying. VLDB '05, pp. 649--660.]]
[6]
Geurts, P. (2002). Contributions to decision tree induction: bias/variance tradeoff and time series classification. Ph.D. thesis, University of Liege.]]
[7]
Grass, J. & Zilberstein, S. (1996). Anytime Algorithm Development Tools. Sigart Artificial Intellligence, Vol 7, ACM Press.]]
[8]
Han, J. & Kamber, M. (2000). Data Mining Concepts and Techniques. Morgan Kaufmann Publishers.]]
[9]
Hayashi, A., Mizuhara, Y., & Suematsu, N. (2005). Embedding Time Series Data for Classification. Machine Learning and Data Mining in Pattern Recognition, pp. 356--365.]]
[10]
Karydis, I., Nanopoulos, A., Papadopoulos, A., & Manolopoulos, Y. (2005). Music Retrieval in P2P Networks Under the Warping Distance, 7th International Conference on Enterprise Information Systems.]]
[11]
Keogh, E. (2002). Exact Indexing of Dynamic Time Warping. VLDB '02, pp. 406--417, Hong Kong, Aug 20--23.]]
[12]
Keogh, E. (2006). UCR Time Series Archive www.cs.ucr.edu/~eamonn/TSDMA/]]
[13]
Kim, S., Smyth, P., & Luther, S. (2004). Modeling waveform shapes with random effects segmental hidden Markov Models. Technical Report, UCI-ICS 04--05.]]
[14]
Lei, H. & Govindaraju, V. (2004). Regression Time Warping for Similarity Measure of Sequence. CIT'04, pp. 826--830.]]
[15]
Megalooikonomou, V., Wang, Q., Li, G., & Faloutsos, C. A. (2005). Multiresolution Symbolic Representation of Time Series. ICDE '05, pp. 668--679.]]
[16]
Megalooikonomou, V. (2006). Personal Communication.]]
[17]
Nanopoulos, A., Alcock, R., & Manolopoulos, Y. (2001). Feature-based Classification of Time-series Data. International Journal of Computer Research, pp. 49--61.]]
[18]
Pekalska, E., Duin, R. P. W., & Paclik, P. (2006). Prototype Selection for Dissimilarity-Based Classifiers. Pattern Recognition, 39:2, pp. 189--208.]]
[19]
Ratanamahatana, C. A. & Keogh, E. (2005). Three myths about Dynamic Time Warping Data Mining. SDM '05.]]
[20]
Rodríguez, J. J. & Alonso, C. J. (2004). Interval and dynamic time warping-based decision trees. In Proceedings of the 2004 ACM symposium on Applied computing (SAC), pp. 548--552.]]
[21]
Rodríguez, J. J., Alonso, C. J., & Boströöm, H. (2000). Learning First Order Logic Time Series Classifiers: Rules and Boosting. PKDD '00, pp. 299--308.]]
[22]
Sakoe, H. & Chiba, S. Dynamic programming algorithm optimization for spoken word recognition. (1978). IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. ASSP-26.]]
[23]
Shou, Y., Mamoulis, N., & Cheung, D. W. (2005). Fast and exact warping of time series using adaptive segmental approximations. Machine Learning, Vol 28, pp. 231--267.]]
[24]
Wei, L., Keogh, E., Van Herle, H., & Mafra-Neto, A. (2005). Atomic Wedgie: Efficient Query Filtering for Streaming Time Series. ICDM '05, pp. 490--497.]]
[25]
Wilson, D. R. & Martinez, T. R. (1997). Instance Pruning Techniques. ICML '97, Morgan Kaufmann, pp. 403--411.]]
[26]
Wu, Y. & Chang, E. Y. (2004). Distance-function design and fusion for sequence data. CIKM '04, pp. 324--333.]]
[27]
Zhu, Y. & Shasha, D. (2003). Query by Humming: a Time Series Database Approach, SIGMOD '03.]]

Cited By

View all
  • (2024)Research on Outlier Detection Methods for Dam Monitoring Data Based on Post-Data ClassificationBuildings10.3390/buildings1409275814:9(2758)Online publication date: 3-Sep-2024
  • (2024)End-to-End Signal Classification in Signed Cumulative Distribution Transform SpaceIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.337245546:9(5936-5950)Online publication date: Sep-2024
  • (2024)Activity Recognition Based on Accelerometer Data with Enhanced ROCKET Algorithm2024 IEEE 18th International Symposium on Applied Computational Intelligence and Informatics (SACI)10.1109/SACI60582.2024.10619836(000321-000326)Online publication date: 23-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '06: Proceedings of the 23rd international conference on Machine learning
June 2006
1154 pages
ISBN:1595933832
DOI:10.1145/1143844
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2006

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Acceptance Rates

ICML '06 Paper Acceptance Rate 140 of 548 submissions, 26%;
Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)64
  • Downloads (Last 6 weeks)5
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Research on Outlier Detection Methods for Dam Monitoring Data Based on Post-Data ClassificationBuildings10.3390/buildings1409275814:9(2758)Online publication date: 3-Sep-2024
  • (2024)End-to-End Signal Classification in Signed Cumulative Distribution Transform SpaceIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.337245546:9(5936-5950)Online publication date: Sep-2024
  • (2024)Activity Recognition Based on Accelerometer Data with Enhanced ROCKET Algorithm2024 IEEE 18th International Symposium on Applied Computational Intelligence and Informatics (SACI)10.1109/SACI60582.2024.10619836(000321-000326)Online publication date: 23-May-2024
  • (2024)Ramer-Douglas-Peucker Dynamic Time Warping (RDP-DTW): A Novel Data Reduction Based Dynamic Time Warping method for Time Series Classification2024 International Conference on Advances in Electrical and Communication Technologies (ICAECOT)10.1109/ICAECOT62402.2024.10828578(1-5)Online publication date: 1-Oct-2024
  • (2024)Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate OverfittingIEEE Access10.1109/ACCESS.2024.340254312(70676-70689)Online publication date: 2024
  • (2024)Multi-channel anomaly detection using graphical modelsJournal of Intelligent Manufacturing10.1007/s10845-024-02447-7Online publication date: 13-Jul-2024
  • (2024)A unified framework for financial commentary predictionInformation Technology and Management10.1007/s10799-024-00439-wOnline publication date: 5-Sep-2024
  • (2024)DDTM: A Distance-Based Data Transformation Method for Time Series ClassificationArtificial Intelligence and Robotics10.1007/978-981-99-9109-9_10(94-111)Online publication date: 4-Jan-2024
  • (2024)ROCKET with Dynamic Convolution for Time Series ClassificationAdvances in Computational Collective Intelligence10.1007/978-3-031-70248-8_21(271-282)Online publication date: 8-Sep-2024
  • (2024)Co-producing Gesture-Based AT - A Case StudyDesign for Equality and Justice10.1007/978-3-031-61698-3_28(277-282)Online publication date: 9-Jul-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media