[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3121646.3121653guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Integrating novel class detection with classification for concept-drifting data streams

Published: 07 September 2009 Publication History

Abstract

In a typical data stream classification task, it is assumed that the total number of classes are fixed. This assumption may not be valid in a real streaming environment, where new classes may evolve. Traditional data stream classification techniques are not capable of recognizing novel class instances until the appearance of the novel class is manually identified, and labeled instances of that class are presented to the learning algorithm for training. The problem becomes more challenging in the presence of concept-drift, when the underlying data distribution changes over time. We propose a novel and efficient technique that can automatically detect the emergence of a novel class in the presence of concept-drift by quantifying cohesion among unlabeled test instances, and separation of the test instances from training instances. Our approach is non-parametric, meaning, it does not assume any underlying distributions of data. Comparison with the state-of-the-art stream classification techniques prove the superiority of our approach.

References

[1]
Chen, S., Wang, H., Zhou, S., Yu, P.: Stop chasing trends: Discovering high order models in evolving data. In: Proc. ICDE, pp. 923-932 (2008)
[2]
Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proc. ACM SIGKDD, pp. 97-106 (2001)
[3]
Kolter, J., Maloof, M.: Using additive expert ensembles to cope with concept drift. In: Proc. ICML, pp. 449-456 (2005)
[4]
Markou, M., Singh, S.: Novelty detection: A review-part 1: Statistical approaches, part 2: Neural network based approaches. Signal Processing 83 (2003)
[5]
Masud, M.M., Gao, J., Khan, L., Han, J., Thuraisingham, B.: A practical approach to classify evolving data streams: Training with limited amount of labeled data. In: Proc. ICDM, pp. 929-934 (2008)
[6]
Roberts, S.J.: Extreme value statistics for novelty detection in biomedical signal processing. In: Proc. Int. Conf. on Advances in Medical Signal and Information Processing, pp. 166-172 (2000)
[7]
Spinosa, E.J., de Leon, A.P., de Carvalho, F., Gama, J.: Olindda: a cluster-based approach for detecting novelty and concept drift in data streams. In: Proc. 2007 ACM symposium on Applied computing, pp. 448-452 (2007)
[8]
University of Texas at Dallas Technical report UTDCS-13-09 (June 2009), http://www.utdallas.edu/~mmm058000/reports/UTDCS-13-09.pdf
[9]
Wang, H., Fan, W., Yu, P., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proc. ACM SIGKDD, pp. 226-235 (2003)
[10]
yan Yeung, D., Chow, C.: Parzen-window network intrusion detectors. In: Proc. International Conference on Pattern Recognition, pp. 385-388 (2002)
[11]
Yang, Y., Wu, X., Zhu, X.: Combining proactive and reactive predictions for data streams. In: Proc. ACM SIGKDD, pp. 710-715 (2005)
[12]
Yang, Y., Zhang, J., Carbonell, J., Jin, C.: Topic-conditioned novelty detection. In: Proc. ACM SIGKDD, pp. 688-693 (2002)

Cited By

View all
  • (2011)A study of decision tree induction for data stream mining using boosting genetic programming classifierProceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part I10.1007/978-3-642-27172-4_39(315-322)Online publication date: 19-Dec-2011
  • (2010)Improving gaussian process classification with outlier detectionProceedings of the 10th Asian conference on Computer vision - Volume Part IV10.5555/1966111.1966125(153-164)Online publication date: 8-Nov-2010
  • (2010)Classification and novel class detection of data streams in a dynamic feature spaceProceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II10.5555/1888305.1888328(337-352)Online publication date: 20-Sep-2010
  • Show More Cited By
  1. Integrating novel class detection with classification for concept-drifting data streams

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ECMLPKDD'09: Proceedings of the 2009th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
    September 2009
    758 pages
    ISBN:3642041736
    • Editors:
    • Wray Buntine,
    • Marko Grobelnik,
    • Dunja Mladenić,
    • John Shawe-Taylor

    Sponsors

    • Yahoo! Research
    • PASCAL2 - Pattern Analysis, Statistical Modelling and Computational Learning
    • Google Inc.
    • Microsoft Research: Microsoft Research
    • Nokia Connecting People: Nokia Connecting People

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 07 September 2009

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 17 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2011)A study of decision tree induction for data stream mining using boosting genetic programming classifierProceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part I10.1007/978-3-642-27172-4_39(315-322)Online publication date: 19-Dec-2011
    • (2010)Improving gaussian process classification with outlier detectionProceedings of the 10th Asian conference on Computer vision - Volume Part IV10.5555/1966111.1966125(153-164)Online publication date: 8-Nov-2010
    • (2010)Classification and novel class detection of data streams in a dynamic feature spaceProceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II10.5555/1888305.1888328(337-352)Online publication date: 20-Sep-2010
    • (2010)Classification and novel class detection of data streams in a dynamic feature spaceProceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II10.1007/978-3-642-15883-4_22(337-352)Online publication date: 20-Sep-2010
    • (2010)Classification and novel class detection in data streams with active miningProceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II10.1007/978-3-642-13672-6_31(311-324)Online publication date: 21-Jun-2010

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media