[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2505515.2505528acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Feedback-driven multiclass active learning for data streams

Published: 27 October 2013 Publication History

Abstract

Active learning is a promising way to efficiently build up training sets with minimal supervision. Most existing methods consider the learning problem in a pool-based setting. However, in a lot of real-world learning tasks, such as crowdsourcing, the unlabeled samples, arrive sequentially in the form of continuous rapid streams. Thus, preparing a pool of unlabeled data for active learning is impractical. Moreover, performing exhaustive search in a data pool is expensive, and therefore unsuitable for supporting on-the-fly interactive learning in large scale data. In this paper, we present a systematic framework for stream-based multi-class active learning. Following the reinforcement learning framework, we propose a feedback-driven active learning approach by adaptively combining different criteria in a time-varying manner. Our method is able to balance exploration and exploitation during the learning process. Extensive evaluation on various benchmark and real-world datasets demonstrates the superiority of our framework over existing methods.

References

[1]
V. Ambati, S. Vogel, and J. G. Carbonell. Active learning and crowd-sourcing for machine translation. In The International Conference on Language Resources and Evaluation, 2010.
[2]
S. Argamon-Engelson and I. Dagan. Committee-based sample selection for probabilistic classifiers. Journal of Artificial Intelligence Research (JAIR), 11:335--360, 1999.
[3]
Y. Baram, R. El-Yaniv, and K. Luz. Online choice of active learning algorithms. Journal of Artificial Intelligence Research (JAIR), 5:255--291, Dec. 2004.
[4]
S. Chen, T. Zhang, C. Zhang, and Y. Cheng. A real-time face detection and recognition system for a mobile robot in a complex background. Artif. Life Robot., 15(4):439--443, Dec. 2010.
[5]
Y. Cheng, Y. Xie, K. Zhang, A. Agrawal, and A. Choudhary. Cluchunk: clustering large scale user-generated content incorporating chunklet information. In Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, BigMine '12, pages 12--19, 2012.
[6]
Y. Cheng, K. Zhang, Y. Xie, A. Agrawal, and A. Choudhary. On active learning in hierarchical classification. In Proceedings of the 21st ACM international conference on Information and knowledge management, CIKM '12, pages 2467--2470, New York, NY, USA, 2012. ACM.
[7]
Y. Cheng, K. Zhang, Y. Xie, A. Agrawal, W.-k. Liao, and A. Choudhary. Learning to group web text incorporating prior information. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops, ICDMW '11, pages 212--219. IEEE Computer Society, 2011.
[8]
Y. Cheng, T. Zhang, and S. Chen. Fast person-specific image retrieval using a simple and efficient clustering method. In Proceedings of the 2009 international conference on Robotics and biomimetics, ROBIO'09, pages 1973--1977, Piscataway, NJ, USA, 2009. IEEE Press.
[9]
W. Chu, M. Zinkevich, L. Li, A. Thomas, and B. Tseng. Unbiased online active learning in data streams. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011, pages 195--203, New York, NY, USA, 2011. ACM.
[10]
A. M. Dan Pelleg. Active learning for anomaly and rare-category detection. In Advances in Neural Information Processing Systems 18, December 2004.
[11]
S. Dasgupta, A. T. Kalai, and C. Monteleoni. Analysis of perceptron-based active learning. J. Mach. Learn. Res., 10:281--299, June 2009.
[12]
P. Donmez, J. G. Carbonell, and P. N. Bennett. Dual strategy active learning. In Proceedings of the 18th European conference on Machine Learning, ECML '07, pages 116--127, Berlin, Heidelberg, 2007. Springer-Verlag.
[13]
S. Ebert, M. Fritz, and B. Schiele. Ralf: A reinforced active learning formulation for object class recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, Rhode Island, 06/2012 2012.
[14]
Y. Freund, H. S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Mach. Learn., 28(2--3):133--168, Sept. 1997.
[15]
T. M. Hospedales, S. Gong, and T. Xiang. Finding rare classes: adapting generative and discriminative models in active learning. In Proceedings of the 15th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2011, pages 296--308, Berlin, Heidelberg, 2011. Springer-Verlag.
[16]
S.-J. Huang, R. Jin, and Z.-H. Zhou. Active learning by querying informative and representative examples. In Proc. of Annual Conference on Neural Information Processing Systems, pages 892--900, 2010.
[17]
J. K. Jack W. Stokes, John C. Platt and M. Shilman. Aladin: Active learning of anomalies to detect intrusions bibtex. Technical report, 2008.
[18]
H. T. Nguyen and A. Smeulders. Active learning using pre-clustering. In Proceedings of the Twenty-first International Conference on Machine Learning, ICML '04, pages 79--, New York, NY, USA, 2004. ACM.
[19]
T. Osugi, D. Kun, and S. Scott. Balancing exploration and exploitation: A new algorithm for active machine learning. In Proceedings of the Fifth IEEE International Conference on Data Mining, ICDM 2005, pages 330--337, Washington, DC, USA, 2005. IEEE Computer Society.
[20]
T. Scheffer, C. Decomain, and S. Wrobel. Active hidden markov models for information extraction. In Proceedings of the 4th International Conference on Advances in Intelligent Data Analysis, IDA '01, pages 309--318, London, UK, UK, 2001. Springer-Verlag.
[21]
B. Settles. Active learning literature survey. Technical report, 2010.
[22]
H. S. Seung, M. Opper, and H. Sompolinsky. Query by committee. In Proceedings of The Fifth Annual Workshop on Computational Learning Theory, COLT 1992, pages 287--294, New York, NY, USA, 1992. ACM.
[23]
S. Vijayanarasimhan and K. Grauman. Large-scale live active learning: Training object detectors with crawled data and crowds. In Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, pages 1449--1456, Washington, DC, USA, 2011. IEEE Computer Society.
[24]
I.vZliobaité, A. Bifet, B. Pfahringer, and G. Holmes. Active learning with evolving streaming data. In Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III, ECML PKDD'11, pages 597--612, Berlin, Heidelberg, 2011. Springer-Verlag.
[25]
P. Welinder and P. Perona. Online crowdsourcing: rating annotators and obtaining cost-effective labels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010.
[26]
K. Zhang, Y. Cheng, W.-k. Liao, and A. Choudhary. Mining millions of reviews: a technique to rank products based on importance of reviews. In Proceedings of the 13th International Conference on Electronic Commerce, ICEC '11, pages 12:1--12:8, New York, NY, USA, 2012. ACM.
[27]
X. Zhu, P. Zhang, X. Lin, and Y. Shi. Active learning from data streams. In Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, ICDM 2007, pages 757--762, Washington, DC, USA, 2007. IEEE Computer Society.

Cited By

View all
  • (2024)Data-Centric Green Artificial Intelligence: A SurveyIEEE Transactions on Artificial Intelligence10.1109/TAI.2023.33152725:5(1973-1989)Online publication date: May-2024
  • (2023)An adaptive active learning algorithm with informativeness and representativenessIntelligent Data Analysis10.3233/IDA-21641827:1(199-222)Online publication date: 30-Jan-2023
  • (2022)PMAL: A Proxy Model Active Learning Approach for Vision Based Industrial ApplicationsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353493218:2s(1-18)Online publication date: 6-Oct-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management
October 2013
2612 pages
ISBN:9781450322638
DOI:10.1145/2505515
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. active learning
  2. adaptive criteria
  3. reinforcement learning
  4. stream data mining

Qualifiers

  • Research-article

Conference

CIKM'13
Sponsor:
CIKM'13: 22nd ACM International Conference on Information and Knowledge Management
October 27 - November 1, 2013
California, San Francisco, USA

Acceptance Rates

CIKM '13 Paper Acceptance Rate 143 of 848 submissions, 17%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Data-Centric Green Artificial Intelligence: A SurveyIEEE Transactions on Artificial Intelligence10.1109/TAI.2023.33152725:5(1973-1989)Online publication date: May-2024
  • (2023)An adaptive active learning algorithm with informativeness and representativenessIntelligent Data Analysis10.3233/IDA-21641827:1(199-222)Online publication date: 30-Jan-2023
  • (2022)PMAL: A Proxy Model Active Learning Approach for Vision Based Industrial ApplicationsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353493218:2s(1-18)Online publication date: 6-Oct-2022
  • (2022)Active Learning for Network Traffic Classification: A Technical StudyIEEE Transactions on Cognitive Communications and Networking10.1109/TCCN.2021.31190628:1(422-439)Online publication date: Mar-2022
  • (2022)Uncertainty Assessment-Based Active Learning for Reliable Fire Detection SystemsIEEE Access10.1109/ACCESS.2022.319085210(74722-74732)Online publication date: 2022
  • (2021)Active learning for imbalanced data under cold startProceedings of the Second ACM International Conference on AI in Finance10.1145/3490354.3494423(1-9)Online publication date: 3-Nov-2021
  • (2021)Crowdlearning: A framework for collaborative and personalized learning2016 IEEE Frontiers in Education Conference (FIE)10.1109/FIE.2016.7757355(1-9)Online publication date: 11-Mar-2021
  • (2020)Automatic traceability link recovery via active learningFrontiers of Information Technology & Electronic Engineering10.1631/FITEE.190022221:8(1217-1225)Online publication date: 4-Jul-2020
  • (2020)Machine Learning in the Wild: The Case of User-Centered Learning in Cyber Physical Systems2020 International Conference on COMmunication Systems & NETworkS (COMSNETS)10.1109/COMSNETS48256.2020.9027329(275-281)Online publication date: Jan-2020
  • (2020)Activity recognition through interactive machine learning in a dynamic sensor settingPersonal and Ubiquitous Computing10.1007/s00779-020-01414-228:1(273-286)Online publication date: 9-Jun-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media