[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3580305.3599409acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Free access

Learning from Positive and Unlabeled Multi-Instance Bags in Anomaly Detection

Published: 04 August 2023 Publication History

Abstract

In the multi-instance learning (MIL) setting instances are grouped together into bags. Labels are provided only for the bags and not on the level of individual instances. A positive bag label means that at least one instance inside the bag is positive, while a negative bag label restricts all the instances in the bag to be negative. MIL data naturally arises in many contexts, such as anomaly detection, where labels are rare and costly, and one often ends up annotating the label for sets of instances. Moreover, in many real-world anomaly detection problems, only positive labels are collected because they usually represent critical events. Such a setting, where only positive labels are provided along with unlabeled data, is called Positive and Unlabeled (PU) learning. Despite being useful for several use cases, there is no work dedicated to learning from positive and unlabeled data in a multi-instance setting for anomaly detection. Therefore, we propose the first method that learns from PU bags in anomaly detection. Our method uses an autoencoder as an underlying anomaly detector. We alter the autoencoder's objective function and propose a new loss that allows it to learn from positive and unlabeled bags of instances. We theoretically analyze this method. Experimentally, we evaluate our method on 30 datasets and show that it performs better than multiple baselines adapted to work in our setting.

Supplementary Material

MP4 File (rtfp1275-2min-promo.mp4)
Nowadays, sustainable energy is becoming more and more important. Wind turbines can produce more than 6 million kWh per year, enough to supply 1500 average EU households with electricity. Unfortunately, some adverse events -like blade icing- slow down the production of energy. Automatically detecting these adverse events on time is an important task called anomaly detection. However, because anomalous events are rare, experts cannot afford to annotate all the collected data. In practice, experts sometimes annotate that an anomaly occurred during the day without specifying the exact time ... but the system must detect the exact time of anomalous events. This falls into the area of Positive and Unlabeled Multi-Instance Learning. This setting has three key challenges: how to 1) link day labels to instance labels, 2) overcome the absence of normal labels, and 3) deal with anomalies that may not follow specific patterns. How do we solve these challenges? Check out our paper!

References

[1]
Pierre Baldi. 2012. Autoencoders, unsupervised learning, and deep architectures. In Proceedings of ICML workshop on unsupervised and transfer learning. JMLR Workshop and Conference Proceedings, PMLR, Bellevue, Washington, USA, 37--49.
[2]
Han Bao, Tomoya Sakai, Issei Sato, and Masashi Sugiyama. 2018. Convex formulation of multiple instance learning from positive and unlabeled bags. Neural Networks, Vol. 105 (2018), 132--141.
[3]
Teresa Basile, Nicola Di Mauro, Floriana Esposito, Stefano Ferilli, and Antonio Vergari. 2017. Density estimators for positive-unlabeled learning. In International Workshop on New Frontiers in Mining Complex Patterns. Springer, Springer International Publishing, Cham, 49--64.
[4]
Jessa Bekker and Jesse Davis. 2020. Learning from positive and unlabeled data: A survey. Machine Learning, Vol. 109 (2020), 719--760.
[5]
Forrest Briggs, Xiaoli Z Fern, and Raviv Raich. 2012. Rank-loss support instance machines for MIML instance annotation. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. Association for Computing Machinery, New York, NY, USA, 534--542. https://doi.org/10.1145/2339530.2339616
[6]
Guilherme O Campos, Arthur Zimek, Jörg Sander, Ricardo JGB Campello, Barbora Micenková, Erich Schubert, Ira Assent, and Michael E Houle. 2016. On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data mining and knowledge discovery, Vol. 30 (2016), 891--927.
[7]
Varun Chandola, Arindam Banerjee, and Vipin Kumar. 2009. Anomaly detection: A survey. ACM computing surveys (CSUR), Vol. 41, 3 (2009), 1--58.
[8]
Sneha Chaudhari and Shirish Shevade. 2012. Learning from positive and unlabelled examples using maximum margin clustering. In Neural Information Processing: 19th International Conference, ICONIP 2012, Doha, Qatar, November 12-15, 2012, Proceedings, Part III 19. Springer, Springer Berlin Heidelberg, Berlin, Heidelberg, 465--473.
[9]
Francesco Del Buono, Francesca Calabrese, Andrea Baraldi, Matteo Paganelli, and Francesco Guerra. 2022. Novelty detection with autoencoders for system health monitoring in industrial environments. Applied Sciences, Vol. 12, 10 (2022), 4931.
[10]
Thomas G Dietterich, Richard H Lathrop, and Tomás Lozano-Pérez. 1997. Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence, Vol. 89, 1--2 (1997), 31--71.
[11]
Marthinus Christoffel Du Plessis, Gang Niu, and Masashi Sugiyama. 2013. Clustering unclustered data: Unsupervised binary labeling of two datasets having different class balances. In 2013 Conference on Technologies and Applications of Artificial Intelligence. IEEE, 1--6.
[12]
Charles Elkan and Keith Noto. 2008. Learning classifiers from only positive and unlabeled data. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. Association for Computing Machinery, New York, NY, USA, 213--220.
[13]
Abdelali Elmoufidi, Khalid El Fahssi, Said Jai-andaloussi, Abderrahim Sekkaki, Quellec Gwenole, and Mathieu Lamard. 2018. Anomaly classification in digital mammography based on multiple-instance learning. IET Image Processing, Vol. 12, 3 (2018), 320--328.
[14]
Songhe Feng and De Xu. 2010. Transductive multi-instance multi-label learning algorithm with application to automatic image annotation. Expert Systems with Applications, Vol. 37, 1 (2010), 661--670.
[15]
Jun Han and Claudio Moraga. 1995. The influence of the sigmoid function parameters on the speed of backpropagation learning. In International workshop on artificial neural networks. Springer, Springer Berlin Heidelberg, Berlin, Heidelberg, 195--201.
[16]
Songqiao Han, Xiyang Hu, Hailiang Huang, Minqi Jiang, and Yue Zhao. 2022. Adbench: Anomaly detection benchmark. Advances in Neural Information Processing Systems, Vol. 35 (2022), 32142--32159.
[17]
Jing Huo, Yang Gao, Wanqi Yang, and Hujun Yin. 2012. Abnormal event detection via multi-instance dictionary learning. In International conference on intelligent data engineering and automated learning. Springer, Springer Berlin Heidelberg, Berlin, Heidelberg, 76--83.
[18]
Jing Huo, Yang Gao, Wanqi Yang, and Hujun Yin. 2014. Multi-instance dictionary learning for detecting abnormal events in surveillance videos. International journal of neural systems, Vol. 24, 03 (2014), 1430010.
[19]
Tomoharu Iwata, Machiko Toyoda, Shotaro Tora, and Naonori Ueda. 2020. Anomaly detection with inexact labels. Machine Learning, Vol. 109 (2020), 1617--1633.
[20]
Kristen Jaskie, Charles Elkan, and Andreas Spanias. 2019. A Modified Logistic Regression for Positive and Unlabeled Learning. In 2019 53rd Asilomar Conference on Signals, Systems, and Computers. IEEE, 2007--2011. https://doi.org/10.1109/IEEECONF44664.2019.9048765
[21]
Weixin Li and Nuno Vasconcelos. 2015. Multiple instance learning for soft bags via top instances. In Proceedings of the IEEE conference on computer vision and pattern recognition.
[22]
Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou. 2008. Isolation forest. In 2008 Eighth IEEE International Conference on Data Mining. IEEE, 413--422. https://doi.org/10.1109/ICDM.2008.17
[23]
Guoqing Liu, Jianxin Wu, and Zhi-Hua Zhou. 2012. Key instance detection in multi-instance learning. In Asian Conference on Machine Learning. PMLR.
[24]
Oded Maron and Tomás Lozano-Pérez. 1997. A framework for multiple-instance learning. Advances in neural information processing systems (1997).
[25]
Lorenzo Perini, Paul Buerkner, and Arto Klami. 2022a. Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection. arXiv preprint arXiv:2210.10487 (2022).
[26]
Lorenzo Perini, Vincent Vercruyssen, and Jesse Davis. 2020. Class prior estimation in active positive and unlabeled learning. In Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence. IJCAI-PRICAI.
[27]
Lorenzo Perini, Vincent Vercruyssen, and Jesse Davis. 2021. Quantifying the confidence of anomaly detectors in their example-wise predictions. In Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14-18, 2020, Proceedings, Part III. Springer, 227--243.
[28]
Lorenzo Perini, Vincent Vercruyssen, and Jesse Davis. 2022b. Transferring the contamination factor between anomaly detection domains by shape similarity. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 4128--4136.
[29]
Sang Phan, Duy-Dinh Le, and Shin'ichi Satoh. 2015. Multimedia event detection using event-driven multiple instance learning. In Proceedings of the 23rd ACM international conference on Multimedia.
[30]
John Platt et al. 1999. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in large margin classifiers (1999).
[31]
John C Platt. 2000. 5 Probabilities for SV Machines. Advances in Large Margin Classifiers (2000).
[32]
Kenneth V Price. 2013. Differential evolution. In Handbook of optimization. Springer.
[33]
Gwenolé Quellec, Guy Cazuguel, Béatrice Cochener, and Mathieu Lamard. 2017. Multiple-instance learning for medical image and video analysis. IEEE reviews in biomedical engineering (2017).
[34]
Gwenolé Quellec, Mathieu Lamard, Michel Cozic, Gouenou Coatrieux, and Guy Cazuguel. 2016. Multiple-instance learning for anomaly detection in digital mammography. Ieee transactions on medical imaging (2016).
[35]
Leonard Sabetti and Ronald Heijmans. 2021. Shallow or deep? Training an autoencoder to detect anomalous flows in a retail payment system. Latin American Journal of Central Banking (2021).
[36]
Beomjo Shin, Junsu Cho, Hwanjo Yu, and Seungjin Choi. 2021. Sparse Network Inversion for Key Instance Detection in Multiple Instance Learning. In 2020 25th International Conference on Pattern Recognition (ICPR).
[37]
Jonas Soenen, Elia Van Wolputte, Lorenzo Perini, Vincent Vercruyssen, Wannes Meert, Jesse Davis, and Hendrik Blockeel. 2021. The Effect of Hyperparameter Tuning on the Comparative Evaluation of Unsupervised Anomaly Detection Methods. In Proceedings of the KDD'21 Workshop on Outlier Detection and Description. Outlier Detection and Description Organising Committee.
[38]
Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel Du Plessis, Song Liu, and Ichiro Takeuchi. 2013. Density-difference estimation. Neural Computation, Vol. 25, 10 (2013), 2734--2775.
[39]
Vincent Vercruyssen, Lorenzo Perini, Wannes Meert, and Jesse Davis. 2023. Multi-domain Active Learning for Semi-supervised Anomaly Detection. In Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part IV. Springer, 485--501.
[40]
Vincent Vercruyssen, Meert Wannes, Verbruggen Gust, Maes Koen, Bäumer Ruben, and Davis Jesse. 2018. Semi-supervised anomaly detection with an application to water analytics. In Proceedings of 18th IEEE International Conference on Data Mining. IEEE.
[41]
Junxiang Wang, Liang Zhao, and Yanfang Ye. 2018. Semi-supervised multi-instance interpretable models for flu shot adverse event detection. In 2018 IEEE International Conference on Big Data (Big Data). IEEE.
[42]
Jia Wu, Xingquan Zhu, Chengqi Zhang, and Zhihua Cai. 2014. Multi-instance learning from positive and unlabeled bags. In Advances in Knowledge Discovery and Data Mining: 18th Pacific-Asia Conference, PAKDD 2014, Tainan, Taiwan, May 13-16, 2014. Proceedings, Part I 18. Springer, 237--248.
[43]
Xin-Shun Xu, Yuan Jiang, Xiangyang Xue, and Zhi-Hua Zhou. 2012. Semi-supervised multi-instance multi-label learning for video annotation task. In Proceedings of the 20th ACM international conference on Multimedia.
[44]
Cha Zhang, John Platt, and Paul Viola. 2005. Multiple instance boosting for object detection. Advances in neural information processing systems (2005).
[45]
Lijun Zhang, Kai Liu, Yufeng Wang, and Zachary Bosire Omariba. 2018. Ice detection model of wind turbine blades based on random forest classifier. Energies (2018).
[46]
Hongshan Zhao, Huihai Liu, Wenjing Hu, and Xihui Yan. 2018. Anomaly detection and fault analysis of wind turbine components based on deep learning network. Renewable energy (2018).
[47]
Yu Zhou and Anlong Ming. 2016. Semi-Supervised Multiple Instance Learning and its application in visual tracking. In 2016 8th International Conference on Wireless Communications & Signal Processing (WCSP). IEEE.
[48]
Zhi-Hua Zhou. 2004. Multi-instance learning: A survey. Department of Computer Science & Technology, Nanjing University, Tech. Rep (2004).
[49]
Zhi-Hua Zhou, Yu-Yin Sun, and Yu-Feng Li. 2009. Multi-instance learning by treating instances as non-iid samples. In Proceedings of the 26th annual international conference on machine learning.

Cited By

View all
  • (2025)Decoding the m6A epitranscriptomic landscape for biotechnological applications using a direct RNA sequencing approachNature Communications10.1038/s41467-025-56173-616:1Online publication date: 18-Jan-2025
  • (2024)Targeted Detection of Anomalous Merchants on Integrated Payment Platforms via Multifaceted Transaction Representation Learning2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825015(2170-2178)Online publication date: 15-Dec-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2023
5996 pages
ISBN:9798400701030
DOI:10.1145/3580305
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 August 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. anomaly detection
  2. multi-instance learning
  3. pu learning

Qualifiers

  • Research-article

Funding Sources

  • FWO-Vlaanderen
  • VLAIO
  • Flemish Government (AI Research Program)

Conference

KDD '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)363
  • Downloads (Last 6 weeks)29
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Decoding the m6A epitranscriptomic landscape for biotechnological applications using a direct RNA sequencing approachNature Communications10.1038/s41467-025-56173-616:1Online publication date: 18-Jan-2025
  • (2024)Targeted Detection of Anomalous Merchants on Integrated Payment Platforms via Multifaceted Transaction Representation Learning2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825015(2170-2178)Online publication date: 15-Dec-2024

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media