DOI: 10.1145/3638529.3654038
research-article
Open access

Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams

Published: 14 July 2024

Abstract

This paper introduces a novel approach, evolutionary multi-objective optimisation for fairness-aware self-adjusting memory classifiers, designed to enhance fairness in machine learning algorithms applied to data stream classification. With the growing concern over discrimination in algorithmic decision-making, particularly in dynamic data stream environments, there is a need for methods that ensure fair treatment of individuals across sensitive attributes like race or gender. The proposed approach addresses this challenge by integrating the strengths of the self-adjusting memory K-Nearest-Neighbour algorithm with evolutionary multi-objective optimisation. This combination allows the new approach to efficiently manage concept drift in streaming data and leverage the flexibility of evolutionary multi-objective optimisation to maximise accuracy and minimise discrimination simultaneously. We demonstrate the effectiveness of the proposed approach through extensive experiments on various datasets, comparing its performance against several baseline methods in terms of accuracy and fairness metrics. Our results show that the proposed approach maintains competitive accuracy and significantly reduces discrimination, highlighting its potential as a robust solution for fairness-aware data stream classification. Further analyses also confirm the effectiveness of the strategies to trigger evolutionary multi-objective optimisation and adapt classifiers in the proposed approach.
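To make the two competing objectives concrete, the following is a minimal sketch of how a candidate classifier could be scored on accuracy and discrimination, and how non-dominated candidates could then be selected. The abstract does not say which discrimination measure is used, so statistical parity difference is assumed here purely for illustration. All function names are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Objectives = Tuple[float, float]  # (accuracy, discrimination)


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def statistical_parity_difference(
    y_pred: Sequence[int], protected: Sequence[bool]
) -> float:
    """Absolute gap in positive-prediction rate between the two groups.

    Assumed measure: the abstract does not name the metric it optimises.
    protected[i] is True when instance i belongs to the protected group.
    """
    prot = [p for p, s in zip(y_pred, protected) if s]
    unprot = [p for p, s in zip(y_pred, protected) if not s]
    if not prot or not unprot:
        return 0.0  # gap is undefined with an empty group; treat as no gap
    return abs(sum(unprot) / len(unprot) - sum(prot) / len(prot))


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b on both objectives and better on one.

    Accuracy is maximised, discrimination is minimised.
    """
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(points: List[Objectives]) -> List[Objectives]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        p for p in points
        if not any(dominates(q, p) for q in points if q != p)
    ]
```

For example, among candidates scoring `(0.9, 0.3)`, `(0.8, 0.1)` and `(0.7, 0.2)`, the third is dominated by the second, while the first two represent different accuracy/fairness trade-offs and both remain on the front.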

      Published In

      GECCO '24: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2024
      1657 pages
      ISBN:9798400704949
      DOI:10.1145/3638529
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. fairness
      2. multi-objective
      3. data streams
      4. swarm intelligence

      Qualifiers

      • Research-article

      Conference

      GECCO '24
      GECCO '24: Genetic and Evolutionary Computation Conference
      July 14-18, 2024
      Melbourne, VIC, Australia

      Acceptance Rates

      Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

      Article Metrics

      • Total Citations: 0
      • Total Downloads: 94
      • Downloads (Last 12 months): 94
      • Downloads (Last 6 weeks): 17
      Reflects downloads up to 16 Dec 2024
