[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3394486.3406708acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
tutorial
Public Access

Dealing with Bias and Fairness in Data Science Systems: A Practical Hands-on Tutorial

Published: 20 August 2020 Publication History

Abstract

Tackling issues of bias and fairness when building and deploying data science systems has received increased attention from the research community in recent years, yet a lot of the research has focused on theoretical aspects and very limited set of application areas and data sets. There is a lack of 1) practical training materials, 2) methodologies, and 3) tools for researchers and developers working on real-world algorithmic decision making system to deal with issues of bias and fairness. Today, treating bias and fairness as primary metrics of interest, and building, selecting, and validating models using those metrics is not standard practice for data scientists. In this hands-on tutorial we will try to bridge the gap between research and practice, by deep diving into algorithmic fairness, from metrics and definitions to practical case studies, including bias audits using the Aequitas toolkit (http://github.com/dssg/aequitas). By the end of this hands-on tutorial, the audience will be familiar with bias mitigation frameworks and tools to help them making decisions during a project based on intervention and deployment contexts in which their system will be used.

References

[1]
Kit T. Rodolfa, Pedro Saleiro, and Rayid Ghani. Chapter 11: Bias and fairness. In Ian Foster, Rayid Ghani, Ron S Jarmin, Frauke Kreuter, and Julia Lane, editors, Big data and social science: A practical guide to methods and tools. crc Press, 2020.
[2]
Kit T Rodolfa, Erika Salomon, Lauren Haynes, Iván Higuera Mendieta, Jamie Larson, and Rayid Ghani. Case study: predictive fairness to reduce misdemeanor recidivism through social service interventions. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pages 142--153, 2020.
[3]
Pedro Saleiro, Benedict Kuester, Loren Hinkson, Jesse London, Abby Stevens, Ari Anisfeld, Kit T. Rodolfa, and Rayid Ghani. Aequitas: A Bias and Fairness Audit Toolkit. (2018), nov 2018.
[4]
Moritz Hardt, Eric Price, and Nathan Srebro. Equality of Opportunity in Supervised Learning. Advances in Neural Information Processing Systems, (Nips):1--22, 2016.
[5]
Solon Barocas and Andrew D. Selbst. Big Data's Disparate Impact. California Law Review, 104(3):671--732, 2016.
[6]
Shira Mitchell, Eric Potash, Solon Barocas, Alexander D'Amour, and Kristian Lum. Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions. nov 2018.
[7]
Alexandra Chouldechova, Diana Benavides-Prado, Oleksandr Fialko, and Rhema Vaithianathan. A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions. In Conference on Fairness, Accountability and Transparency, pages 134--148, 2018.
[8]
Alexandra Chouldechova and Aaron Roth. The frontiers of fairness in machine learning. arXiv preprint arXiv:1810.08810, 2018.
[9]
Geoff Pleiss, Manish Raghavan, Felix Wu, Jon Kleinberg, and Kilian Q Weinberger. On Fairness and Calibration. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 5680--5689. Curran Associates, Inc., 2017.
[10]
Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P. Gummadi. Fairness constraints: Mechanisms for fair classification. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, AISTATS 2017, 54, 2017.
[11]
Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P. Gummadi. Fairness beyond disparate treatment & disparate impact: Learning classification without disparate mistreatment. 26th International World Wide Web Conference, WWW 2017, pages 1171--1180, 2017.
[12]
Andrew Cotter, Maya Gupta, Heinrich Jiang, Nathan Srebro, Karthik Sridharan, Serena Wang, Blake Woodworth, and Seungil You. Training Well-Generalizing Classifiers for Fairness Metrics and Other Data-Dependent Constraints. In Proceedings of the 36th International Conference on Machine Learning, volume 97, pages 1397--1405, Long Beach, California, USA, jun 2019. PMLR.
[13]
Indre liobait ? e. Measuring discrimination in algorithmic decision making. Data Mining and Knowledge Discovery, 31(4):1060--1089, jul 2017.
[14]
Sahil Verma and Julia Rubin. Fairness definitions explained. In Proceedings of the International Workshop on Software Fairness - FairWare '18, pages 1--7, New York, New York, USA, 2018. ACM Press.
[15]
Alexandra Chouldechova. Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments. Big Data, 5(2):153--163, jun 2017.
[16]
Jon Kleinberg, Himabindu Lakkaraju, Jure Leskovec, Jens Ludwig, and Sendhill Mullainathan. Human Decisions and Machine Predictions*. The Quarterly Journal of Economics, 133(January):237--293, aug 2017.
[17]
Chris Russell, Matt J Kusner, Joshua Loftus, and Ricardo Silva. When Worlds Collide: Integrating Different Counterfactual Assumptions in Fairness. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 6414--6423. Curran Associates, Inc., 2017.
[18]
Matt J Kusner, Joshua Loftus, Chris Russell, and Ricardo Silva. Counterfactual Fairness. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 4066--4076. Curran Associates, Inc., 2017.
[19]
Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. Machine bias. ProPublica, May, 23:2016, 2016.
[20]
Michael Feldman, Sorelle A. Friedler, John Moeller, Carlos Scheidegger, and Suresh Venkatasubramanian. Certifying and Removing Disparate Impact. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15, pages 259--268, New York, New York, USA, 2015. ACM Press.
[21]
Alekh Agarwal, Aliiia Beygelzimer, Miroslav Dudfk, John Langford, and Wallach Hanna. A reductions approach to fair classification. 35th International Conference on Machine Learning, ICML 2018, 1:102--119, 2018.
[22]
Yahav Bechavod and Katrina Ligett. Penalizing Unfairness in Binary Classification. jun 2017.
[23]
Naman Goel, Mohammad Yaghini, and Boi Faltings. Non-Discriminatory Machine Learning through Convex Fairness Criteria. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society - AIES '18, pages 116--116, New York, New York, USA, 2018. ACM Press.
[24]
Blake Woodworth, Suriya Gunasekar, Mesrob I. Ohannessian, and Nathan Srebro. Learning Non-Discriminatory Predictors. In Satyen Kale and Ohad Shamir, editors, Proceedings of the 2017 Conference on Learning Theory, volume 65, pages 1920--1953, Amsterdam, Netherlands, jul 2017. PMLR.

Cited By

View all
  • (2024)Fairness in Large Language Models in Three HoursProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679090(5514-5517)Online publication date: 21-Oct-2024
  • (2024)Exploring Fairness Interpretability with FairnessFriend: A Chatbot Solution2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW61823.2024.00037(246-253)Online publication date: 13-May-2024
  • (2023)Integrating a Blockchain-Based Governance Framework for Responsible AIFuture Internet10.3390/fi1503009715:3(97)Online publication date: 28-Feb-2023
  • Show More Cited By

Index Terms

  1. Dealing with Bias and Fairness in Data Science Systems: A Practical Hands-on Tutorial

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
      August 2020
      3664 pages
      ISBN:9781450379984
      DOI:10.1145/3394486
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 August 2020

      Check for updates

      Author Tags

      1. ai ethics
      2. algorithmic fairness
      3. bias

      Qualifiers

      • Tutorial

      Funding Sources

      Conference

      KDD '20
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)319
      • Downloads (Last 6 weeks)26
      Reflects downloads up to 01 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Fairness in Large Language Models in Three HoursProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679090(5514-5517)Online publication date: 21-Oct-2024
      • (2024)Exploring Fairness Interpretability with FairnessFriend: A Chatbot Solution2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW61823.2024.00037(246-253)Online publication date: 13-May-2024
      • (2023)Integrating a Blockchain-Based Governance Framework for Responsible AIFuture Internet10.3390/fi1503009715:3(97)Online publication date: 28-Feb-2023
      • (2023)The Possibility of Fairness: Revisiting the Impossibility Theorem in PracticeProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594007(400-422)Online publication date: 12-Jun-2023
      • (2023)Understanding Fairness Requirements for ML-based Software2023 IEEE 31st International Requirements Engineering Conference (RE)10.1109/RE57278.2023.00046(341-346)Online publication date: Sep-2023
      • (2023)HDFL: A Heterogeneity and Client Dropout-Aware Federated Learning Framework2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid57682.2023.00037(311-321)Online publication date: May-2023
      • (2022)A Review on Service Delivery in Tourism and Hospitality Industry Through Artificial IntelligenceProceedings of Third International Conference on Computing, Communications, and Cyber-Security10.1007/978-981-19-1142-2_34(427-436)Online publication date: 3-Jul-2022
      • (2022)Fairness, Accountability, Sustainability, Transparency (FAST) of Artificial Intelligence in Terms of Hospitality IndustryICT Analysis and Applications10.1007/978-981-16-5655-2_48(495-504)Online publication date: 7-Jan-2022
      • (2022)Big Data in Road Transport and Mobility ResearchAI-enabled Technologies for Autonomous and Connected Vehicles10.1007/978-3-031-06780-8_19(523-546)Online publication date: 8-Sep-2022
      • (2022)Systems Bias in Federated LearningFederated Learning10.1007/978-3-030-96896-0_12(259-278)Online publication date: 8-Feb-2022
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media