[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3406865.3418584acmconferencesArticle/Chapter ViewAbstractPublication PagescscwConference Proceedingsconference-collections
short-paper

Interrogating Data Science

Published: 17 October 2020 Publication History

Abstract

Data science provides powerful tools and methods. CSCW researchers have contributed insightfulstudies of conventional work-practices in data science - and particularly machine learning. However,recent research has shown that human skills and collaborative decision-making, play important rolesin defining data, acquiring data, curating data, designing data, and creating data. This workshopgathers researchers and practitioners together to take a collective and critical look at data sciencework-practices, and at how those work-practices make crucial and often invisible impacts on theformal work of data science. When we understand the human and social contributions to data sciencepipelines, we can constructively redesign both work and technologies for new insights, theories, andchallenges.

References

[1]
REFERENCES
[2]
Cecilia Aragon, Clayton Hutto, Andy Echenique, Brittany Fiore-Gartland, Yun Huang, Jinyoung Kim, Gina Neff, Wanli Xing, and Joseph Bayer. 2016. Developing a research agenda for human-centered data science. In Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing Companion. 529--535.
[3]
Shaowen Bardzell, Daniela K Rosner, and Jeffrey Bardzell. 2012. Crafting quality in design: integrity, creativity, and public sensibility. In Proceedings of the Designing Interactive Systems Conference. 11--20.
[4]
Moria Bergman, Tova Milo, Slava Novgorodov, and Wang-Chiew Tan. 2015. QOCO: A query oriented data cleaning system with oracles. Proceedings of the VLDB Endowment 8, 12 (2015), 1900--1903.
[5]
Avrim Blum, John Hopcroft, and Ravindran Kannan. 2020. Foundations of Data Science. Cambridge University Press.
[6]
Kirsten Boehner, Shay David, Joseph Kaye, and Phoebe Sengers. 2005. Critical technical practice as a methodology for values in design. In CHI 2005 Workshop on quality, values, and choices. 2--7.
[7]
Christine L Borgman. 2020. Big Data, Little Data, or No Data? Why Human Interaction with Data is a Hard Problem. In Proceedings of the 2020 Conference on Human Information Interaction and Retrieval. 1--1.
[8]
Stevie Chancellor, Eric PS Baumer, and Munmun De Choudhury. 2019. Who is the" Human" in Human-Centered Machine Learning: The Case of Predicting Mental Health from Social Media. Proceedings of the ACM on Human-Computer Interaction 3, CSCW (2019), 1--32.
[9]
Stevie Chancellor, Shion Guha, Jofish Kaye, Jen King, Niloufar Salehi, Sarita Schoenebeck, and Elizabeth Stowell. 2019. The Relationships between Data, Power, and Justice in CSCW Research. In Conference Companion Publication of the 2019 on Computer Supported Cooperative Work and Social Computing. 102--105.
[10]
Amy Cheatle and Steven J Jackson. 2015. Digital entanglements: craft, computation and collaboration in fine art furniture production. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing. 958--968.
[11]
Rob Comber, Shaowen Bardzell, Jeffery Bardzell, Michael Hazas, and Michael Muller. 2020. Announcing a New CHI Subcommittee: Critical and Sustainable Computing. To appear in Interactions. (2020).
[12]
Jaimie Drozdal, Justin Weisz, Dakuo Wang, Gaurav Dass, Bingsheng Yao, Changruo Zhao, Michael Muller, Lin Ju, and Hui Su. 2020. Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems. In Proceedings of the 25th International Conference on Intelligent User Interfaces. 297--307.
[13]
Melanie Feinberg. 2017. A design perspective on data. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. 2952--2963.
[14]
Melanie Feinberg. 2017. Material Vision. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing. 604--617.
[15]
Joel Grus. 2019. Data science from scratch: first principles with python. O'Reilly Media.
[16]
Fred Hohman, Kanit Wongsuphasawat, Mary Beth Kery, and Kayur Patel. 2020. Understanding and Visualizing Data Iteration in Machine Learning. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1--13.
[17]
Laura Igual and Santi Seguí. [n.d.]. Introduction to Data Science. In Introduction to Data Science.
[18]
Sean Kandel, Andreas Paepcke, Joseph M Hellerstein, and Jeffrey Heer. 2012. Enterprise data analysis and visualization: An interview study. IEEE Transactions on Visualization and Computer Graphics 18, 12 (2012), 2917--2926.
[19]
Marina Kogan, Aaron Halfaker, Shion Guha, Cecilia Aragon, Michael Muller, and Stuart Geiger. 2020. Mapping Out Human-Centered Data Science: Methods, Approaches, and Best Practices. In Companion of the 2020 ACM International Conference on Supporting Group Work. 151--156.
[20]
Jiali Liu, Nadia Boukhelifa, and James R Eagan. 2019. Understanding the role of alternatives in data analysis practices. IEEE transactions on visualization and computer graphics 26, 1 (2019), 66--76.
[21]
Michael Muller, Melanie Feinberg, Timothy George, Steven J Jackson, Bonnie E John, Mary Beth Kery, and Samir Passi. 2019. Human-centered study of data science work practices. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. 1--8.
[22]
Michael Muller, Ingrid Lange, Dakuo Wang, David Piorkowski, Jason Tsay, Q Vera Liao, Casey Dugan, and Thomas Erickson. 2019. How data science workers work with data: Discovery, capture, curation, design, creation. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--15.
[23]
Michael Muller, Christine Wolf, Josh Andres, Michael Desmond, Narendra Nath Joshi, Zahra Ashktorab, Aabhas Sharma, Kristina Brimijoin, Qian Pan, Evelyn Duesterwald, and Casey Dugan. 2020. Designing Ground Truth and the Social Life of Labels. Submitted for publication to CHI 2021.
[24]
Gina Neff, Anissa Tanweer, Brittany Fiore-Gartland, and Laura Osburn. 2017. Critique and contribute: A practice-based framework for improving critical data studies and data science. Big data 5, 2 (2017), 85--97.
[25]
Samir Passi and Steven Jackson. 2017. Data vision: Learning to see through algorithmic abstraction. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing. 2436--2447.
[26]
Kathleen H Pine and Max Liboiron. 2015. The politics of measurement and action. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 3147--3156.
[27]
Daniela Rosner, Jean-François Blanchette, Leah Buechley, Paul Dourish, and Melissa Mazmanian. 2012. From materials to materiality: connecting practice and theory in hc. In CHI'12 Extended Abstracts on Human Factors in Computing Systems. 2787--2790.
[28]
Daniela K Rosner. 2012. Craft, computing & culture. In Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work Companion. 319--322.
[29]
Adam Rule, Aurélien Tabard, and James D Hollan. 2018. Exploration and explanation in computational notebooks. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--12.
[30]
Cathrine Seidelin. 2020. Towards a Co-design Perspective on Data. Ph.D. Dissertation. IT-University of Copenhagen.
[31]
Cathrine Seidelin, Yvonne Dittrich, and Erik Grönvall. 2018. Data work in a knowledge-broker organisation: how crossorganisational data maintenance shapes human data interactions. In Proceedings of the 32nd International BCS Human Computer Interaction Conference 32. 1--12.
[32]
Anissa Tanweer, Brittany Fiore-Gartland, and Cecilia Aragon. 2016. Impediment to insight to innovation: understanding data assemblages through the breakdown--repair process. Information, Communication & Society 19, 6 (2016), 736--752.
[33]
Jake VanderPlas. 2016. Python data science handbook: Essential tools for working with data. " O'Reilly Media, Inc.".
[34]
Roman Vershynin. 2018. High-dimensional probability: An introduction with applications in data science. Vol. 47. Cambridge university press.
[35]
DakuoWang, Justin DWeisz, Michael Muller, Parikshit Ram,Werner Geyer, Casey Dugan, Yla Tausczik, Horst Samulowitz, and Alexander Gray. 2019. Human-AI Collaboration in Data Science: Exploring Data Scientists? Perceptions of Automated AI. Proceedings of the ACM on Human-Computer Interaction 3, CSCW (2019), 1--24.
[36]
Amy X Zhang, Michael Muller, and Dakuo Wang. 2020. How do data science workers collaborate? roles, workflows, and tools. Proceedings of the ACM on Human-Computer Interaction 4, CSCW1 (2020), 1--23.

Cited By

View all
  • (2024)"Guilds" as Worker Empowerment and Control in a Chinese Data Work PlatformProceedings of the ACM on Human-Computer Interaction10.1145/36869048:CSCW2(1-27)Online publication date: 8-Nov-2024
  • (2024)Bitacora: A Toolkit for Supporting NonProfits to Critically Reflect on Social Media Data UseProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642673(1-29)Online publication date: 11-May-2024
  • (2023)From Bias to Repair: Error as a Site of Collaboration and Negotiation in Applied Data Science WorkProceedings of the ACM on Human-Computer Interaction10.1145/35796077:CSCW1(1-32)Online publication date: 16-Apr-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CSCW '20 Companion: Companion Publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing
October 2020
559 pages
ISBN:9781450380591
DOI:10.1145/3406865
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. critical computing
  2. human centered data science (hcds)
  3. human centered machine learning (hcml)
  4. work-practices.

Qualifiers

  • Short-paper

Conference

CSCW '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 2,235 of 8,521 submissions, 26%

Upcoming Conference

CSCW '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)30
  • Downloads (Last 6 weeks)4
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)"Guilds" as Worker Empowerment and Control in a Chinese Data Work PlatformProceedings of the ACM on Human-Computer Interaction10.1145/36869048:CSCW2(1-27)Online publication date: 8-Nov-2024
  • (2024)Bitacora: A Toolkit for Supporting NonProfits to Critically Reflect on Social Media Data UseProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642673(1-29)Online publication date: 11-May-2024
  • (2023)From Bias to Repair: Error as a Site of Collaboration and Negotiation in Applied Data Science WorkProceedings of the ACM on Human-Computer Interaction10.1145/35796077:CSCW1(1-32)Online publication date: 16-Apr-2023
  • (2023)Mobilizing Social Media Data: Reflections of a Researcher Mediating between Data and OrganizationProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3580916(1-19)Online publication date: 19-Apr-2023
  • (2023) Data Flourishing: Developing Human‐Centered Data Science through Communities of Ethical Practice Proceedings of the Association for Information Science and Technology10.1002/pra2.79360:1(338-352)Online publication date: 22-Oct-2023
  • (2022)Interrogating Data Work as a Community of PracticeProceedings of the ACM on Human-Computer Interaction10.1145/35551986:CSCW2(1-28)Online publication date: 11-Nov-2022
  • (2022)Model Positionality and Computational Reflexivity: Promoting Reflexivity in Data ScienceProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3501998(1-19)Online publication date: 29-Apr-2022
  • (2022)Interrogating Human-centered Data Science: Taking Stock of Opportunities and LimitationsExtended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491101.3503740(1-6)Online publication date: 27-Apr-2022
  • (2021)Datasheets for Datasets help ML Engineers Notice and Understand Ethical Issues in Training DataProceedings of the ACM on Human-Computer Interaction10.1145/34795825:CSCW2(1-27)Online publication date: 18-Oct-2021
  • (2021)Do Datasets Have Politics? Disciplinary Values in Computer Vision Dataset DevelopmentProceedings of the ACM on Human-Computer Interaction10.1145/34760585:CSCW2(1-37)Online publication date: 18-Oct-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media