Computer Science > Human-Computer Interaction

arXiv:2309.10187 (cs)

[Submitted on 18 Sep 2023 (v1), last revised 3 Dec 2024 (this version, v3)]

Title: Collecting Qualitative Data at Scale with Large Language Models: A Case Study

Authors: Alejandro Cuevas, Jennifer V. Scurrell, Eva M. Brown, Jason Entenmann, Madeleine I. G. Daepp

Abstract: Chatbots have shown promise as tools to scale qualitative data collection. Recent advances in Large Language Models (LLMs) could accelerate this process by allowing researchers to easily deploy sophisticated interviewing chatbots. We test this assumption by conducting a large-scale user study (n=399) evaluating three chatbots: two LLM-based and one baseline that employs hard-coded questions. We evaluate the results with respect to participant engagement and experience, established metrics of chatbot quality grounded in theories of effective communication, and a novel scale evaluating "richness", or the extent to which responses capture the complexity and specificity of the social context under study. We find that, while the chatbots were able to elicit high-quality responses based on established evaluation metrics, the responses rarely capture participants' specific motives or personalized examples, and thus perform poorly with respect to richness. We further find low inter-rater reliability between LLMs and humans in the assessment of both quality and richness metrics. Our study offers a cautionary tale for scaling and evaluating qualitative research with LLMs.

Comments:	27 pages, 6 figures
Subjects:	Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2309.10187 [cs.HC]
	(or arXiv:2309.10187v3 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2309.10187

Submission history

From: Alejandro Cuevas
[v1] Mon, 18 Sep 2023 22:30:52 UTC (1,599 KB)
[v2] Tue, 10 Oct 2023 21:45:04 UTC (1,592 KB)
[v3] Tue, 3 Dec 2024 22:09:11 UTC (1,768 KB)
