This article aims to provide a consolidated and unifying framework for the user-centric evaluation of conversational recommender systems rather than investigating the effects of specific design factors on a CRS’s
user experience (UX). User-centric evaluation has gained extensive attention in the community of recommender systems. Recently, researchers have discussed the importance of subjective evaluation metrics (perception-oriented) in addition to objective metrics (computation-oriented), such as algorithmic accuracy, understanding rate, and dialogue turns [
47]. The previous studies show that the user’s perception of conversations strongly influences the overall user experience of a CRS [
52,
72,
89]. Therefore, we have developed a unifying user-centric evaluation framework called
CRS-Que for conversational recommender systems. Compared with the original
ResQue model that primarily focuses on traditional recommender systems [
95], our framework seamlessly integrates several important user experience constructs of conversations into the
ResQue model, allowing researchers and practitioners to evaluate a CRS more comprehensively. Specifically, by reviewing the existing UX metrics of conversational agents, we identified eight constructs (e.g., CUI Adaptability, CUI Response Quality, and CUI Understanding) that are closely related to the quality of conversations based on the theory of rapport [
116] and humanlikeness [
34]. Then, by performing CFA, we merged several conversation constructs and integrated them into
ResQue. Ultimately,
CRS-Que model accommodates adaptability, understanding, attentiveness, response quality, rapport, and humanness. To validate our proposed evaluation framework, we conducted two user studies to evaluate two conversational recommender systems (i.e.,
MusicBot and
PhoneBot). The two studies target different recommendation domains (i.e., low user involvement and high user involvement) and devices (i.e., personal computers and mobile phones), which help us confirm the robustness and generalizability of our framework.