Abstract
The goal of this research was the development of a practical architecture for the computer-based tutoring of teams. This article examines the relationship of team behaviors as antecedents to successful team performance and learning during adaptive instruction guided by Intelligent Tutoring Systems (ITSs). Adaptive instruction is a training or educational experience tailored by artificially-intelligent, computer-based tutors with the goal of optimizing learner outcomes (e.g., knowledge and skill acquisition, performance, enhanced retention, accelerated learning, or transfer of skills from instructional environments to work environments). The core contribution of this research was the identification of behavioral markers associated with the antecedents of team performance and learning, thus enabling the development and refinement of teamwork models in ITS architectures. Teamwork focuses on the coordination, cooperation, and communication among individuals to achieve a shared goal. For ITSs to optimally tailor team instruction, tutors must have key insights about both the team and the learners on that team. To aid the modeling of teams, we examined the literature to evaluate the relationship of teamwork behaviors (e.g., communication, cooperation, coordination, cognition, leadership/coaching, and conflict) with team outcomes (learning, performance, satisfaction, and viability) as part of a large-scale meta-analysis of the ITS, team training, and team performance literature. While ITSs have been used infrequently to instruct teams, the goal of this meta-analysis was to make team tutoring more ubiquitous by: identifying significant relationships between team behaviors and effective performance and learning outcomes; developing instructional guidelines for team tutoring based on these relationships; and applying these team tutoring guidelines to the Generalized Intelligent Framework for Tutoring (GIFT), an open-source architecture for authoring, delivering, managing, and evaluating adaptive instructional tools and methods. In doing this, we have designed a domain-independent framework for the adaptive instruction of teams.
Introduction
While one-to-one human tutoring has been claimed to be significantly more effective than one-to-many instructional methods (e.g., traditional classroom instruction; Bloom 1984; VanLehn 2011), it is neither a practical nor an affordable solution in large organizations (e.g., academic, corporate, or military; Sottilare and Proctor 2012). The use of computer-based tutoring programs for learning has seen renewed interest in training and educational domains, and one-to-one computer-based tutoring continues to emerge as a practical alternative to one-to-one human tutoring.
One-to-one tutoring via Intelligent Tutoring Systems (ITSs) provides tailored experiences to engage individual learners and offers an effective means to enhance their learning and performance, but such tutoring has focused mainly on well-defined educational domains (e.g., cognitive tasks involving problem solving or decision-making). Tutors for physics, mathematics, and software programming make up the bulk of the ITSs produced today. A recent review of artificial intelligence in education (AIED) meta-analyses by du Boulay (2016) noted investigations by VanLehn (2011), Ma et al. (2014), Kulik and Fletcher (2015), Steenbergen-Hu and Cooper (2013, 2014), and Pane et al. (2014). Each meta-analysis provided a range of results for effectiveness in the context of one-to-one tutoring in individual instructional domains.
In recent years, military trainers (U.S. Army Training and Doctrine Command, TRADOC 2011; North Atlantic Treaty Organization 2012) have been requesting ITS capabilities that can support training and education of both individuals and teams. Teams, the basic building blocks of military organizations, are important to demonstrating progress toward goals, developing solutions, and meeting challenges associated with organizational missions. As there are only a few published studies from which to draw technical specifications for team outcomes, and because most of these projects used domain-dependent approaches to develop team models, the generalizability of those methods and technologies has not been realized.
This has been the major force behind the development of the Generalized Intelligent Framework for Tutoring (GIFT; Sottilare et al. 2011, 2012). GIFT is an open-source, modular architecture developed to reduce the cost and skill required to author, deliver, manage, and evaluate adaptive instruction. This framework is a powerful research tool and provides a starting point for more advanced and flexible ITS processes. As part of its evaluation function, GIFT may be used as a testbed to understand the potential of ITSs as team tutoring tools (Sottilare and Holden 2013; Sottilare et al. 2011). In pursuit of this goal, we began our research of team tutors by exploring the team and collaborative learning literature.
Group development might neatly be classified into three distinct areas of group interaction with different purposes: team training, teamwork, and collaborative learning (Cannon-Bowers and Bowers 2011). Van Berlo (1997) compared and contrasted team training, team building, and cooperative learning. We have examined these differences and consolidated similar terms from the literature:
- taskwork team training is a subset of team training which is focused on developing proficiency in task domains required for a specific duty of one’s job (Salas 2015); taskwork team training is a domain-dependent learning activity; team training is often confused with the concept of teambuilding or teamwork (Van Berlo 1997).
- teamwork is the “coordination, cooperation, and communication among individuals to achieve a shared goal” (Salas 2015, p. 5); teamwork behaviors are largely domain-independent; teamwork includes the social skills needed to function as a team; teamwork activities may include teambuilding, whose goal is to strengthen the coordination, cooperation, communication, coaching, conflict management, cohesion, and collective efficacy of the group (Salas 2015); teamwork is a necessary prerequisite to satisfactory taskwork performance (Van Berlo 1997).
- collaborative learning (also referred to as cooperative learning) is “a situation in which two or more people learn or attempt to learn something together” (Dillenbourg 1999, p. 1); cooperative learning reinforces active participation (Van Berlo 1997); collaborative learning generally focuses on a learning goal, is primarily domain-dependent, and includes computer-supported collaborative learning (CSCL) activities.
While there are similarities between team taskwork, teamwork, and collaborative learning, there are also important differences. This article focuses primarily on teamwork and on identifying a set of behavioral markers that indicate largely domain-independent team states. Behavioral markers indicating a high degree of collaboration for a group should not be confused with collaborative learning experiences. Although high collaboration within a group is usually an antecedent of high performance, collaborative learning experiences, which are focused on learning together, may be moderated by the group’s ability to collaborate. Collaboration is an element of teamwork, and collaborative learning is an instructional method to promote group learning. In the same way, a group of experts may each have a high degree of proficiency in a particular task or collaborative learning domain, but may have their performance moderated by their ability to work together as a team. Teamwork is an antecedent of learning and successful performance (Van Berlo 1997).
Teamwork Literature and Relevance to AIED and CSCL Research
Teamwork, team learning, and team performance have ample coverage within the general training and education literature; the AIED literature on teams, however, focuses primarily on collaborative learning and collaborative problem solving. While the teamwork literature has received ample scholarly attention (Cannon-Bowers and Bowers 2011), little is known about the real ontology of core team behaviors, attitudes, and cognition, and their influences on team outcomes (e.g., learning, performance, satisfaction, and viability). Most notably, there have been few domain-independent approaches to the development of team models for computer-based tutoring. One such approach has its roots in neurophysiologic measures of collaboration (Stevens et al. 2009a, b, 2013). The focus of these team neurodynamics studies is to understand the changing rhythms and organizations of teams from the perspective of neurophysiology, and specifically the concept of neuronal synchrony, in which a number, normalized between 0 and 1, quantifies the level of synchrony of a large population of neurons within a network or, in this case, individuals on a team. The theory suggests that higher synchrony measures between team members equate to higher team collaboration. Neurophysiologic measurement tools such as these may offer an unobtrusive means of assessing team states in team training experiences guided by ITSs.
Another such domain-independent approach is cooperative learning, where students work together to accomplish shared goals and are responsible for maximizing not only their own learning but the learning of all other group members (Johnson and Johnson 1986, 1999). In their groundbreaking cooperative learning meta-analysis, Johnson et al. (2000) examined the impact of cooperative learning and compared several related learning strategies of the time. The primary independent variable in this study was the method of cooperative learning (a comparison of cooperation, competition, or individualistic learning), and the primary dependent variable was achievement as an outcome measure for performance.
The consistency of the results and the diversity of the cooperative learning methods provide strong validation for its effectiveness. However, the low number of studies conducted for several of the methods examined makes the reliability of their effect sizes very tentative. Many more studies have been conducted in the years since the publication of the Johnson, Johnson, and Stanne meta-analysis. Although this was not specifically addressed in the meta-analysis described herein, we recommend an update of the cooperative learning meta-analysis to strengthen the reliability of its results. Our goal in conducting the teamwork meta-analysis described herein was to expand the dimensions of teamwork (e.g., trust, cohesion, conflict management) to understand the broader influences for experiences beyond collaborative learning (e.g., team training and social interaction) and on outcomes beyond team performance (i.e., learning, satisfaction, and viability), where teamwork measures are represented by member attitudes, behaviors, and cognitions.
Others in the AIED and computer-supported collaborative learning (CSCL) community have built upon cooperative learning research to understand how individuals work together toward common goals and might be guided by ITS technologies (tools or methods) to enhance their learning, performance, or the quality of their overall instructional experience. Noteworthy articles by Erkens and Janssen (2008) and Dillenbourg and Hong (2008) discuss the assessment of dialogue acts during collaboration and the management of collaborative environments, respectively. A series of articles by McManus and Aiken (1993, 1995, 2016) highlights research in collaborative learning and collaborative problem solving. Soller (2001) adapted McManus and Aiken’s (1995) Collaborative Skills Network (CSN) to form her Collaborative Learning Conversation Skill Taxonomy (CLCST), which included skills, subskills, attributes, and sentence openers to promote collaborative skills and support social interaction within the Intelligent Collaborative Learning System (ICLS). Since the behavioral markers identified in our meta-analysis are primarily verbal behaviors, it was logical to examine commonalities with CSN and ICLS prior to developing an implementation plan for team tutoring in GIFT. For example, negative markers identified in our meta-analysis could be cross-referenced with appropriate skills, subskills, or attributes in the CLCST, and corresponding replies could be associated with sentence openers intended to mitigate negative behaviors and promote collaboration within the group.
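As a purely hypothetical illustration of this cross-referencing (the marker strings and sentence openers below are invented for the example and are not drawn from the actual CLCST), a team tutor could map an observed negative behavioral marker to a mitigating sentence opener:

```java
import java.util.Map;

/** Hypothetical mapping from negative behavioral markers to CLCST-style
 *  sentence openers (Soller 2001). Entries are illustrative only. */
public class MarkerToOpener {

    static final Map<String, String> OPENERS = Map.of(
        "withholds task information", "Could you tell the team what you found about...",
        "interrupts teammates", "Let's hear the rest of what your teammate was saying about...");

    public static void main(String[] args) {
        String observedMarker = "withholds task information";
        // The tutor injects the opener into the group dialogue to promote collaboration.
        System.out.println(OPENERS.getOrDefault(observedMarker, "No opener defined."));
    }
}
```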
As part of its research on “intelligent support for learning in groups,” the AIED community examined the application of intelligent agents to: support peer tutoring (Walker et al. 2014); enhance collaboration (Adamson et al. 2014); trigger productive dialogue (Tegos et al. 2014); and assess groups during collaborative problem solving tasks (Rosen 2015). The research described in this article builds upon the AIED theme of intelligent support for learning in groups by identifying behavioral markers which may be used to assess various states of the team. Understanding the states and traits of teams and their progress toward individual and team goals is a necessary precursor for determining quality actions by the tutor, and is also part of the motivation for understanding specific contributors to teamwork and team outcomes studied through the meta-analysis described in this article.
An understanding of collaborative behaviors might also be studied through team proxies where a premium is placed on communication and cooperation. Such is the case for interactions between human students and virtual peers managed by intelligent agents. Examples of this interaction appear in studies using trialogues, which assess interaction between one human student, one virtual student, and one virtual tutor (Lehman et al. 2013; Cai et al. 2014). Another example of a team proxy is the cooperative interaction described in Betty’s Brain, where the student is responsible for teaching the agent (Leelawong and Biswas 2008; Biswas et al. 2016). These agent-based studies offer insight into the tutor’s perception-action coupling and highlight the need for the agent-based team tutor to be cognizant of the changing conditions of the environment and of each individual learner on the team in order to sufficiently model the team, their interactions, and appropriate interactions by the tutor.
Finally, in our review of teams and teamwork in the AIED community, we examined the application of ITSs to a domain-specific team task. The Advanced Embedded Training System (AETS; Zachary et al. 1999) applied ITS tools and methods to the task of improving tactical training quality and reducing the manpower needed for shipboard team training in the US Navy. AETS provided layers of performance assessment, cognitive diagnosis, and team-training support on top of an existing embedded mission simulation in the Navy’s Aegis-class ships. Detailed cognitive models of the trainee’s task performance were used to drive the assessment, diagnosis, and instructional functions of AETS. The embedded training approach allowed tutoring functions to be integrated with training simulations embedded in the ship’s equipment. This approach blurred the lines between training and work environments. While it was revolutionary and was expected to be leveraged across domains, it was not broadly applied and did not generalize to other training tasks. Regardless of its generalizability, this approach provided valuable lessons in how GIFT might be adapted to support team tutoring.
Drawing from the Sottilare et al. (2011) team tutoring model and others that tried to synthesize teamwork and team training in a qualitative way (e.g., Campion et al. 1993; Cannon-Bowers and Bowers 2011; Dyer 1984; Klein et al. 2009; Salas et al. 1992; Smith-Jentsch et al. 1998; Smith-Jentsch et al. 2008), we extracted key variables to scrutinize in a quantitative manner. Furthermore, since the relationship between these variables is complex, considering the different features and individual characteristics, intervention design, and environmental variables involved in team training, we recognize the dynamism of teams and thus consider the ontologies for each team outcome in parallel for clarity.
An Effective Team Architecture for ITSs
To develop a team ITS, additional work is required to identify a comprehensive design architecture, one delineating specific team model components, behavioral markers and associated measurement methods. This design architecture must be rooted in principles of intelligent tutoring, but also be based on the science of team performance and team training. Without a design architecture that contains concrete behavioral markers and assessment methods, it is difficult for a trainer or instructional designer to know how best to leverage team research to support collective training. In this article, our contribution is the discovery of significant antecedents to team performance and learning, and the identification of behavioral markers. Assessment methods will be evaluated in the future once these findings have been incorporated into GIFT.
In their summary of the literature, Dorsey et al. (2009) identified measuring team performance, improving team performance, and studying team formation and development as the main research challenges to be addressed in developing effective team ITSs. In light of this, we began a research initiative to develop an empirically-based ontology of the core attitudes, cognitions, and behaviors that influence team outcomes. These findings would then enable us to prioritize the most important factors influencing team outcomes that could then be instantiated in GIFT and validated in future GIFT-based tutors. This article describes the process and results of a quantitative synthesis of the existing science literature to inform the refinement of team models for performance and learning, and a process for applying these findings to ITS development. The results for satisfaction and viability outcomes will be published at a later date.
Methodology
Four meta-analytic structural equation modeling (MASEM) procedures were conducted to assess the relationships of attitudes, cognitions, and behaviors to team performance, learning, satisfaction, and viability. More importantly, these analyses contribute to the literature on teamwork by providing an overarching model for each team outcome. This serves theoretical and practical understanding of teamwork within complex, diverse contexts by providing a more accurate nomological network, identifying gaps within the literature, highlighting the simultaneous importance of team constructs, and pointing to opportunities for future research.
Literature Search
To identify primary studies for inclusion in the meta-analyses, a search was conducted using the American Psychological Association’s PsycINFO (2003-July 2013), the Defense Technical Information Center, and ProQuest for combinations and variations of the following keywords: performance/competency/trust/cognition/affect/communication/intelligent tutoring/human-computer interaction/virtual human/mood/emotion/skill/knowledge/ability/responsibilities/roles/distributed/virtual/after action review/feedback/leadership/cohesion/personality/effectiveness; paired with either team/unit/group/squad/crew.
Furthermore, the following were used as secondary search terms: progress/goals/experience/perceptions/engagement/boredom/confusion/frustration/situational awareness/training/coordination/collaboration/motivation/cohesion/learning/leadership/training/building monitoring/goal setting/instructional strategies/debriefing/decision making/event-based training/mental models (team, shared)/processes/shared cognition/simulation based training/development/transactive memory systems/backup behavior/planning/coordination/action/transition.
Additionally, snowball and back-tracing approaches, as well as additional searches that included “team and learning”/“teams and satisfaction”/“teams and viability”/“teams and performance”, were used to supplement our searches.
In searching for primary studies, the search was bounded to include only those articles published/written during the 2003–2013 timeframe. This was done not only to make the model meaningful to current organizations (given the degree to which the nature of work has changed over the past 10 years), but also to complement and extend a number of meta-analyses which were published during the early 2000s (e.g., leadership by Burke et al. 2006; cohesion by Beal et al. 2003; team conflict by De Dreu and Weingart 2003). Our search yielded 5991 unique articles.
Inclusion Criteria
To be coded and included in analyses, articles needed to meet the following requirements. First, the study had to contain enough information to calculate a correlation between team variables to be included in the analysis. Second, the focus of the article had to be on teams whose members were interdependent. Third, due to a desire to focus on team performance within small teams, studies of teams that exceeded 9 people were not included. Finally, top management teams were excluded due to their unique nature and dynamics. As shown in Fig. 1, applying the inclusion/exclusion criteria resulted in a final meta-analytic database of approximately 300 primary studies, with 296 on team performance, 41 on team satisfaction, 18 on team viability, and 11 on team learning. This resulted in over 10,000 effect sizes prior to composites being created.
Coding Procedure
Studies that passed the inclusion/exclusion criteria were coded on several categories, including sample characteristics, reliability of measures, and effect sizes. A codebook was developed that provided detail on all of the components of the coding scheme to facilitate the quantitative coding process. Prior to beginning the actual coding, each coder attended a team agreement meeting to ensure that the first 50 articles coded were consistent across coders, in an effort to maintain inter-coder reliability. Each coder also received effect size and composite calculation training. Subsequently, pairs of coders were assigned articles whereby they came to consensus on which articles were deemed to be “codeable” (based on the boundary conditions specified earlier). Next, each article was coded by each individual in the pair, and any discrepancies were resolved through a consensus meeting. To facilitate coding toward the end of the dataset, each pair of raters came to consensus on “codeability,” but then split those articles in half so that each individual coded one half of the identified articles.
Analysis Methods
For the quantitative analysis, we followed the Hunter and Schmidt (2004) guidelines for a random-effects meta-analysis of correlations. When multiple effect sizes were presented within a single sample, composites were created (Nunnally 1978), and if the information required to calculate a composite was not available, the mean of the effect sizes was used. In cases where a composite or average was calculated, the reported reliability estimates were used in the Spearman-Brown formula (Li et al. 1996) in order to calculate the reliability of the composite or average. The calculation of the composite correlations and all analyses were performed using SAS Enterprise Guide 6.1 and SAS macros (Davis 2007) that executed original syntax as well as syntax modified from Arthur et al. (2001).
Our results include a sample-weighted mean point estimate of the study correlations (r) as well as the corresponding 95% confidence interval (which expresses the amount of error in r that is due to sampling error and is used for statistical significance testing). We also report the number of independent samples (k) and the cumulative sample size (N) included in the calculation of each correlation, along with the correlation corrected for unreliability in the predictor and criterion (rc). Corrections for unreliability were performed using only the reliabilities reported in each article; no data were imputed.
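To make these computations concrete, the following is a minimal sketch (not the SAS macros actually used in this study) of the two core Hunter and Schmidt (2004) steps described above: the sample-weighted mean correlation and the correction of an observed correlation for unreliability in the predictor and criterion. Class and method names, and the study values in main, are illustrative.

```java
import java.util.List;

/** Illustrative sketch of two Hunter & Schmidt (2004) steps; not the study's actual SAS code. */
public class MetaAnalysisSketch {

    /** One primary study: sample size, observed correlation, and reported reliabilities. */
    record Study(int n, double r, double rxx, double ryy) {}

    /** Sample-weighted mean correlation: rbar = sum(Ni * ri) / sum(Ni). */
    static double weightedMeanR(List<Study> studies) {
        double num = 0, den = 0;
        for (Study s : studies) {
            num += s.n() * s.r();
            den += s.n();
        }
        return num / den;
    }

    /** Correct an observed correlation for attenuation: rc = r / sqrt(rxx * ryy). */
    static double correctedR(Study s) {
        return s.r() / Math.sqrt(s.rxx() * s.ryy());
    }

    public static void main(String[] args) {
        List<Study> studies = List.of(
            new Study(120, 0.30, 0.85, 0.80),  // hypothetical primary studies
            new Study(60, 0.45, 0.90, 0.75));
        System.out.printf("weighted mean r = %.3f%n", weightedMeanR(studies));
        System.out.printf("corrected r (study 1) = %.3f%n", correctedR(studies.get(0)));
    }
}
```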
In order to test the nomological network of team constructs, and thereby estimate the relative importance of these constructs for predicting team outcomes, a MASEM approach was applied. MASEM is a two-stage approach designed to test structural paths, providing a robust, theoretically driven quantitative review. We followed the recommendations from Viswesvaran and Ones (1995) and tested our model using LISREL 9 (Jöreskog and Sörbom 2004). First, we input the meta-analytic results into a correlation matrix.
Once we had compiled all meta-analytically corrected coefficients, the harmonic mean was calculated to account for sample sizes that varied across cells (see Table 2 for corrected coefficients). We drew on fit indices such as the root-mean-square error of approximation (RMSEA), the comparative fit index (CFI), and the non-normed fit index (NNFI or TLI) for evidence of whether the proposed model was adequate, and we report the chi-squared statistic (χ2) with the caution that it is highly sample dependent.
Due to widely recognized concerns about the restrictiveness of the χ2 statistic, as well as its sensitivity to sample size (Jöreskog 1969; Quintana and Maxwell 1999), less sensitive and more reliable indices for assessing the reasonableness of the fit of the proposed models were also used in this study, including the Tucker-Lewis Index (TLI), Comparative Fit Index (CFI), Root Mean Square Error of Approximation (RMSEA), and Standardized Root Mean Square Residual (SRMR; Ponterotto et al. 2003). The threshold values indicating good model fit are: CFI > .95; RMSEA < .06 (for N ≥ 250), with < .08 as an upper limit (Hu and Bentler 1995); SRMR < .10 as an upper limit, with < .08 suggesting excellent fit; and TLI > .90 (Vandenberg and Lance 2000; Byrne 2001).
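Read as a decision rule, these cutoffs can be encoded directly. The sketch below does so under the thresholds stated above; the CFI and RMSEA values passed in main are the team performance model values reported later in this article, while the TLI, SRMR, and N values are placeholders for illustration.

```java
/** Encodes the model-fit cutoffs cited above (Hu and Bentler 1995; Vandenberg and Lance 2000). */
public class FitThresholds {

    static boolean goodFit(double cfi, double tli, double rmsea, double srmr, int n) {
        boolean cfiOk = cfi > 0.95;
        boolean tliOk = tli > 0.90;
        // RMSEA: < .06 when N >= 250; < .08 as an upper limit otherwise.
        boolean rmseaOk = (n >= 250) ? rmsea < 0.06 : rmsea < 0.08;
        boolean srmrOk = srmr < 0.10; // < .08 would suggest excellent fit
        return cfiOk && tliOk && rmseaOk && srmrOk;
    }

    public static void main(String[] args) {
        // CFI = .999 and RMSEA = .019 are reported below for the team performance model;
        // TLI, SRMR, and N here are placeholders.
        System.out.println(goodFit(0.999, 0.95, 0.019, 0.05, 300)); // prints true
    }
}
```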
MASEM Results
Table 1 presents the MASEM results for the primary team outcomes; all models show adequate fit.
The desire to focus on the most relevant constructs for team tutoring resulted in limiting the results and discussion of findings to the team performance and team learning outcomes. While our findings for team satisfaction and team viability were within the 90% confidence interval (see Table 1, RMSEA), they exhibited lower confidence levels than for team performance and learning and were based on a much smaller number of studies. For these reasons, we decided not to include the full analysis for either team satisfaction or team viability, but instead to focus on team performance and team learning. However, we have included the summary results for all four outcomes associated with the study in Table 1 for completeness.
Team Performance
Team performance is a primary focus of team research (e.g., Bell 2007; Cannon-Bowers and Bowers 2011). It has been defined as “the extent to which the productive output of a team meets or exceeds the performance standards of those who review and/or receive the output” (Hackman 1987, p. 323). According to our meta-analytic findings, team behaviors explain up to 42% of the variance in team performance. Considering the importance of distinguishing specific behaviors, we highlight action processes and organizational citizenship behaviors (OCBs) as the most important. These were followed by communication (13%), coordination (i.e., mutual support, 16%; reflexivity, 14%), leadership (11–17%), conflict management, transition processes, and conflict. A number of studies were found in this area of research, giving us a high degree of confidence regarding the moderate explanatory power of team behaviors for team performance (Table 2).
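Reading these percentages as squared corrected correlations (a standard meta-analytic convention, and our assumption here), the 13% figure for communication, for example, corresponds to a corrected correlation of roughly .36:

```latex
R^2 = r_c^2, \qquad r_c \approx .36 \;\Rightarrow\; R^2 \approx .13 \quad (13\% \text{ of the variance})
```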
A wealth of research linking attitudes to team performance exists. The most commonly researched attitudes in relation to team performance were trust, collective efficacy, and cohesion. Results indicate that collective efficacy and psychological safety explain the most variance in team performance, 20% and 17% respectively. This is followed by trust and cohesion which also explain significant amounts of variance (9% and 15% respectively). Justice was also examined, but only had a k of 1 and was not significantly related to team performance (Table 3).
Team cognition also explained significant variance in team performance. Transactive memory systems and shared mental models accounted for 20% and 10% of the variance in team performance, respectively (see Table 4). Surprisingly, situational awareness was the construct showing the largest relationship with performance; however, this conclusion is based on a small sample size and calls for further investigation to strengthen confidence in these findings.
These results can be compared with those of the MASEM, which allowed constructs to co-vary naturally. Findings from the MASEM indicated that the model below showed adequate fit (CFI = .999, RMSEA = .019, see Table 1). Figure 2 shows the integrative model that highlights the importance of collective efficacy (β = .15), cohesion (β = .08), communication (β = .10), and leadership (β = .09) behaviors. However, it also shows that most of the variance found in trust, coordination, and conflict influences performance through other mechanisms. Finally, the lack of inter-correlations present in the database did not allow for the inclusion of reflexivity, mutual support, OCBs, or any of the cognitive variables in the MASEM analysis.
Team Learning
For the purposes of this review, team learning has been defined as the acquisition of knowledge or skills through experience, practice, study, or tutoring. It is important to understand what antecedents can foster learning, which has been highlighted as a core objective for any training intervention (Mesmer-Magnus and Viswesvaran 2010). According to our meta-analytic results, team behaviors account for 7–36% of the variance in team learning (Table 5). Specifically, conflict and conflict management appear as important antecedents, but the number of studies available in this area limits the interpretation of this finding. Communication and reflexivity appear as the main antecedents, accounting for 25% and 15% of the variance, respectively. Additionally, these variables have a higher number of included studies, thereby producing more confidence in the results. Earlier work also suggests coaching/leadership can play a significant role in learning (e.g., Hackman and Wageman 2005; Edmondson et al. 2001).
Not surprisingly, attitudes reflective of cooperation account for 37% of the variance in team learning (Table 6). The role of psychological safety (74%), cohesion (44%), and trust (27%) becomes evident, but the issue of the number of studies remains. The findings regarding team learning show a promising avenue that calls for future research to strengthen confidence in the findings. There were no identified studies examining the relationship between team cognition and team learning.
A different picture emerges with the MASEM results, where all antecedents are allowed to covary naturally. The model below showed adequate fit (CFI = .996, RMSEA = .040, see Table 1). Figure 3 shows the integrative model that highlights the importance of attitudes for cohesion (β = .18) and trust (β = .11), and of conflict (β = −.12) and conflict management (β = .27) behaviors. However, it also shows that most of the variance that team behaviors (e.g., coordination, communication) account for with respect to learning occurs through other mechanisms. This highlights the importance of emergent states as antecedents when the goal is to improve learning. Moreover, the lack of inter-correlations among the data in our dataset limits the model with regard to the inclusion of reflexivity, psychological safety, leadership, or any of the cognition variables, even though prior work has suggested these are relevant.
Behavioral Markers
This section reviews the process for moving toward a set of behavioral markers for team performance and learning. Methodological choices when developing any measurement system are paramount for understanding human behavior (Meister 1985). This is especially true in intelligent tutoring systems (ITSs). In order to be designated an ‘intelligent’ system, three conditions must be met (Burns and Capps 2013). First, the system must have a thorough understanding of domain knowledge in order to solve problems through the application of this knowledge. Next, the system must be able to accurately assess the knowledge of the learner within the system, and, finally, the system must be able to apply strategies to reduce the knowledge gap between the learner and the expert. This is the heart of the Learning Effect Model (LEM) on which GIFT is based (Sottilare 2012). Accurate measurement strategies are at the crux of all three steps. If measurement strategies are not thoroughly considered, an ITS will not be able to properly assess a learner’s current state and, consequently, will not be able to engage in strategies to reduce the knowledge gap. This problem becomes compounded when the subject matter or domain is no longer declarative knowledge but, rather, behavioral tutoring or training. In these instances, it is crucial to develop an accurate set of behavioral markers in order to provide the essential feedback needed in ITSs.
It is worth noting again that the meta-analysis described herein is intended to support a broad set of domains in which groups of people interact for a purpose. As noted earlier, collaborative learning is a prevalent theme in the AIED and CSCL literature (Rosé et al. 2001; Rosé and VanLehn 2005; Kumar et al. 2007; Erkens and Janssen 2008; Chaudhuri et al. 2009; Ai et al. 2010; Kumar et al. 2010; Kumar and Rosé 2011; Adamson et al. 2013; Dyke et al. 2013; Adamson et al. 2014), but groups also interact for other purposes (e.g., to enhance task performance or to develop social skills as in teamwork). The behavioral markers identified herein provide a mechanism to identify teamwork attributes beyond collaboration.
Presently, our goal is to develop a set of markers that can be used for an intelligent team tutoring system. ITSs utilize artificial intelligence to select and deliver directed feedback during training situations. Traditionally, the focus of ITSs has been on the development of an individual’s cognitive skills (e.g., problem solving and decision making). Yet this dismisses the potential to leverage these types of systems to create opportunities for social learning in team environments (Singley et al. 1999). Thus, the current objectives were focused on developing behavioral markers specifically in the team context. In order to do this, we first utilized a refined team model. Specifically, we reviewed the current state of the team literature and, using the initial GIFT team models (Sottilare et al. 2011, 2012) as a basis, we created a contemporary team model architecture. This design architecture was then used to identify crucial team states that would be the foundation of the selected behavioral markers.
Next, we briefly describe the most common methodology used to assess team function – the use of self-report Likert scales – its weaknesses, and why the use of behavioral markers is more functional when trying to develop an ITS architecture for teams. We then describe the process used to develop an initial set of behavioral markers for a limited number of the team constructs identified as being important in our meta-analytic research.
Why Use Behavioral Markers?
Measurement techniques are a crucial component when considering the validity and generalizability of scientific research. When designing a measurement system, one must make several decisions, weighing the pros and cons of the measurement source (e.g., supervisor, self-report, trained rater), when to measure the construct of interest (e.g., at the beginning, during, or at the end of a performance episode), and what scaling technique should be used (e.g., Likert, behavioral markers, paired comparisons). Traditionally, one of the most common techniques has been self-reported Likert-like measures taken at the end of the performance episode.
However, there are some underlying criticisms of using self-report measures as a main approach. First, self-reported measures are more subject to social desirability bias than other measurement sources (Budescu and Bruderman 1995). In other words, individuals have a tendency to exaggerate estimates concerning positive characteristics when they are the referent of the measure. Second, measurements taken at the end of a performance episode are more susceptible to judgment errors such as the availability heuristic. The availability heuristic suggests that the qualities of an event are influenced by the ease with which one can recollect the event (Tversky and Kahneman 1974). As such, more emotionally-laden events will be recalled more easily and, subsequently, judged to occur more frequently – even if they did not occur often. Lastly, the scale points used in Likert-like measures (e.g., 1 = Strongly Disagree; 7 = Strongly Agree) are subject to interpretation by the individual filling out the measure. That is, there might be different standards one uses to judge each scale point, and these judgments may be influenced by pre-existing stereotypes (Biernat and Manis 1994). However, there has been movement to reconcile some of these issues using different measurement techniques.
Recently, researchers have started to move toward a more objective and less obtrusive approach when measuring psychological constructs (Wiese et al. 2015). This approach uses an objective source (e.g., trained rater, intelligent system) that makes judgments on a set of behavioral markers during a performance episode. This approach reduces the amount of error compared to self-reported Likert measures taken at the end of the performance cycle. More specifically, using more objective sources reduces the degree of social-desirability bias, availability heuristics may be limited when ratings are made during performance episodes, and using behavioral markers removes the possibility of changing standards between participants. As such, using this measurement technique will result in less biased and more accurate judgments than using the type of self-reported Likert-type measures which are often collected at the end of a study. However, we acknowledge that objective measures also have their limitations, with reported issues such as criterion deficiency or contamination (Borman and Smith 2012; Schmidt and Hunter 1977) or potential biases (Bound 1989).
Deriving Behavioral Markers
In order to develop the behavioral markers, we had to first identify critical team constructs that were indicators of effective team performance. Using the results of the meta-analytic investigation reported earlier, we refined the team states included in the initial GIFT framework (Sottilare et al. 2011). We then took a subset of these constructs (i.e., psychological safety, trust, communication) that were shown to be predictors of team outcomes (e.g., learning, performance, viability, and satisfaction) and searched the literature for measures that have been used to assess these constructs. Next, we compiled the items from these measures into separate Excel documents. These documents were then given to several subject-matter experts (SMEs) with the goal of (1) developing behavioral markers that represented each individual item, (2) identifying and removing items that measured the same behavior, and (3) developing sub-dimensions of the construct as a whole if necessary.
Once the SMEs completed their ratings, the group met and came to a consensus regarding whether the generated markers accurately represented each item. During this meeting, SMEs determined whether the construct and its sub-dimensions were equally represented by the markers. If the group believed that the current markers were deficient in representing any aspect of the construct, new markers were generated and agreed upon. This step produced the final list of behavioral markers by construct. A shortened version of these steps is displayed in Table 7. Following this process, we identified markers for antecedents that accounted for significant variance in one or more of our outcomes. Next, we describe potential markers for trust, collective efficacy, conflict, conflict management, and communication. It should be noted that while the behaviors listed are domain-independent, the markers represent strategies to recognize team behaviors in any domain. In a specific domain under training, domain-specific measures would still be needed to understand the context or the frequency of occurrence of these behaviors.
Trust
While there is no universally agreed upon definition of trust, two elements that are common to nearly all definitions reflect the idea of positive expectations and the willingness to become vulnerable. Perhaps one of the most cited definitions argues that trust is “the willingness of a party to be vulnerable to the actions of another party based on the expectation that the other will perform a particular action important to the trustor, irrespective of the ability to monitor or control that other party” (Mayer et al. 1995, p. 712).
A search of the AIED literature for the term “trust” produced no results, but a search for “expectations” did produce a relevant article that examined the tailoring of progress feedback and emotional support related to the personality trait of agreeableness (Dennis et al. 2016). The authors indicate that team members with higher levels of agreeableness exhibit higher levels of trust. They therefore advocate a strategy of providing advice and reassurance to learners with low performance and high agreeableness, and providing only advice to moderate and high performers with high agreeableness. However, this model only covers a propensity for trust and does not include the additional antecedents of team trust put forth by Costa (2003). These include the preference of individuals for working on teams, adequate job skills of team members, tenure within the team, cohesion of the team, functional dependence of the task or process being executed by the team, the perceived trustworthiness of other members, and cooperative and monitoring behaviors.
In this vein, we present several trust markers below. More specifically, we present a mixture of markers, some describing the behavior of the trustee (the target) and some that of the trustor (the person trusting); the former are typically indicative of trustworthiness and the latter of trust. Items here generally fall into markers which reflect assessments of competence, benevolence, or integrity; some are surface level and some are deeper level (only appearing after some time). The markers in Tables 8 and 9 below were created based on items that appear within the following published papers on trust (Chang et al. 2012; Cook and Wall 1980; De Jong and Elfring 2010; Dirks 2000).
Collective Efficacy
Collective efficacy has been defined as “a shared belief in a collective’s capabilities to organize and execute the course of action” (Bandura 1997, p. 477). In essence, a team experiences a sense of collective efficacy when members perceive that the members of the team have strong abilities to fulfill their roles on the team as they relate to the task in question. Interpersonal factors can moderate the relationship between perceived abilities and task perception such that when interpersonal factors are diminished (e.g., personal conflict is high), teams may feel they have less collective efficacy, not due to task-related capabilities but due to interpersonal factors which impact task execution. Scales on collective efficacy tend to be task-specific and incorporate explicitly task-driven factors, or may also include some of the interpersonal factors (given that both taskwork and teamwork are needed for a team to perform successfully). Therefore, the markers for collective efficacy reflect this duality: ability and task domain context. In this sense, collective efficacy is a form of trust in the abilities of the team.
A search of the AIED literature for the terms “collective efficacy”, “collective worth”, “collective value” and related terms produced no results. The collective efficacy markers were created based on items that appear within the following published papers on collective efficacy (Bray 2004; Chen et al. 2005; Cheng and Yang 2011; Cronin et al. 2011; Edmonds et al. 2009; Guzzo et al. 1993; Hsu et al. 2007; Jehn et al. 1997; Jones 1986; Lent et al. 2006; Luhtanen and Crocker 1992; Mathieu et al. 2009; Riggs and Knight 1994; Sargent and Sue-Chan 2001; Shivers-Blackwell 2004; Tasa et al. 2007; Woehr et al. 2013; Wong et al. 2009). Within this set of markers we have the most confidence in the ability-focused markers. For brevity only a sampling of the 19 ability-focused markers and 13 contextualized markers discovered are displayed in Tables 10 and 11. A full list is available from the authors upon request.
Cohesion
While cohesion has been defined in many ways, it has generally been argued to reflect an attraction or bond within the team. More specifically, cohesion has been defined as “the bonding together of members of a unit in such a way as to sustain their will and commitment to each other, their unit, and the mission” (Johns et al. 1984, p. 4). Carless and De Paola (2000) conceptualize cohesion as having three dimensions: social, task, and group pride. As such, we have examined the existing scales and papers which report cohesion measures and delineated a first round of markers that tap each of the three dimensions. In this vein, task cohesion is defined as the group’s shared commitment and ability to execute the group task or goal, or the group’s capacity for teamwork (Siebold 1999; Craig and Kelly 1999). Social cohesion reflects the group’s bond that promotes the development and maintenance of social relationships (Carless and De Paola 2000). Finally, group pride can be described as a shared sense of unity and honor derived from membership in the group and the group’s accomplishments.
A search of the AIED literature for the terms “cohesion” and “commitment” produced a relevant article that examined the goals of verbal and non-verbal communication. Communications were classified as either “aimed at solving a problem or alternatively aimed at creating social cohesion and team spirit” (Rosenberg and Sillince 2000, p. 299). Communications aimed at problem solving (task oriented) tended to suppress social meanings and render them invisible, and communications aimed at social cohesion tended to suppress task meanings and render them invisible. This research contributes directly to both task cohesion (Table 12) and social cohesion (Table 13).
The markers in Tables 12, 13 and 14 were created based on items that appear within the following published papers on team cohesion (Carless and De Paola 2000; Carron et al. 1985; Chang and Bordia 2001; Henry et al. 1999; Hobman and Bordia 2006; Hoegl and Gemuenden 2001; Jehn 1995; McCroskey and McCain 1974; Miller 1964; O'Reilly et al. 1989; Podsakoff et al. 1997; Rosenfeld and Gilbert 1989; Rosenberg and Sillince 2000; Sargent and Sue-Chan 2001; Shin and Choi 2010; Shivers-Blackwell 2004; Solansky 2011; Watson et al. 1991; Wong 2004; Zaccaro 1991). For brevity, only a sampling of the 29 task cohesion markers and 16 social cohesion markers are shown in Tables 12 and 13 respectively, but all of the 5 group pride markers discovered are displayed in Table 14. A full list is available from the authors upon request.
Communication
Communication has been defined as “the process by which information is clearly and accurately exchanged between two or more team members in the prescribed manner and with proper terminology; the ability to clarify or acknowledge the receipt of information” (Cannon-Bowers et al. 1995, p. 345). In examining published measurement instruments that pertain to communication, we found that team communication is a complex endeavor; in developing communication markers, consideration must be given to how the markers will be applied and to potential measurement methods. The characteristics of the communication as well as the content of team communication (see Table 15) are important in determining the effectiveness of team communications.
A search of the AIED literature for the terms “team communication” and “group communication” produced no results, but a search of “communication” produced several relevant articles that included titles related to learning in groups (Kumar and Kim 2014; Yoo and Kim 2014; Walker et al. 2014; Tegos et al. 2014; Adamson et al. 2014), social relationships and responsibilities in AIED systems (Walker and Ogan 2016), and peer collaboration in distributed learning environments (Greer et al. 1998; Muehlenbrock et al. 1998). While important mediators of team communications, the focus of these articles in most cases was to facilitate discussion among a group of peers rather than determine whether the group’s communication indicated progress toward a team goal.
The markers in Tables 16, 17, 18 and 19 were created based on items that appear within the following published papers on communication which contain scales (Bunderson and Sutcliffe 2002; Cronin et al. 2011; De Dreu 2007; Espinosa et al. 2012; Faraj and Sproull 2000; Fletcher and Major 2006; Gajendran and Joshi 2012; Greer et al. 2012; Gupta et al. 2009; Haas 2006; Hirst 2009; Huang and Cummings 2011; Jong et al. 2005; Lee et al. 2010; Schippers et al. 2007; Tung and Chang 2011; Walther and Bunz 2005).
Tables 16, 17, 18 and 19 provide a set of potential markers for identifying the effectiveness of team communication. The tables highlight four sub-dimensions of communication: 10 general information sharing behaviors (Table 16), 17 contextualized information sharing behaviors (Table 17), 1 workflow knowledge sharing behavior (Table 18), and 1 characteristic of inter-team (team-to-team) communications (Table 19). While these markers are expressed as domain-independent behaviors, the related measures of these behaviors would be specific to their domains (e.g., task relevant information sharing would require identification of unique information for each task domain). For brevity, only a subset of the general information and contextualized sharing behaviors are shown.
Conflict & Conflict Management
Team conflict has been defined as, “the process resulting from the tension between team members because of real or perceived differences” (De Dreu and Weingart 2003, p. 741). Closely related is the notion of conflict management which refers to the process through which team members engage in strategies to effectively manage conflict as it arises.
A search of the AIED literature for the terms “conflict” and “conflict management” produced key results (Tedesco 2003; Hall et al. 2015; Israel and Aiken 2007). These approaches both classify/categorize conflict and mediate it by guiding the group toward cooperative behaviors through constructive discussion in specific domains. The goal for the meta-analysis described herein is to provide generalized behavioral markers to classify/categorize conflict. Once the behavior is classified, generalized strategies (which are yet to be developed) would be applied to mediate the conflict. While these approaches are domain-dependent, the processes used to identify mediation strategies are relevant to next steps in identifying generalized (domain-independent) strategies.
In examining published scales reported to measure conflict, we found a mixture of conflict and conflict management markers. Therefore, we have included both below. The markers in Tables 20, 21, 22 and 23 were created based on items that appear within the following published papers on conflict (Barker et al. 1988; Espinosa et al. 2012; Gupta et al. 2010; Huang 2010; Jehn 1995; Jehn and Mannix 2001; Kostopoulos and Bozionelos 2011; Pearson et al. 2002; Rispens 2012; Rutkowski et al. 2007). For brevity, only a subset of the 18 conflict management markers discovered are shown (Table 23).
Application of Findings to GIFT
The behavioral markers identified in this article shape foundational measures needed to support team tutoring not only in GIFT, but also in other ITS architectures. The ontologies shown in Figs. 2 and 3 were derived from the MASEM process and shape a set of initial models of team performance and learning, respectively. Next steps in applying these findings to GIFT will be to validate these models through a program of rigorous experimentation across several task domains and populations.
Implementation within GIFT will require modifications to nearly all aspects of the framework and its underlying theoretical foundation, the Learning Effect Model (LEM; Sottilare 2012; Fletcher and Sottilare 2013; Sottilare et al. 2013), which has been updated as shown in Fig. 4. The LEM illustrates the relationship between the common elements of ITSs: the learner model, pedagogical model, and domain model. The GIFT ontology uses the term “module” for major elements of the architecture because each module contains both a model and active software.
In the LEM, the learner model (green boxes) is composed of real-time learner data (derived from sensors and learner input), learner states (derived from learner data and classification methods), and long-term learner attributes derived from the results of previous inputs and experiences. The long-term model represents a record of the learner’s demographic data and historical data (any relevant data, including achievements), while the short-term model represents any acquired learner data (e.g., data from the long-term learner model, real-time sensors, or learner input or actions) and derived learner states (e.g., domain performance, cognitive, affective, or physical). The pedagogy (light blue boxes) is represented by instructional strategies and agent policies. The pedagogy is domain-independent: instructional strategies include the ITS’ plan for action (e.g., prompts, hints, and questions), and agent policies are based on best instructional practices from the literature (e.g., mastery learning, error-sensitive feedback, and fading worked examples). The domain (orange boxes) includes instructional tactics selection and environmental conditions.
Instructional tactics (actions by the tutor) are influenced by strategies selected by the pedagogical module in GIFT. If a “prompt learner for more information” strategy is selected by the pedagogical module, then the domain module selects a tactic that is specific to that domain and the environmental conditions. GIFT uses the term “environmental conditions” to broadly represent the domain content. In a mathematics tutor, the environmental conditions would be specified as the problem type and difficulty level (e.g., a moderately difficult quadratic equation), but in an immersive virtual simulation, the environmental conditions would represent discrete states (e.g., the state of entities at a specified point in a training scenario). In this way, GIFT can be applied broadly to almost any domain. However, at this time, GIFT cannot support collective (team) instruction; developing a team tutoring process is the objective of this work.
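The following is a minimal sketch of this strategy-to-tactic hand-off under our own naming assumptions (the class, enum, and method names are illustrative and do not reflect GIFT’s actual API): the pedagogical module selects a domain-independent strategy, and the domain module resolves it against the current environmental conditions.

```java
/** Illustrative strategy-to-tactic hand-off; names do not reflect GIFT's actual API. */
public class StrategyToTactic {

    /** Domain-independent instructional strategies chosen by the pedagogical module. */
    enum Strategy { PROMPT_FOR_MORE_INFORMATION, GIVE_HINT }

    /** The domain module resolves a strategy against the domain content
     *  ("environmental conditions") to produce a concrete tactic. */
    static String selectTactic(Strategy strategy, String environmentalConditions) {
        switch (strategy) {
            case PROMPT_FOR_MORE_INFORMATION:
                return "Ask the learner to explain their next step on: " + environmentalConditions;
            case GIVE_HINT:
                return "Present a worked sub-step for: " + environmentalConditions;
            default:
                return "No tactic available for this strategy.";
        }
    }

    public static void main(String[] args) {
        // In a mathematics tutor, the conditions are a problem type and difficulty level.
        System.out.println(selectTactic(Strategy.PROMPT_FOR_MORE_INFORMATION,
                "moderately difficult quadratic equation"));
    }
}
```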
Notionally, the LEM for individual learners (Fig. 4) will be modified as shown in Fig. 5 to support effective tutoring for teams. Learner models in GIFT will be expanded to support multiple individual learner models, one for each team member, and a multi-dimensional team model will need to be added. Note that the team data include the behavioral markers identified for team performance and learning antecedents. The GIFT inter-module message set, based on ActiveMQ, will be expanded to support sharing of team state information. The domain module in GIFT will also be modified to accommodate team concepts (learning objectives) and associated measures, and the GIFT authoring tool (GAT) will be updated to allow GIFT ITS developers to define required team knowledge and skills, associated team learning and performance objectives, team instructional experiences (e.g., sets of problems to solve collaboratively or immersive simulation scenarios), and associated methods to acquire and interpret team measures.
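A sketch of the expanded data structures this implies is shown below (the types and fields are our assumptions, not GIFT’s actual classes or message set): one learner model per team member plus a multi-dimensional team model that accumulates behavioral-marker observations for the teamwork states identified above.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative structure for the expanded team model; not GIFT's actual classes. */
public class TeamModelSketch {

    /** One individual learner model, as in the single-learner LEM. */
    record LearnerModel(String learnerId, Map<String, Double> states) {}

    /** Multi-dimensional team model: per-marker counts plus derived teamwork states. */
    static class TeamModel {
        final List<LearnerModel> members;
        final Map<String, Integer> markerCounts = new HashMap<>(); // e.g., "withheld task information"
        final Map<String, Double> teamStates = new HashMap<>();    // e.g., "trust", "cohesion"

        TeamModel(List<LearnerModel> members) { this.members = members; }

        /** Record one observation of a behavioral marker during the scenario. */
        void observeMarker(String marker) {
            markerCounts.merge(marker, 1, Integer::sum);
        }
    }

    public static void main(String[] args) {
        TeamModel team = new TeamModel(List.of(
            new LearnerModel("alice", Map.of("engagement", 0.8)),
            new LearnerModel("bob", Map.of("engagement", 0.6))));
        team.observeMarker("withheld task information");
        System.out.println(team.markerCounts); // {withheld task information=1}
    }
}
```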
In addition, the presence of multiple team members presents new feedback choices. As noted by Bonner et al. (2016) and Walton et al. (2014), all team members might receive feedback (“Team, we need to…”), all team members might receive individuals’ feedback with names attached to re-engage learners (“Alice, remember the goal is…”), individuals might receive private tailored feedback, or a subset of the team might receive feedback. This choice in feedback design dramatically affects both the pedagogy of the learning environment and the volume and complexity of feedback that members receive.
Team members’ cognitive load may be increased if receiving feedback requires the dual task of processing both the content of the feedback and the impact of receiving it. However, even in the cognitively simpler mode, in which all feedback messages are addressed to “Team” and received by everyone regardless of which member’s behavior triggered them, the sheer number of messages received grows rapidly with the number of team members. Thus, the filtering of communications shown at left in Fig. 5 and managed by the instructional tactics selection process is critical to intelligently managing the flow and frequency of feedback to the team. Choosing which feedback messages to pass through to a learner should be based on their priority, on how many communications that learner has recently received, on how much feedback other members have received (to promote equity), and even on the individual learner’s cognitive processing capabilities and current cognitive task load.
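As an illustration of such filtering, the sketch below prioritizes and caps feedback per learner based on message priority, recent message volume, equity with teammates, and cognitive load. All names and threshold values are hypothetical assumptions for illustration only; GIFT's actual instructional tactics selection process is not shown.

    import java.util.Comparator;
    import java.util.List;
    import java.util.stream.Collectors;

    // Sketch of a per-learner feedback filter.
    public class FeedbackFilter {

        private static final int MAX_RECENT_MESSAGES = 3; // per learner, per window (arbitrary)

        public List<FeedbackMessage> filter(List<FeedbackMessage> pending, LearnerContext learner) {
            return pending.stream()
                    // suppress low-priority messages for an overloaded learner
                    .filter(m -> m.priority >= requiredPriority(learner))
                    // highest-priority feedback first
                    .sorted(Comparator.comparingInt((FeedbackMessage m) -> m.priority).reversed())
                    // cap the volume so one learner is not flooded
                    .limit(Math.max(0, MAX_RECENT_MESSAGES - learner.recentMessageCount))
                    .collect(Collectors.toList());
        }

        // Raise the bar when cognitive load is high or the learner has already
        // received more feedback than teammates (equity).
        private int requiredPriority(LearnerContext learner) {
            int threshold = 1;
            if (learner.cognitiveLoadHigh) threshold++;
            if (learner.recentMessageCount > learner.teamAverageMessageCount) threshold++;
            return threshold;
        }
    }

    class FeedbackMessage {
        int priority; // e.g., 1 (low) to 3 (critical)
        String text;
    }

    class LearnerContext {
        int recentMessageCount;
        double teamAverageMessageCount;
        boolean cognitiveLoadHigh;
    }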
It is also worth noting that the team architecture in Fig. 5 not only doubles its observe-assess-respond loop from individual to team, but also doubles the internal storage of almost all its components, to monitor both team skills and task skills. Thus, the Long Term Team Model will contain data about a particular team’s ability to communicate as a team and its ability to perform specific team tasks, such as conducting a patrol. The Long Term Learner Model for the individual will contain data about a specific member’s skills within a team task as well as data about his/her ability to cooperate with others on a team, for example. The instructional strategy selection within the pedagogical module will now take team dynamics into consideration along with task performance (at, above, or below expectations) as strategies are selected. In effect, when we train teams, we aspire to teach members not only how to perform the task but how to effectively and efficiently work as a team. Thus, we are teaching much more content to our learners than previously in a similar timeframe, and must develop the architecture to accommodate this additional teaching.
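A minimal sketch of this doubled storage follows, assuming illustrative (non-GIFT) class names; the point is simply that teamwork skills and taskwork skills are recorded separately at both the team and the member level.

    import java.util.HashMap;
    import java.util.Map;

    // Sketch of the doubled long-term storage described above.
    public class LongTermModels {

        // Team-level records: ability to work as a team vs. to perform team tasks.
        static class LongTermTeamModel {
            final Map<String, Double> teamworkSkills = new HashMap<>(); // e.g., "communication" -> 0.8
            final Map<String, Double> taskworkSkills = new HashMap<>(); // e.g., "conduct patrol" -> 0.6
        }

        // Member-level records: the individual's role skill within team tasks
        // plus individual teamwork skills such as cooperation.
        static class LongTermLearnerModel {
            final Map<String, Double> roleSkillsInTeamTasks = new HashMap<>();
            final Map<String, Double> teamworkSkills = new HashMap<>();
        }
    }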
Applying the MASEM Analysis to the GIFT Architecture
In moving from individual to team tutoring, GIFT will need to be adapted to support not only the assessment of concepts (e.g., learning objectives), but also the assessment of teamwork concepts. Examples of teamwork objectives based on our meta-analysis might include high trust, trustworthiness, collective efficacy, and cohesion, along with timely and relevant communications, low conflict, and rapid resolution of conflict when it does occur. We consider it highly probable that teamwork objectives, measures, and tutor remediation can be generalized across domains, based on the markers identified in our meta-analysis and those found by Johnson et al. (2000) and implemented as sentence openers by Soller (2001).
By way of example, the following is offered as a detailed trace of the changes needed in GIFT enumerations to support team tutoring. Trust accounts for 27% of the variance in team learning, and its behavioral markers include “the amount of task information withheld from fellow team members,” which indicates distrust. In a given team tutoring scenario, the amount of task information available but not disclosed can be compared to the total information available. For a GIFT-based tutor to assess trust from this single measure, a public class would need to be created in Java (shown below), where low, medium, and high trust enumerations would be equated to the number of occurrences of one or more of the markers identified in Tables 8 and 9.
    import java.util.ArrayList;
    import java.util.List;

    // AbstractEnum is GIFT's base enumeration class.
    public class TrustLevelEnum extends AbstractEnum {

        // Registry of the declared trust levels (UNKNOWN, LOW, MEDIUM, HIGH).
        private static final List<TrustLevelEnum> enumList = new ArrayList<TrustLevelEnum>(4);
        private static int index = 0;

        public static final TrustLevelEnum UNKNOWN = new TrustLevelEnum("Unknown", "Unknown");
        public static final TrustLevelEnum LOW = new TrustLevelEnum("Low", "Low");
        public static final TrustLevelEnum MEDIUM = new TrustLevelEnum("Medium", "Medium");
        public static final TrustLevelEnum HIGH = new TrustLevelEnum("High", "High");

        private static final TrustLevelEnum DEFAULT_VALUE = UNKNOWN;

        private TrustLevelEnum(String name, String displayName) {
            super(index++, name, displayName); // constructor signature of AbstractEnum assumed
            enumList.add(this);
        }
    }
This same type of class definition is required for all behavioral markers along with a defined method of data acquisition and a classification strategy (e.g., rules, decision trees).
The changes required for GIFT to support team tutoring, and specifically teamwork measures, will include changes to the engine for managing adaptive pedagogy (eMAP; Fig. 6). The eMAP is the default strategy engine in GIFT and is based on another extensive review of the training literature (Goldberg et al. 2012) that determined best instructional practices for individual instruction. Figure 6 shows the authoring tool interface for configuring pedagogical relationships among individual learner attributes (e.g., motivational level), the quadrant of instruction (rules, examples, recall, or practice, per Merrill 2015), and metadata tags associated with content or feedback.
Based on our team meta-analysis, we will need to develop a set of rules or agent policies that define similar relationships for teamwork strategies. Our goal is for assessment (the classification of teamwork states) to be determined by the occurrence or frequency of the behavioral markers identified in our meta-analysis. This might be accomplished via semantic analysis of speech or text, or via sociometric badges (Pentland 2012; Calacci et al. 2016; Lederman et al. 2016). Other methods for classifying team behaviors and states of interest might include measures of the voice levels and inflection of male and female team members (Titze 1989), content analysis of learner speech or text (Gottschalk and Gleser 1969), or the interpretation of non-verbal communications and behaviors and their influence on team states (Rosenberg and Sillince 2000; Kelly and Barsade 2001). It is likely that a combination of these methods will be needed to support accurate team state classification. Considerations in the design of classification methods include not only accuracy, but also speed, to support real-time or near-real-time feedback, and unobtrusiveness, to allow the team uninterrupted learning experiences.
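A minimal rule-based classifier of the kind envisioned here might look like the sketch below, which reuses the TrustLevelEnum defined earlier. The marker name and threshold values are illustrative assumptions; a fielded classifier would combine several markers and acquisition methods.

    import java.util.Map;

    // Sketch: infer a teamwork state (trust) from the observed frequency
    // of behavioral markers acquired during a session.
    public class TrustClassifier {

        public TrustLevelEnum classify(Map<String, Integer> markerCounts) {
            if (markerCounts.isEmpty()) {
                return TrustLevelEnum.UNKNOWN;  // no marker data acquired yet
            }
            // e.g., count of "withheld task information" events this session
            int withheld = markerCounts.getOrDefault("taskInformationWithheld", 0);

            if (withheld >= 5) {
                return TrustLevelEnum.LOW;      // frequent withholding signals distrust
            } else if (withheld >= 2) {
                return TrustLevelEnum.MEDIUM;
            }
            return TrustLevelEnum.HIGH;         // information shared freely
        }
    }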
A next step could be to associate the classification of any teamwork state (e.g., trust) with the sentence openers derived from Johnson, Johnson, and Stanne (2000) to gradually train team members to communicate optimally in GIFT-mediated tutoring experiences.
Moving forward, as GIFT evolves from a decision-tree-based architecture into a multi-agent architecture, software agents will be developed to detect and interpret individual learner behaviors as they relate to the team concepts identified herein. Our design goals are for these agents to be reactive, proactive, and cooperative. One implementation under consideration would assign a generalized personal agent to detect and understand the behaviors of each individual on a team. While the focus of these agents would be on team members, they must also be cognizant of and responsive to changing conditions in the environment. The agent architecture must capture how individual behaviors relate to team goals and team tutoring policies, and agents must be active in enforcing and updating policies to optimize team goals. The behavioral markers identified herein form a foundation for the measurement and assessment of team states. Policies based on these markers should drive the actions of ITSs as they perceive team states and then select appropriate strategies to optimize team learning, performance, satisfaction, and viability.
Perception-action coupling (Fig. 7) highlights the need for the agent-based tutor to be cognizant of the changing conditions of both the environment and the learner. These percepts are used by the agent-based tutor to continuously evaluate the effectiveness of the policies that drive tutor actions. The evaluation of policies may lead to changes through reinforcement learning mechanisms and may effect changes to future tutor actions. The separate treatment of perception-action cycles for the learner and the environment has a theoretical basis in Vygotsky’s (1987) Zone of Proximal Development (ZPD). The ZPD is the region of a learning experience (e.g., a scenario or problem-solving event) where the challenge level of the experience is balanced with the learner’s competence (e.g., ability to solve the problem). According to Vygotsky, this balance is necessary to keep the learner engaged in the learning process.
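The following sketch illustrates this perception-action coupling as a simple agent loop, with separate percepts for the learner and the environment and a policy updated from observed outcomes (e.g., via reinforcement learning). All types are hypothetical placeholders rather than GIFT classes.

    // Sketch of an agent-based tutor's perception-action cycle.
    public class TutorAgent {

        private Policy policy;

        public TutorAgent(Policy initialPolicy) {
            this.policy = initialPolicy;
        }

        // One cycle: perceive -> select action -> act -> evaluate and update policy.
        public void step(LearnerPercept learner, EnvironmentPercept environment) {
            TutorAction action = policy.select(learner, environment);
            double reward = action.execute(); // observed effect on learning/engagement
            policy = policy.update(learner, environment, action, reward);
        }
    }

    interface Policy {
        TutorAction select(LearnerPercept l, EnvironmentPercept e);
        Policy update(LearnerPercept l, EnvironmentPercept e, TutorAction a, double reward);
    }

    interface TutorAction { double execute(); }
    interface LearnerPercept { }
    interface EnvironmentPercept { }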
For team tutoring to be effective, the agent-based tutor must have a model of each learner’s domain competency based on their assigned role(s) and task(s). The engagement of every team member can affect the learning, performance, satisfaction, and viability of the team. So it is important for the tutor to perceive interactions between each individual learner and the environment and understand the impact of those interactions on learning objectives. It is also critical to perceive and understand the impact of interactions between team members. All of these interactions constitute the basis for a team model as identified by the behavioral markers discussed herein.
In addition to being reactive to changes in the environment and in the learners on a team, agents should be proactive in taking initiative to progress toward team goals. They should be capable of recognizing conditions that represent opportunities, and of learning and adapting to enhance the instructional experiences of individuals and teams. Finally, agents should be cooperative in sharing information about the learners on a team to develop a comprehensive picture of the whole environment, including the state(s) of the team. Together, GIFT agents should work collaboratively to help learners achieve long-term learning goals.
Mechanisms have already been implemented in GIFT to track individual achievements during and between tutoring sessions. These achievement statements form the basis of a long-term learner model (LTLM) which is maintained in a learner record store. Mechanisms are needed to expand the LTLM to include team-based achievements, and to classify team competency states based on individual learner competencies. Finally, we must address how a GIFT-based ITS will respond effectively to the classification of various team states.
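As a sketch of this expansion, the example below records achievement statements in an actor-verb-object style, with the team itself as the actor for team-based achievements. The types shown are illustrative assumptions, not the actual GIFT or learner record store schema.

    import java.time.Instant;
    import java.util.ArrayList;
    import java.util.List;

    // Sketch of extending the LTLM with team-based achievement statements.
    public class TeamAchievementStore {

        static class AchievementStatement {
            final String actor;   // a learner ID or a team ID
            final String verb;    // e.g., "achieved"
            final String object;  // e.g., "timely-communication objective"
            final Instant when = Instant.now();

            AchievementStatement(String actor, String verb, String object) {
                this.actor = actor;
                this.verb = verb;
                this.object = object;
            }
        }

        private final List<AchievementStatement> statements = new ArrayList<>();

        // Team-based achievements use the team as the actor, so team competency
        // can later be classified from both team and member statements.
        public void recordTeamAchievement(String teamId, String objective) {
            statements.add(new AchievementStatement(teamId, "achieved", objective));
        }

        public void recordMemberAchievement(String learnerId, String objective) {
            statements.add(new AchievementStatement(learnerId, "achieved", objective));
        }
    }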
Discussion
While a significant focus of the research described in this article is on the advantages offered by a domain-independent approach to team tutoring, we are compelled also to discuss the potential limitations of domain-independent approaches to ITS design. The flexibility in authoring and the potential for reuse offered by domain-independent approaches may be offset by lower levels of effectiveness. In GIFT, we attempt to overcome this potential loss of effectiveness by balancing generalizable strategies (domain-independent plans for action) with domain-dependent tactics (domain-dependent actions by the tutor).
In the application of our meta-analysis to GIFT, our goal was to identify significant antecedents to team learning and team performance. The structural equation models (SEMs) produced as part of this meta-analysis were derived by examining a variety of approaches and effect sizes in the literature. While all the effect sizes included in this study were relevant to team instruction, we caution ITS designers and authors that the studies spanned a variety of task domains. Generalizing the results across task domains and applying them to specific domains may or may not provide the same level of effectiveness as in the original domain-dependent study included in our meta-analysis. For example, conflict was negatively correlated with team performance and explained only 3% of its variance. Although identifying behavioral markers of conflict should aid the classification of a team state of conflict with a high level of accuracy, the resolution of conflict facilitated by the ITS may or may not explain the same level of performance variance. The application of the SEMs produced in this study will not produce the same effect in every instance for every team.
Recommendations and Future Research
The focus of the meta-analysis described herein is to identify behavioral markers which might be used to classify behavioral states (e.g., trust or conflict) which significantly contribute to either the performance or learning of teams. A next step in the research of effective adaptive instruction for teams is to expand the current meta-analysis and then update the structural equation models (SEMs) for each of the seven primary themes (communication, cooperation, coordination, cognition, coaching, conflict, and conditions) as antecedents of our outcomes of interest (i.e., learning, performance, satisfaction, and viability). This should enhance confidence in the SEMs developed herein by providing additional statistical power to confirm relationships between antecedents and outcomes.
Specifically, the relationships between conflict management and team learning, psychological safety and team learning, cohesion and team learning, interpersonal processes and team performance, and mutual support and team satisfaction, as well as all of the antecedents of team viability, would benefit from the identification of additional studies to enhance confidence in the stated results. Future research may also include the analysis of these outcomes and perhaps new team outcomes (e.g., antecedents of the transfer of training to new training or operational experiences). Over time, the collection of data related to the influence of these behaviors on team outcomes will refine our understanding and drive future agent-based policies for ITSs.
Low-cost, unobtrusive sensing methods should continue to pave the way for adding real-time physiological classifiers, and the resulting markers, to the current list of behavioral markers. Along with behavioral markers, physiological markers should provide confirmatory evidence of both individual learner and team states. The addition of physiological measures should increase the classification accuracy of team models and allow ITSs to apply more effective strategies for team performance and team learning.
While team mental models accounted for only 10% of the variance in team performance, their importance in conveying learning objectives and progress, maintaining engagement, and normalizing expectations should not be underestimated. Specifically, behavioral and physiological measures are needed to inform a team cognitive state model capable of assessing mental workload and engagement for each individual member and across the team, as this informs the tutor’s selection of team strategies and tactics within GIFT (Fletcher and Sottilare 2013).
Finally, referring back to our review of the related research in the AIED literature, we recommend moving forward with implementation of the following functional capabilities in GIFT’s testbed function:
- Neuronal synchrony – evaluate low-cost, portable neurophysiological sensors and develop interfaces/sensor configurations for GIFT to support additional research on the effect of neuronal synchrony in teams across task domains and populations.
- Cooperative learning – implement Soller’s (2001) Collaborative Learning Conversation Skill Taxonomy to support additional research on cooperative learning and to tie the behavioral markers and classifications identified in this meta-analysis to concrete actions by the tutor that encourage improved teamwork skills.
- Cooperative learning – examine and cross-reference the academically productive talk (APT) strategies of Adamson et al. (2014) with the generalized strategies resulting from this meta-analysis.
- Intelligent support for learning in groups – implement capabilities in GIFT to support evaluation of the effect of various team tutoring strategies across task domains, team member roles, and leadership styles to determine optimal support strategies for intelligent agents.
- Remediation for negative teamwork behaviors – examine the effect of methods to remediate negative teamwork states.
- Broad application of teamwork models – apply the antecedents discussed herein to taskwork team training and collaborative learning.
- External system interface design – extend the GIFT application programming interface (API) to interact with external team-based systems, including team proxies (e.g., AutoTutor trialogues and Betty’s Brain), to enable team-based tutoring research.
Conclusions
The core contributions of this research are: 1) evidence of the effect of teamwork behaviors as antecedents of team performance and team learning; 2) structural equation models of team performance and team learning for use in ITSs; and 3) behavioral markers which can be used as measures for the assessment of team performance and team learning during adaptive instruction. Together, the significant antecedents described herein form a model to support the adaptive tutoring of teams.
Collective efficacy, cohesion, communication, and leadership were significant antecedents of team performance. Trust, cohesion, conflict, and conflict management were identified as significant antecedents of team learning. While direct antecedents of team performance and team learning were identified, indirect influencers were also identified in our ontologies. Separate ontologies for team performance and team learning were constructed based on our MASEM process, but an expansion of this analysis over the next few years as new studies are conducted should result in even higher degrees of confidence for some of the meta-analytic results shown herein.
To provide a means to measure team outcomes, we identified six sets of behavioral markers: trust, collective efficacy, cohesion, communication, conflict, and conflict management. The markers formed the basis of a process to identify team member behaviors associated with antecedents of team performance and team learning. They also pave the way toward methods to acquire team state data and classify team states. We also reviewed next steps for applying our meta-analytic findings to GIFT and discussed the additional research needed to fully realize our vision for the adaptive tutoring of teams.
The learning and performance models discussed in this article are focused on the classification of teamwork behaviors. To this end, we have developed a set of markers that can be used within ITSs to identify various teamwork behaviors that indicate both individual and team learning and performance. Whereas the work of Adamson et al. (2014) focused on the facilitation of academically productive talk (APT), our goal was to examine group interactions to determine whether team communication was productive and supportive of learning and performance.
Our approach differs from that of Adamson and others in the AIED and CSCL literature in that it focused on finding effect sizes across a large swath of the team literature to construct a relational model of teamwork. Adamson and colleagues did not argue that the foundation provided by their approach was sufficient; rather, they stressed that their results were part of “a larger, more thorough and systematic investigation of the space of possibilities” (Adamson et al. 2014, p. 121). We concur, and likewise do not argue that our approach is a complete solution, but rather a necessary step toward a much larger and more complex solution.
References
Adamson, D., Ashe, C., Jang, H., Yaron, D., & Rosé, C. (2013). Intensification of group knowledge exchange with academically productive talk agents. Proceedings of the 10th International Conference on Computer Supported Collaborative Learning, Madison Wisconsin, July 2013.
Adamson, D., Dyke, G., Jang, H., & Rosé, C. P. (2014). Towards an agile approach to adapting dynamic collaboration support to student needs. International Journal of Artificial Intelligence in Education, 24(1), 92–124.
Ai, H., Kumar, R., Nguyen, D., Nagasunder, A., & Rosé, C. P. (2010). Exploring the effectiveness of social capabilities and goal alignment in computer supported collaborative learning. In Proceedings of Intelligent Tutoring Systems, Lecture Notes in Computer Science, volume 6095, pp 134–143.
Arthur Jr., W., Bennett Jr., W., & Huffcutt, A. I. (2001). Conducting meta-analysis using SAS. Mahwah: Erlbaum.
Bandura, A. (1997). Self-efficacy: The exercise of control. London: Macmillan Publishers.
Barker, J., Tjosvold, D., & Andrews, I. R. (1988). Conflict approaches of effective and ineffective project managers: a field study in a matrix organization. Journal of Management Studies, 25(2), 167–178.
Beal, D. J., Cohen, R. R., Burke, M. J., & McLendon, C. L. (2003). Cohesion and performance in groups: a meta-analytic clarification of construct relations. Journal of Applied Psychology, 88(6), 989–1004.
Bell, S. T. (2007). Deep-level composition variables as predictors of team performance: a meta-analysis. Journal of Applied Psychology, 92, 595–615.
Biernat, M., & Manis, M. (1994). Shifting standards and stereotype-based judgments. Journal of Personality and Social Psychology, 66(1), 5–20.
Biswas, G., Segedy, J. R., & Bunchongchit, K. (2016). From design to implementation to practice a learning by teaching system: Betty’s Brain. International Journal of Artificial Intelligence in Education, 26(1), 350–364.
Bloom, B. S. (1984). The 2 sigma problem: the search for methods of group instruction as effective as one-to-one tutoring. Educational Researcher, 13(6), 4–16.
Bonner, D., Slavina, A., MacAllister, A., Holub, J., Gilbert, S., Sinatra, A. M., Winer, E., & Dorneich, M. (2016). The hidden challenges of team tutor development. In R. Sottilare & S. Ososky (Eds.), Proceedings of 4th Annual GIFT Users Symposium (GIFTSym4) (pp. 49–60). Orlando: Army Research Laboratory.
Borman, W. C., & Smith, T. N. (2012). The use of objective measures as criteria in I/O psychology. The Oxford handbook of personnel assessment and selection (p. 532). New York: Oxford University Press, Inc.
Bound, J. (1989). Self-reported vs. objective measures of health in retirement models (No. w2997). Cambridge: National Bureau of Economic Research.
Bray, S. R. (2004). Collective efficacy, group goals, and group performance of a muscular endurance task. Small Group Research, 35(2), 230–238.
Budescu, D. V., & Bruderman, M. (1995). The relationship between the illusion of control and the desirability bias. Journal of Behavioral Decision Making, 8(2), 109–125.
Bunderson, J. S., & Sutcliffe, K. M. (2002). Comparing alternative conceptualizations of functional diversity in management teams: process and performance effects. Academy of Management Journal, 45(5), 875–893.
Burke, C. S., Stagl, K. C., Klein, C., Goodwin, G. F., Salas, E., & Halpin, S. M. (2006). What type of leadership behaviors are functional in teams? A meta-analysis. The Leadership Quarterly, 17, 288–307.
Burns, H. L., & Capps, C. G. (2013). Foundations of intelligent tutoring systems: An introduction. In M. C. Polson & J. J. Richardson (Eds.), Foundations of intelligent tutoring systems (pp. 1–19). Hillsdale: Psychology Press.
Byrne, M. (2001). Hermeneutics as a methodology for textual analysis. AORN Journal, 73(5), 968–970.
Cai, Z., Feng, S., Baer, W., & Graesser, A. (2014). Instructional strategies in Trialogue-based intelligent tutoring systems. Design Recommendations for Intelligent Tutoring Systems, 2, 225–235.
Calacci, D., Lederman, O., Shrier, D., & Pentland, A. S. (2016). Breakout: An open measurement and intervention tool for distributed peer learning groups. Paper presented at the International Conference on Social Computing, Behavioral-Cultural Modeling, & Prediction and Behavior Representation in Modeling and Simulation (SBP-BRiMS), Washington, D.C.
Campion, M. A., Medsker, G. J., & Higgs, A. C. (1993). Relations between work group characteristics and effectiveness: implications for designing effective work groups. Personnel Psychology, 46(4), 823–847.
Cannon-Bowers, J. A., & Bowers, C. (2011). Team development and functioning. In S. Zedeck (Ed.), APA handbook of industrial and organizational psychology, Vol 1: Building and developing the organization (pp. 597–650). Washington, DC: American Psychological Association.
Cannon-Bowers, J. A., Tannenbaum, S. I., Salas, E., & Volpe, C. E. (1995). Defining competencies and establishing team training requirements. Team effectiveness and decision making in organizations (pp. 333–380). San Francisco: Jossey-Bass.
Carless, S. A., & De Paola, C. (2000). The measurement of cohesion in work teams. Small Group Research, 31(1), 71–88.
Carron, A. V., Widmeyer, W. N., & Brawley, L. R. (1985). The development of an instrument to assess cohesion in sport teams: the group environment questionnaire. Journal of Sport Psychology, 7(3), 244–266.
Chang, A., & Bordia, P. (2001). A multidimensional approach to the group cohesion-group performance relationship. Small Group Research, 32(4), 379–405.
Chang, J., Sy, T., & Choi, J. (2012). Team emotional intelligence and performance: interactive dynamics between leaders and members. Small Group Research, 43, 75–104. doi:10.1177/1046496411415692.
Chaudhuri, S., Kumar, R., Howley, I., & Rosé, C. P. (2009). Engaging collaborative learners with helping agents. In Proceedings of the 2009 conference on artificial intelligence in education: Building learning systems that care: From knowledge representation to affective modeling (pp. 365–372). IOS Press.
Chen, G., Thomas, B., & Wallace, J. C. (2005). A multilevel examination of the relationships among training outcomes, mediating regulatory processes, and adaptive performance. Journal of Applied Psychology, 90(5), 827.
Cheng, H. H., & Yang, H. L. (2011). Student team projects in information systems development: measuring collective creative efficacy. Australasian Journal of Educational Technology, 27(6), 881–895.
Cook, J., & Wall, T. (1980). New work attitude measures of trust, organizational commitment and personal need non-fulfillment. Journal of Occupational Psychology, 53(1), 39–52.
Costa, A. C. (2003). Understanding the nature and the antecedents of trust within work teams. The trust process in organizations: Empirical studies of the determinants and the process of trust development (pp. 105–124). Cheltenham: Edward Elgar Publishing.
Craig, T. Y., & Kelly, J. R. (1999). Group cohesiveness and creative performance. Group Dynamics: Theory, research, and practice, 3(4), 243.
Cronin, M. A., Bezrukova, K., Weingart, L. R., & Tinsley, C. H. (2011). Subgroups within a team: the role of cognitive and affective integration. Journal of Organizational Behavior, 32(6), 831–849.
Davis, J. B. (2007). Statistics using SAS enterprise guide. Cary: SAS Institute.
De Dreu, C. K. (2007). Cooperative outcome interdependence, task reflexivity, and team effectiveness: a motivated information processing perspective. Journal of Applied Psychology, 92(3), 628.
De Dreu, C. K., & Weingart, L. R. (2003). Task versus relationship conflict, team performance, and team member satisfaction: a meta-analysis. Journal of Applied Psychology, 88(4), 741–749.
De Jong, B. A., & Elfring, T. (2010). How does trust affect the performance of ongoing teams? The mediating role of reflexivity, monitoring, and effort. Academy of Management Journal, 53(3), 535–549.
Dennis, M., Masthoff, J., & Mellish, C. (2016). Adapting progress feedback and emotional support to learner personality. International Journal of Artificial Intelligence in Education, 26(3), 877–931.
Dillenbourg, P. (1999). What do you mean by collaborative learning. Collaborative-learning: Cognitive and Computational Approaches, 1, 1–15.
Dillenbourg, P., & Hong, F. (2008). The mechanics of CSCL macro scripts. The International Journal of Computer-Supported Collaborative Learning, 3(1), 5–23.
Dirks, K. T. (2000). Trust in leadership and team performance: evidence from NCAA basketball. Journal of Applied Psychology, 85(6), 1004.
Dorsey, D., Russell, S., Keil, C., Campbell, G., Van Buskirk, W., & Schuck, P. (2009). Measuring teams in action: Automated performance measurement and feedback in simulation-based training. Team effectiveness in complex organizations: Cross-disciplinary perspectives and approaches (pp. 351–381). Routledge: Taylor and Francis.
du Boulay, B. (2016). Recent meta-reviews and meta-analyses of AIED systems. International Journal of Artificial Intelligence in Education, 26(1), 536–537.
Dyer, L. (1984). Studying human resource strategy: an approach and an agenda. Industrial Relations: A Journal of Economy and Society, 23(2), 156–169.
Dyke, G., Adamson, D., Howley, I., & Rosé, C. P. (2013). Enhancing scientific reasoning and explanation skills with conversational agents. IEEE Transactions on Learning Technologies, 6(3), 240–247.
Edmonds, W. A., Tenenbaum, G., Kamata, A., & Johnson, M. B. (2009). The role of collective efficacy in adventure racing teams. Small Group Research, 40(2), 163–180.
Edmondson, A., Bohmer, R. M., & Pisano, G. P. (2001). Disrupted routines: Team learning and new technology implementation in hospitals. Administrative Science Quarterly, 46(4), 685–716.
Erkens, G., & Janssen, J. (2008). Automatic coding of dialogue acts in collaboration protocols. International Journal of Computer Supported Collaborative Learning, 3, 447–470.
Espinosa, J. A., Cummings, J. N., & Pickering, C. (2012). Time separation, coordination, and performance in technical teams. IEEE Transactions on Engineering Management, 59(1), 91–103.
Faraj, S., & Sproull, L. (2000). Coordinating expertise in software development teams. Management Science, 46(12), 1554–1568.
Fletcher, T. D., & Major, D. A. (2006). The effects of communication modality on performance and self-ratings of teamwork components. Journal of Computer-Mediated Communication, 11(2), 557–576.
Fletcher, J. D., & Sottilare, R. (2013). Shared mental models of cognition for intelligent tutoring of teams. In R. Sottilare, A. Graesser, X. Hu, & H. Holden (Eds.), Design recommendations for intelligent tutoring systems: Volume 1- learner modeling. Orlando: Army Research Laboratory ISBN 978-0-9893923-0-3.
Gajendran, R. S., & Joshi, A. (2012). Innovation in globally distributed teams: the role of LMX, communication frequency, and member influence on team decisions. Journal of Applied Psychology, 97(6), 1252.
Goldberg, B., Brawner, K., Sottilare, R., Tarr, R., Billings, D., & Malone, M. (2012). Use of evidence-based strategies to expand extensibility of adaptive tutoring technologies. In Proceedings of the Interservice/Industry Training, Simulation & Education Conference, Orlando, Florida, December 2012.
Gottschalk, L. A., & Gleser, G. C. (1969). The measurement of psychological states through the content analysis of verbal behavior. Oakland: University of California Press.
Greer, J. E., Mccalla, G., Collins, J. A., Kumar, V. S., Meagher, P., & Vassileva, J. (1998). Supporting peer help and collaboration in distributed workplace environments. International Journal of Artificial Intelligence in Education (IJAIED), 9, 159–177.
Greer, L. L., Homan, A. C., De Hoogh, A. H., & Den Hartog, D. N. (2012). Tainted visions: the effect of visionary leader behaviors and leader categorization tendencies on the financial performance of ethnically diverse teams. Journal of Applied Psychology, 97(1), 203.
Gupta, A., Mattarelli, E., Seshasai, S., & Broschak, J. (2009). Use of collaborative technologies and knowledge sharing in co-located and distributed teams: towards the 24-h knowledge factory. The Journal of Strategic Information Systems, 18(3), 147–161.
Gupta, V. K., Huang, R., & Niranjan, S. (2010). A longitudinal examination of the relationship between team leadership and performance. Journal of Leadership & Organizational Studies, 17(4), 335–350.
Guzzo, R. A., Yost, P. R., Campbell, R. J., & Shea, G. P. (1993). Potency in groups: articulating a construct. British Journal of Social Psychology, 32(1), 87–106.
Haas, M. R. (2006). Acquiring and applying knowledge in transnational teams: The roles of cosmopolitans and locals. Organization Science, 17(3), 367–384.
Hackman, J. R. (1987). The design of work teams. In J. W. Lorsch (Ed.), Handbook of organizational behavior (pp. 315–342). Englewood Cliffs: Prentice-Hall.
Hackman, J. R., & Wageman, R. (2005). A theory of team coaching. Academy of Management Review, 30(2), 269–287.
Hall, L., Tazzyman, S., Hume, C., Endrass, B., Lim, M. Y., Hofstede, G., et al. (2015). Learning to overcome cultural conflict through engaging with intelligent agents in synthetic cultures. International Journal of Artificial Intelligence in Education, 25(2), 291–317.
Henry, K. B., Arrow, H., & Carini, B. (1999). A tripartite model of group identification theory and measurement. Small Group Research, 30(5), 558–581.
Hirst, G. (2009). Effects of membership change on open discussion and team performance: the moderating role of team tenure. European Journal of Work and Organizational Psychology, 18(2), 231–249.
Hobman, E. V., & Bordia, P. (2006). The role of team identification in the dissimilarity-conflict relationship. Group Processes & Intergroup Relations, 9(4), 483–507.
Hoegl, M., & Gemuenden, H. G. (2001). Teamwork quality and the success of innovative projects: a theoretical concept and empirical evidence. Organization Science, 12(4), 435–449.
Hsu, M. H., Chen, I. Y. L., Chiu, C. M., & Ju, T. L. (2007). Exploring the antecedents of team performance in collaborative learning of computer software. Computers & Education, 48(4), 700–718.
Hu, L. T., & Bentler, P. M. (1995). Evaluating model fit. In R. Hoyle (Ed.), Structural equation modeling: Issues, concepts, and applications (pp. 76–99). Newbury Park: Sage.
Huang, J. C. (2010). Unbundling task conflict and relationship conflict: the moderating role of team goal orientation and conflict management. International Journal of Conflict Management, 21(3), 334–355.
Huang, S., & Cummings, J. N. (2011). When critical knowledge is most critical centralization in knowledge-intensive teams. Small Group Research, 42(6), 669–699.
Hunter, J. E., & Schmidt, F. L. (2004). Methods of meta-analysis: Correcting error and bias in research findings. London: Sage.
Israel, J., & Aiken, R. (2007). Supporting collaborative learning with an intelligent web-based system. International Journal of Artificial Intelligence in Education, 17(1), 3–40.
Jehn, K. A. (1995). A multimethod examination of the benefits and detriments of intragroup conflict. Administrative Science Quarterly, 40(2), 256–282.
Jehn, K. A., & Mannix, E. A. (2001). The dynamic nature of conflict: a longitudinal study of intragroup conflict and group performance. Academy of Management Journal, 44(2), 238–251.
Jehn, K. A., Chadwick, C., & Thatcher, S. M. (1997). To agree or not to agree: the effects of value congruence, individual demographic dissimilarity, and conflict on workgroup outcomes. International Journal of Conflict Management, 8(4), 287–305.
Johns, J. H., Bickel, M. D., Blades, A. C., Creel, J. B., Gatling, W. S., Hinkle, J. M., et al. (1984). Cohesion in the US military. Washington DC: National Defense University.
Johnson, R. T., & Johnson, D. W. (1986). Cooperative learning in the science classroom. Science and Children, 24, 31–32.
Johnson, D. W., & Johnson, R. T. (1999). What makes cooperative learning work. In D. Kluge, S. McGuire, D. Johnson, & R. Johnson (Eds), JALT applied materials: cooperative learning (pp. 23–36). Tokyo: Japan Association for Language Teaching.
Johnson, D. W., Johnson, R. T., & Stanne, M. B. (2000). Cooperative learning methods: A meta-analysis. Minneapolis: University of Minnesota.
Jones, G. R. (1986). Socialization tactics, self-efficacy, and newcomers' adjustments to organizations. Academy of Management Journal, 29(2), 262–279.
Jong, A. D., Ruyter, K. D., & Lemmink, J. (2005). Service climate in self-managing teams: mapping the linkage of team member perceptions and service performance outcomes in a business-to-business setting. Journal of Management Studies, 42(8), 1593–1620.
Jöreskog, K. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34, 183–202.
Jöreskog, K. G., & Sörbom, D. (2004). LISREL 8.71: Structural equation modeling with the SIMPLIS command language. Chicago: Scientific Software International.
Kelly, J. R., & Barsade, S. G. (2001). Mood and emotions in small groups and work teams. Organizational Behavior and Human Decision Processes, 86(1), 99–130.
Klein, C., DiazGranados, D., Salas, E., Le, H., Burke, C. S., Lyons, R., & Goodwin, G. F. (2009). Does team building work? Small Group Research, 40, 181–222.
Kostopoulos, K. C., & Bozionelos, N. (2011). Team exploratory and exploitative learning: psychological safety, task conflict, and team performance. Group & Organization Management, 36(3), 385–415.
Kulik, J. A., & Fletcher, J. D. (2015). Effectiveness of intelligent tutoring systems: a meta-analytic review. Review of Educational Research, X(X), 1–37. doi:10.3102/0034654315581420.
Kumar, R., & Rose, C. P. (2011). Architecture for building conversational agents that support collaborative learning. IEEE Transactions on Learning Technologies, 4(1), 21–34.
Kumar, R., & Kim, J. (2014). Special Issue on Intelligent Support for Learning in Groups. International Journal of Artificial Intelligence in Education, 24, 1. doi:10.1007/s40593-013-0013-5.
Kumar, R., Rosé, C. P., Wang, Y. C., Joshi, M., & Robinson, A. (2007). Tutorial dialogue as adaptive collaborative learning support. Proceedings of the 2007 conference on artificial intelligence in education, 383–390.
Kumar, R., Ai, H., Beuth, J., & Rosé, C. P. (2010). Socially-capable conversational tutors can be effective in collaborative learning situations. In Proceedings of intelligent tutoring systems, lecture notes in computer science volume 6095, pp 156–164.
Lederman, O., Calacci, D., MacMullen, A., Fehder, D. C., Murray, F. E., & Pentland, A. S. (2016). Open badges: A low-cost toolkit for measuring team communication and dynamics. Paper presented at the International Conference on Social Computing, Behavioral-Cultural Modeling, & Prediction and Behavior Representation in Modeling and Simulation (SBP-BRiMS), Washington, D.C.
Lee, P., Gillespie, N., Mann, L., & Wearing, A. (2010). Leadership and trust: Their effect on knowledge sharing and team performance. Management learning, 41(4), 473–491.
Leelawong, K., & Biswas, G. (2008). Designing learning by teaching agents: the Betty's Brain system. International Journal of Artificial Intelligence in Education, 18(3), 181–208.
Lehman, B., D'Mello, S., Strain, A., Mills, C., Gross, M., Dobbins, A., et al. (2013). Inducing and tracking confusion with contradictions during complex learning. International Journal of Artificial Intelligence in Education, 22(1–2), 85–105.
Lent, R. W., Schmidt, J., & Schmidt, L. (2006). Collective efficacy beliefs in student work teams: relation to self-efficacy, cohesion, and performance. Journal of Vocational Behavior, 68(1), 73–84.
Li, H., Rosenthal, R., & Rubin, D. B. (1996). Reliability of measurement in psychology: from Spearman-Brown to maximal reliability. Psychological Methods, 1(1), 98.
Luhtanen, R., & Crocker, J. (1992). A collective self-esteem scale: self-evaluation of one's social identity. Personality and Social Psychology Bulletin, 18(3), 302–318.
Ma, W., Adesope, O. O., Nesbit, J. C., & Liu, Q. (2014). Intelligent tutoring systems and learning outcomes: a meta-analysis. Journal of Educational Psychology, 106(4), 901–918. doi:10.1037/a0037123.
Mathieu, J. E., Rapp, T. L., Maynard, M. T., & Mangos, P. M. (2009). Interactive effects of team and task shared mental models as related to air traffic controllers' collective efficacy and effectiveness. Human Performance, 23(1), 22–40.
Mayer, R. C., Davis, J. H., & Schoorman, F. D. (1995). An integrative model of organizational trust. Academy of Management Review, 20, 709–734.
McCroskey, J. C., & McCain, T. A. (1974). The measurement of interpersonal attraction. Speech Monographs, 41(3), 261–266.
McManus, M., & Aiken, R. (1993). The group leader paradigm in an intelligent collaborative learning system. Proceedings of the 7th world conference on Artificial Intelligence in Education (AIED 93), Edinburgh, Scotland, 249-256.
McManus, M. M., & Aiken, R. M. (1995). Monitoring computer-based collaborative problem solving. Journal of Artificial Intelligence in Education, 6(4), 307–336.
McManus, M. M., & Aiken, R. M. (2016). Supporting effective collaboration: Using a rearview mirror to look forward. International Journal of Artificial Intelligence in Education, 26(1), 365–377.
Meister, D. (1985). Behavioral analysis and measurement methods. Hoboken: Wiley-Interscience.
Merrill, D. (2015). Component display theory. Instructional Design.org. Retrieved from http://www.instructionaldesign.org/theories/component-display.html
Mesmer-Magnus, J., & Viswesvaran, C. (2010). The role of pre-training interventions in learning: a meta-analysis and integrative review. Human Resource Management Review, 20(4), 261–282.
Miller, D. C. (1964). Handbook of research design and social measurement. New York: David Mckay Company.
Muehlenbrock, M., Tewissen, F., & Hoppe, U. (1998). A framework system for intelligent support in open distributed learning environments. International Journal of Artificial Intelligence in Education (IJAIED), 9, 256–274.
North Atlantic Treaty Organization. (2012). Technical area plan for the human factors and medicine Panel’s research task group 237, Assessment of Intelligent Tutoring System Technologies and Opportunities.
Nunnally, J. C. (1978). Psychometric theory. New York: McGraw-Hill.
O'Reilly III, C. A., Caldwell, D. F., & Barnett, W. P. (1989). Work group demography, social integration, and turnover. Administrative Science Quarterly, 34, 21–37.
Pane, J. F., Griffin, B. A., McCaffrey, D. F., & Karam, R. (2014). Effectiveness of cognitive tutor algebra I at scale. Educational Evaluation and Policy Analysis, 36(2), 127–144. doi:10.3102/0162373713507480.
Pearson, A. W., Ensley, M. D., & Amason, A. C. (2002). An assessment and refinement of Jehn's intragroup conflict scale. International Journal of Conflict Management, 13(2), 110–126.
Pentland, A. (2012). The new science of building great teams. Harvard Business Review, 90(4), 60–69.
Podsakoff, P. M., MacKenzie, S. B., & Ahearne, M. (1997). Moderating effects of goal acceptance on the relationship between group cohesiveness and productivity. Journal of Applied Psychology, 82(6), 974.
Ponterotto, J. G., Gretchen, D., Utsey, S. O., Stracuzzi, T., & Saya, R. (2003). The multigroup ethnic identity measure (MEIM): psychometric review and further validity testing. Educational and Psychological Measurement, 63(3), 502–515.
Quintana, S. M., & Maxwell, S. E. (1999). Implications of recent developments in structural equation modeling for counseling psychology. The Counseling Psychologist, 27(4), 485–527.
Riggs, M. L., & Knight, P. A. (1994). The impact of perceived group success-failure on motivational beliefs and attitudes: a causal model. Journal of Applied Psychology, 79(5), 755.
Rispens, S. (2012). The influence of conflict issue importance on the co-occurrence of task and relationship conflict in teams. Applied Psychology, 61(3), 349–367.
Rosé, C., & VanLehn, K. (2005). An evaluation of a hybrid language understanding approach for robust selection of tutoring goals. International Journal of Artificial Intelligence in Education, 15(4), 325–355.
Rosé, C. P., Jordan, P., Ringenberg, M., Siler, S., VanLehn, K., & Weinstein, A. (2001). Interactive conceptual tutoring in Atlas-Andes. In Proceedings of AI in education 2001 conference, 151–153.
Rosen, Y. (2015). Computer-based assessment of collaborative problem solving: exploring the feasibility of human-to-agent approach. International Journal of Artificial Intelligence in Education, 25(3), 380–406.
Rosenberg, D., & Sillince, J. A. (2000). Verbal and nonverbal communication in computer mediated settings. International Journal of Artificial Intelligence in Education (IJAIED), 11, 299–319.
Rosenfeld, L. B., & Gilbert, J. R. (1989). The measurement of cohesion and its relationship to dimensions of self-disclosure in classroom settings. Small Group Research, 20(3), 291–301.
Rutkowski, A. F., Saunders, C., Vogel, D., & Van Genuchten, M. (2007). “Is it already 4 am in your time zone?” focus immersion and temporal dissociation in virtual teams. Small Group Research, 38(1), 98–129.
Salas, E. (2015). Team training essentials: A research-based guide. London: Routledge.
Salas, E., Dickinson, T. L., Converse, S. A., & Tannenbaum, S. I. (1992). Toward an understanding of team performance and training. In R. W. Swezey, E. Salas, R. W. Swezey, & E. Salas (Eds.), Teams: Their training and performance (pp. 3–29). Westport: Ablex Publishing.
Sargent, L. D., & Sue-Chan, C. (2001). Does diversity affect group efficacy? The intervening role of cohesion and task interdependence. Small Group Research, 32(4), 426–450.
Schippers, M. C., Den Hartog, D. N., & Koopman, P. L. (2007). Reflexivity in teams: a measure and correlates. Applied Psychology, 56(2), 189–211.
Schmidt, F. L., & Hunter, J. E. (1977). Development of a general solution to the problem of validity generalization. Journal of Applied Psychology, 62(5), 529.
Shin, Y., & Choi, J. N. (2010). What makes a group of good citizens? The role of perceived group-level fit and critical psychological states in organizational teams. Journal of Occupational and Organizational Psychology, 83(2), 531–552.
Shivers-Blackwell, S. L. (2004). Using role theory to examine determinants of transformational and transactional leader behavior. Journal of Leadership & Organizational Studies, 10(3), 41–50.
Siebold, G. L. (1999). The evolution of the measurement of cohesion. Military Psychology, 11(1), 5.
Singley, M. K., Fairweather, P. G. & Swerling, S. (1999). Team tutoring systems: Reifying roles in problem solving. Proceedings of Computer-Support for Collaborative Learning (CSCL’99), Stanford, California, 538-548.
Smith-Jentsch, K. A., Zeisig, R. L., Acton, B., & McPherson, J. A. (1998). Team dimensional training: A strategy for guided team self-correction. In J. A. Cannon-Bowers & E. Salas (Eds.), Making decisions under stress: Implications for individual and team training (pp. 271–297). Washington, DC: American Psychological Association.
Smith-Jentsch, K. A., Cannon-Bowers, J. A., Tannenbaum, S. I., & Salas, E. (2008). Guided team self-correction: Impacts on team mental models, processes, and effectiveness. Small Group Research, 39(3), 303–327.
Solansky, S. T. (2011). Team identification: a determining factor of performance. Journal of Managerial Psychology, 26(3), 235–246.
Soller, A. (2001). Supporting social interaction in an intelligent collaborative learning system. International Journal of Artificial Intelligence in Education (IJAIED), 12, 40–62.
Sottilare, R. (2012). Considerations in the development of an ontology for a generalized intelligent framework for tutoring. International Defense & Homeland Security Simulation Workshop in Proceedings of the I3M Conference. Vienna, Austria, September 2012.
Sottilare, R. A., & Holden, H. K. (2013). Motivations for a generalized intelligent framework for tutoring (GIFT) for authoring, instruction and analysis. In Proceedings of the 1st Generalized Intelligent Framework for Tutoring (GIFT) Users Symposium at the Artificial Intelligence in Education (AIED) Conference 2013, Memphis, Tennessee, June 2013.
Sottilare, R. A., & Proctor, M. D. (2012). Passively classifying student mood and performance within intelligent tutors. Educational Technology & Society, 15(2), 101–114.
Sottilare, R. A., Holden, H. K., Brawner, K. W., & Goldberg, B. S. (2011). Challenges and emerging concepts in the development of adaptive, computer-based tutoring systems for team training. Interservice/Industry Training Systems & Education Conference, Orlando, Florida, December 2011.
Sottilare, R.A., Brawner, K.W., Goldberg, B.S. & Holden, H.K. (2012). The generalized intelligent framework for tutoring (GIFT). Concept paper released as part of GIFT software documentation. Orlando, FL: U.S. Army Research Laboratory – Human Research & Engineering Directorate (ARL-HRED). Retrieved from: https://gifttutoring.org/attachments/152/GIFTDescription_0.pdf
Sottilare R., Ragusa C., Hoffman, M. & Goldberg B. (2013). Characterizing an adaptive tutoring learning effect chain for individual and team tutoring. In Proceedings of the Interservice/Industry Training Simulation and Education Conference; 2013 Dec; Orlando, FL.
Steenbergen-Hu, S., & Cooper, H. (2013). A meta-analysis of the effectiveness of intelligent tutoring systems on K–12 students’ mathematical learning. Journal of Educational Psychology, 105(4), 970–987. doi:10.1037/a0032447.
Steenbergen-Hu, S., & Cooper, H. (2014). A meta-analysis of the effectiveness of intelligent tutoring systems on college students’ academic learning. Journal of Educational Psychology, 106(2), 331–347. doi:10.1037/a0034752.
Stevens, R., Berka, C., & Sprang, M. (2009a, October). Neurophysiologic collaboration patterns during team problem solving. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting (Vol. 53, No. 12, pp. 804–808). Thousand Oaks: SAGE Publications.
Stevens, R. H., Galloway, T., Berka, C., & Sprang, M. (2009b, July). Can neurophysiologic synchronies provide a platform for adapting team performance? In International Conference on Foundations of Augmented Cognition (pp. 658–667). Berlin/Heidelberg: Springer.
Stevens, R., Gorman, J. C., Amazeen, P., Likens, A., & Galloway, T. (2013). The organizational neurodynamics of teams. Nonlinear Dynamics, Psychology, and Life Sciences, 17(1), 67–86.
Tasa, K., Taggar, S., & Seijts, G. H. (2007). The development of collective efficacy in teams: a multilevel and longitudinal perspective. Journal of Applied Psychology, 92(1), 17.
Tedesco, P. A. (2003). MArCo: building an artificial conflict mediator to support group planning interactions. International Journal of Artificial Intelligence in Education, 13(1), 117–155.
Tegos, S., Demetriadis, S., & Tsiatsos, T. (2014). A configurable conversational agent to trigger students’ productive dialogue: a pilot study in the CALL domain. International Journal of Artificial Intelligence in Education, 24(1), 62–91.
Titze, I. R. (1989). Physiologic and acoustic differences between male and female voices. The Journal of the Acoustical Society of America, 85(4), 1699–1707.
Tung, H. L., & Chang, Y. H. (2011). Effects of empowering leadership on performance in management team: Mediating effects of knowledge sharing and team cohesion. Journal of Chinese Human Resources Management, 2(1), 43–60.
Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: heuristics and biases. Science, 185(4157), 1124–1131.
U.S. Army Training and Doctrine Command (TRADOC). (2011). TRADOC Pamphlet 525-8-2: The U.S. Army learning concept for 2015. Washington, DC: Department of the Army.
Van Berlo, M. P. (1997). Team training vs. team building and cooperative learning: Defining the field of research (Team training vs. team building en cooperatief leren: Afbakening van het onderzoeksterrein) (No. TNO-TM-97-B019). Soesterberg, The Netherlands: TNO Human Factors Research Institute.
Vandenberg, R. J., & Lance, C. E. (2000). A review and synthesis of the measurement invariance literature: suggestions, practices, and recommendations for organizational research. Organizational Research Methods, 3(1), 4–70.
VanLehn, K. (2011). The relative effectiveness of human tutoring, intelligent tutoring systems, and other tutoring systems. Educational Psychologist, 46(4), 197–221.
Viswesvaran, C., & Ones, D. S. (1995). Theory testing: combining psychometric meta-analysis and structural equations modeling. Personnel Psychology, 48(4), 865–885.
Vygotsky, L. S., & Rieber, R. W. (1987). The collected works of LS Vygotsky: Volume 1: Problems of general psychology, including the volume Thinking and Speech (Vol. 1). Berlin/Heidelberg: Springer Science and Business Media.
Walker, E., & Ogan, A. (2016). We’re in this together: Intentional Design of Social Relationships with AIED systems. International Journal of Artificial Intelligence in Education, 26(2), 713–729.
Walker, E., Rummel, N., & Koedinger, K. R. (2014). Adaptive intelligent support to improve peer tutoring in algebra. International Journal of Artificial Intelligence in Education, 24(1), 33–61.
Walther, J. B., & Bunz, U. (2005). The rules of virtual groups: trust, liking, and performance in computer-mediated communication. Journal of Communication, 55(4), 828–846.
Walton, J., Dorneich, M. C., Gilbert, S., Bonner, D., Winer, E., & Ray, C. (2014). Modality and timing of team feedback: Implications for GIFT. In Proceedings of the Generalized Intelligent Framework for Tutoring (GIFT) Users Symposium (GIFTSym2) (pp. 199–207).
Watson, W. E., Michaelsen, L. K., & Sharp, W. (1991). Member competence, group interaction, and group decision making: a longitudinal study. Journal of Applied Psychology, 76(6), 803.
Wiese, C. W., Shuffler, M. L., & Salas, E. (2015). Teamwork and team performance measurement. In J. Wright (Ed.), International encyclopedia of the social & behavioral sciences (pp. 96–103). Oxford: Pergamon.
Woehr, D. J., Arciniega, L. M., & Poling, T. L. (2013). Exploring the effects of value diversity on team effectiveness. Journal of Business and Psychology, 28(1), 107–121.
Wong, S. S. (2004). Distal and local group learning: performance trade-offs and tensions. Organization Science, 15(6), 645–656.
Wong, A., Tjosvold, D., & Liu, C. (2009). Cross-functional team organizational citizenship behavior in China: shared vision and goal interdependence among departments. Journal of Applied Social Psychology, 39(12), 2879–2909.
Yoo, J., & Kim, J. (2014). Can online discussion participation predict group project performance? Investigating the roles of linguistic features and participation patterns. International Journal of Artificial Intelligence in Education, 24(1), 8–32. doi:10.1007/s40593-013-0010-8.
Zaccaro, S. J. (1991). Nonequivalent associations between forms of cohesiveness and group-related outcomes: evidence for multidimensionality. The Journal of Social Psychology, 131(3), 387–399.
Zachary, W., Cannon-Bowers, J. A., Bilazarian, P., Krecker, D. K., Lardieri, P. J., & Burns, J. (1999). The advanced embedded training system (AETS): an intelligent embedded tutoring system for tactical team training. International Journal of Artificial Intelligence in Education (IJAIED), 10, 257–277.
Acknowledgements
The research described herein has been sponsored by the U.S. Army Research Laboratory. The statements and opinions expressed in this article do not necessarily reflect the position or the policy of the United States Government, and no official endorsement should be inferred.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.