Introduction

It is increasingly recognised that mental health is more than simply the presence or absence of symptoms. Researchers in the positive psychology tradition have long argued for a move away from traditional disease-based accounts (Seligman & Csikszentmihalyi, 2000), placing greater emphasis on subjective well-being, which refers to life satisfaction and overall levels of positive affect (Diener, 1984). Consideration of subjective well-being alongside symptoms helps to develop a more complete understanding of mental health (Antaramian, Huebner, Hills, & Valois, 2010; Keyes, 2013). Furthermore, it allows the investigation of those who are yet to show symptoms and those that are flourishing. This is particularly important when investigating mental health in children in the general population (Huebner & Hills, 2011). With the recent focus on preventing mental health problems in schoolchildren in the United Kingdom (UK; Department of Health and Department for Education, 2015, 2017) and more widely (World Health Organisation, 2012) a more complete and nuanced understanding of mental health in middle childhood is required.

The Dual-Factor Model of Mental Health

The dual-factor model suggests that subjective well-being and symptoms of distress form distinct aspects of mental health (Greenspoon & Saklofske, 2001). Therefore, at least four mental health ‘types’ are possible: (1) low symptoms and high well-being (i.e. ‘complete mental health’), (2) low symptoms and low well-being (i.e. ‘vulnerable’), (3) high symptoms and high well-being (i.e. ‘symptomatic but content’), and (4) high symptoms and low well-being (i.e. ‘troubled’) (Suldo & Shaffer, 2008; Suldo, Thalji, & Ferron, 2011; Suldo, Thalji-Raitano, Kiefer, & Ferron, 2016; Thalji, 2012). If mental health symptoms are investigated alone, both the ‘vulnerable’ and ‘symptomatic but content’ groups cannot be identified. Furthermore, membership of the different mental health groups has been shown to predict developmental outcomes in young people. Those who are ‘symptomatic but content’ show better academic outcomes, have higher perceived social support, and better physical health, compared to ‘troubled’ youths, even though both groups show symptoms of distress. In addition, those that are ‘vulnerable’ have poorer social relationships and worse academic outcomes compared to those with ‘complete mental health’, despite both having few symptoms (Suldo et al., 2011, 2016). This suggests that mental distress and subjective well-being are distinct continuums, and that the combination of distress and subjective well-being is important for young people’s development.

Traditional Methods of Studying the Dual-Factor Model

Studies investigating the dual-factor model of mental health have typically identified subgroups by using clinical or sample-based cut-offs to dichotomise mental distress and well-being scores as high or low, and then cross-tabulating those to form the four groups noted above (e.g. Antaramian et al., 2010; Lyons, Huebner, Hills, & Shinkareva, 2012; Suldo & Shaffer, 2008). That method can indicate a dissociation between mental distress and well-being, but it has significant limitations. First, the use of imposed cut-off points means there may be little differentiation between groups: individuals with similar scores that fall either side of the cut-off would be placed in different groups. Further, there could be substantial variability within groups: a large range of scores may be included within a particular group (Baillargeon, Tremblay, & Willms, 1999; Dowdy & Kamphaus, 2007; Kim, Dowdy, Furlong, & You, 2017). That is particularly problematic when looking at relations between group membership and other factors: misclassification can lead to attenuated or spurious associations with other variables (MacCallum, Zhang, Preacher, & Rucker, 2002). Second, traditional dual-factor studies start from the assumption that there are four key groups of interest (e.g. Suldo & Shaffer, 2008), which precludes the idea that symptom domain (internalising or externalising) could be important in distinguishing individuals, or that intermediary levels of difficulties or strengths may be pertinent. Indeed, it has recently been argued that a greater range of mental health and well-being subtypes exist (Dowdy et al., 2015; Kim et al., 2017), suggesting that a less deductive approach is needed.

Applying Latent Class Analysis to Investigate the Dual-Factor Model

An alternative, inductive method for identifying mental health subtypes is latent class analysis (LCA). LCA is a data-driven statistical method which can be used to categorise individuals with similar patterns of behaviour (referred to as latent classes; Collins & Lanza, 2010). Classes are derived so that each is internally homogenous and externally heterogeneous, overcoming some of the aforementioned difficulties with differentiation and misclassification.Footnote 1 To date, those researching child mental health have typically used LCA to analyse patterns of symptoms, rather than taking a more comprehensive view that incorporates subjective well-being (Petersen, Qualter, & Humphrey, 2019). Yet, in the adolescent mental health literature, a small number of studies have done so, yielding important findings. For example, Rose, Lindsey, Xiao, Finigan-Carr, and Joe (2017) identified four classes analogous to those found in traditional dual-factor studies in youth aged 13–17 years, and Moore, Dowdy, Nylund-Gibson, and Furlong (2019a, b) identified ‘complete mental health’, ‘moderately mentally healthy’, ‘symptomatic but content’, and ‘troubled’ classes, in adolescents in United States (US) grades 9–12, suggesting that common mental health types in adolescents align with categories predicted by the dual-factor model of mental health.

The Dual-Factor Model in Middle Childhood

To date, dual-factor studies that use latent class analysis have only investigated adolescent and adult mental health. In addition, traditional dual-factor studies have tended to focus on adolescent or adult populations (e.g. Antaramian et al., 2010; Lyons et al., 2012). There are a few exceptions, with evidence for the dual-factor model in children in school grades 3–6 in Canada (M age = 10.5 years, SD = 0.7; Greenspoon & Saklofske, 2001) and with children in early elementary school (US kindergarten to 2nd grade; Compton, 2016). More detailed investigation of the dual-factor model of mental health in middle childhood is necessary because mental health difficulties often emerge before adolescence, and elementary schools are increasingly seen as key institutions for promoting good mental health and preventing mental illness (Department of Health and Department for Education, 2017). The dual-factor model of mental health has important implications for addressing mental health in schools. By focussing on subjective well-being, as well as symptoms, children that may be vulnerable to mental health problems (i.e. low symptoms and low subjective well-being) can be identified in addition to those that are already symptomatic (Suldo & Shaffer, 2008). Therefore, schools would be able to identify those in need of targeted mental health treatment and those that require a less intensive approach to prevent mental health problems developing. That would enable schools to allocate scarce resources efficiently and adopt proactive, as well as reactive, strategies for improving pupils’ mental health. Therefore, research is needed to investigate whether mental health classes, similar to those predicted by the dual factor model of mental health, are found in middle childhood, how prevalent they are, and what factors influence membership of those mental health classes.

The Present Study

The present study aimed to address this gap in the literature by using LCA to investigate mental health classes in schoolchildren aged 8–9 years. A range of emotional difficulties, conduct problems, and subjective well-being items were included as mental health indicators so that a variety of mental health classes were possible, including those differentiated by type and level of distress or well-being.

In addition, the current study investigated whether the mental health classes were reliably found in different datasets and across genders. Reliability was explored by determining whether the same classes were replicated in different subsamples. Classes were expected to replicate in different datasets of similar children; however, the extent to which mental health classes would be replicated across genders could not be predicted based on previous literature. Studies that previously explored structural differences in mental health classes for males and females only investigated mental health symptoms and found mixed results, with some suggesting that classes were different (Althoff et al., 2006; Vendlinski et al., 2014), and others suggesting that they were equivalent (Baillargeon et al., 1999; Hudziak, Wadsworth, Heath, & Achenbach, 1999; Wadsworth, Hudziak, Heath, & Achenbach, 2001). Structural differences can affect the interpretation of results (Collins & Lanza, 2010), so we investigated whether males and females had structurally different mental health classes, prior to investigating whether the prevalence of males and females in each class differed.

We also investigated the external validity of identified mental health classes by exploring whether they had distinct and expected relations with other factors. Family poverty, neighbourhood disadvantage, and special educational needs and disabilities (SEND) status have been linked to poor mental health and well-being (e.g. Pryor, Strandberg-Larsen, Nybo Andersen, Hulvej Rod, & Melchior, 2019; Vedi & Bernard, 2012). Therefore, we predicted that there would be stronger relations between those factors and classes indicating poor mental health. Conversely, high levels of school connectedness and social support are associated with lower levels of mental health symptoms and higher well-being (Monahan, Oesterle, & Hawkins, 2010; Patalay & Fitzsimons, 2016), so we predicted those variables would be associated with classes indicating good mental health. Furthermore, research suggests that prior academic attainment predicts later mental health (Deighton et al., 2018; Panayiotou & Humphrey, 2018), and so positive relations were predicted between prior academic attainment and membership of good mental health classes.

Method

Participants and Procedure

This study was a secondary analysis of data collected for two randomised control trials (RCTs) of universal school-based interventions in the UK. Further information on dataset 1 (D1) can be found in Humphrey et al. (2018a) and further information on dataset 2 (D2) can be found in Humphrey et al. (2018b). Only data from children in the control arm of each trial were used because the interventions’ logic models predicted changes in mental health. In addition, only data collected from children aged 8–9 years were analysed so that participants were comparable in age. Sample characteristics are presented in Table 1.

Table 1 Sample characteristics

Measures

Mental Health Indicators

Mental health difficulties were assessed using individual items from the emotional symptoms and conduct problems subscales of the English (UK) version of Strengths and Difficulties Questionnaire (SDQ) teacher-report version (Goodman, 1997). The overall scale has been found to have strong psychometric properties with 4–12 year olds (Stone, Otten, Engels, Vermulst, & Janssens, 2010). The conduct problems items were (1) Often has temper tantrums or hot tempers, (2) Generally obedient, usually does what adults request (reverse scored), (3) Often fights with other children or bullies them, and (4) Often lies or cheats’.Footnote 2 The emotional symptoms items were (1) Often complains of headaches, stomach-aches or sickness, (2) Many worries, often seems worried, (3) Often unhappy, down-hearted or tearful, (4) Nervous or clingy in new situations, easily loses confidence and 5) Many fears, easily scared.

The SDQ asks teachers to rate whether the statements are ‘not true’, ‘somewhat true’, or ‘certainly true’. For the LCA, responses were dichotomised so that 0 = ‘not true’ and 1 = ‘somewhat’ or ‘certainly true’. Although some information about symptom severity was lost, dichotomisation reduced model complexity, which was necessary for the model to be able to run and converge on a final solution in Mplus. Similar studies have followed this procedure (e.g. Wadsworth et al., 2001).

Subjective well-being was measured using the positively worded items from psychological well-being scale of the self-report version of the KIDSCREEN-27, which has been validated with this age group (Ravens-Sieberer et al., 2007).Footnote 3 The items included in the LCA were ‘In the last week…’ (1) Has your life been enjoyable? (2) Have you been in a good mood?, (3) Have you had fun?, and (4) Have you been happy with the way you are?. Children responded on a 5-point Likert scale from, 1 = ‘not at all’ or ‘never’, to 5 = ‘extremely’ or ‘always. Treating responses as categorical resulted in a complex model, leading to model estimation difficulties. Unlike the SDQ items, the distribution of responses did not indicate a clear point for dichotomisation and there was no precedent for dichotomising those scores. Further, the items were rated on a 5-point Likert scale, which, arguably, can be treated as continuous data (Rhemtulla, Brosseau-Liard & Savalei, 2012). Therefore, those items were treated as continuous.

Covariates

Several covariates were included as predictors of latent class membership.

Sociodemographic information was obtained through the UK National Pupil Database, (see, Jay, Mc Grath-Lone, & Gilbert, 2019). Sociodemographic data were: gender, eligibility for free school meals (FSM), SEND status, and Income Deprivation Affecting Children Index (IDACI) score. The IDACI measured relative levels of deprivation in small areas of England called Lower-layer Super Output Areas (LSOAs). Scores range from 0 to 1 which can be interpreted as the proportion of families in an LSOA with children aged under 16 years, who are income deprived.

Peer support and school connectedness were assessed using ‘social and peer support’ and ‘school environment’ self-report subscales from the KIDSCREEN-27 (Ravens-Sieberer et al., 2007). Both subscales have four items and responses on a 5-point Likert scale. Total scores ranged from 4 to 20, with high scores indicating greater perceived social support or school connectedness. Both subscales have good internal reliability (Cronbach’s alpha for both scales = 0.81; Robitail et al. 2007), criterion validity (correlations with scales from the KIDSCREEN-52 item questionnaire of r = 0.96), and good test–retest reliability over a 2-week period (ICC = 0.61 and 0.74, respectively; Ravens-Sieberer et al., 2007).

Prior academic attainment was measured through Key Stage 1 national assessment scores in Maths and English, obtained from the UK National Pupil Database. These are statutory teacher assessments of children aged 6–7 years. Scores are based on government issued assessment frameworks (Standards and Testing Agency, 2018). Scores range from 3 to 30.

Data Analysis

Analyses were conducted in Mplus v8.2, using maximum likelihood estimation and an expectation–maximisation (EM) algorithm (Muthén & Muthén, 1998–2017). The complex structure of the data, due to clustering by school, was accounted for using a sandwich estimator.

The number of mental health classes in the population was previously unknown so exploratory LCA with both binary and continuous indicators was conducted: we fit an unrestricted latent class model with 1-class, and then with an increasing number of classes, until convergence issues were encountered. To prevent convergence on local maxima, the analyses were run with 1000 start values with 200 optimisations and 50 iterations.

For each model, Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and sample size adjusted Bayesian Information Criteria (ssaBIC) were recorded. Lower values on information criteria indicated better model fit (Hagenaars & McCutcheon, 2002). The Lo–Mendell–Rubin Likelihood Ratio Test (LMR-LRT) statistic was also recorded. Significant LMR-LRT results indicated that the model with K-classes was a significant improvement over the model with K-1 classes (Masyn, 2013). Monte Carlo simulations have indicated that when a mixture of categorical and continuous variables is included in LCA, those fit indices perform well. ssaBIC and BIC generally performed best with large samples (> 1200), although, when rare classes are identified, AIC performed better (Morgan, 2015).

Classification accuracy for each model was recorded. This was not used for class enumeration purposes because it is not a fit statistic; instead, it provided information about the quality of classification provided by each model (Masyn, 2013). Entropy values and average latent class probabilities close to 1 indicated precise classification; values above 0.8 were considered good (Clark & Muthén, 2009).

Fit statistics were used to reduce the total number of models to a smaller number of candidate models. Thereafter, a number of substantive criteria were considered. First, item response and associated plots were examined to assess whether classes were homogeneous, (i.e. whether members of the same class provided the similar responses). They were also examined to assess whether there was good class separation (i.e. each class was distinct from the others). Item response probabilities close to 0 or 1 indicated good class homogeneity, whereas good latent class separation was indicated if there was a unique and characteristic response pattern associated with each class (Collins & Lanza, 2010). Second, the interpretability of classes was also assessed; specifically, whether we could interpret classes in line with theory and research. Third, models that identified classes containing less than 5% of the sample were disregarded because small classes can be unstable (Masyn, 2013) and would have too few members to enable later latent class regression analyses. Parsimony was a final consideration.

Missing data on the LCA indicators were accounted for by Full Information Maximum Likelihood (FIML; Enders, 2010).

Replicating the Latent Classes Across Datasets and Genders

Initially, the overall dataset was split into four subsamples, by gender and dataset: D1 males (n = 727), D1 females (n = 604), D2 males (n = 1061) and D2 females (n = 948).Footnote 4 Separate latent class analyses were conducted for each subgroup to determine whether the same number and type of classes were identified. Multiple-group LCA (Collins & Lanza, 2010) was used to test whether the classes were structurally equivalent in each subgroup. BIC, AIC and Chi square difference test results were recorded for the following models: (1) an unrestricted model, (2) a model assuming structural equivalence, and (3) a model assuming structural equivalence and equivalence of class prevalences. Smaller fit indices for the restricted model and non-significant Chi square difference tests indicated model equivalence.

Combining Datasets

Combining data from the subsamples of data confers the advantage of having a larger sample size, which increases statistical power. That is particularly advantageous in the final step of the analysis, which incorporates covariates into the model. Participants in each subsample were from the same age group and from a similar geographic area, indicating suitability for combining data. However, because subsamples had some different characteristics (see online material 1), we only considered it appropriate to combine data from the different subsamples when the structural equivalence of mental health classes was indicated. If structural equivalence of classes could not be assumed, data would not be combined and separate regression analyses would be carried out for each subgroup. However, if structural equivalence of classes could be assumed across subsamples (either gender, dataset, or both), data would be combined to increase the sample size for the latent class regression analysis. In this case, the class enumeration process was repeated with the combined dataset, confirming the number of classes in the best fitting model, since sample size can influence this process (Nylund, Asparouhov, & Muthén, 2007).

Adding Predictor Variables

Predictor variables were incorporated into the final latent class model using the 3-step approach (Asparouhov & Muthén, 2014), which preserved the classes identified in the class enumeration stage of the analysis, while accounting for measurement error. The predictors were then regressed onto the classes. The 3-step method is the most appropriate for examining the relations between classes and covariates (Asparouhov & Muthén, 2018). Cases with missing data on covariates were deleted list-wise by default in Mplus.

Results

Multiple-Group Latent Class Analyses

Separate analyses for each subsample, based on gender and dataset, indicated a 3- or 4-class model best fit the data (see Online Resource 2). Examination of responses indicated that both the 3- and 4-class models were structurally comparable across subsamples (Online Resource 3). Further, it was our opinion that any difference in responses between subgroups did not alter the substantive meaning of the class.

In the multiple-group analysis, BIC indicated that restricted models showed better model fit than the unrestricted model, suggesting classes were structurally equivalent across samples. Conversely, AIC indicated that the unrestricted model fit the data best. The discrepancy likely reflects that AIC does not penalise model complexity as heavily as BIC. The Chi square difference test indicated that class structures were different across subgroups (Online Resource 4). Despite not passing the very strictest tests of class equivalence between groups, we considered there to be enough evidence to assume equivalence of the 3- and 4-class models across groups. As such, the data were pooled for the next stage of analysis.

Final Latent Class Model Using Combined Data

The LCA class enumeration process was repeated for the final combined dataset. The information criteria and LMR-LRT indicated that the 4-class model best fit the data. Entropy was good (0.82) and classification for each class was accurate (average class probabilities for most likely class were all over 0.8; see Table 2 and Fig. 1). In addition, class separation was good: each mental health class had clearly distinguishable characteristics (see Fig. 2). Each class was interpretable in line with existing mental health frameworks, specifically the dual-factor model (Greenspoon & Saklofske, 2001) and dimensional models of internalising and externalising symptoms (Achenbach & Edelbrock, 1978). No class contained less than 5% of the sample. The final four classes were as follows: (1) complete mental health (n = 1895, 57%), characterised by very low probability of symptom endorsement and high subjective well-being scores; (2) vulnerable (n = 434, 13%), characterised by slightly elevated probability of endorsing mental health symptoms and below average subjective well-being; (3) conduct problems but content (n = 404, 12%), characterised by high probability of endorsing conduct problems items, moderate probability of endorsing emotional symptoms, but high subjective well-being; and (4) emotional symptoms but content (n = 606, 18%), characterised by low probability of endorsing conduct problems, high probability of endorsing emotional symptoms, and high subjective well-being (see Table 3).

Table 2 Fit statistics, classification indices, and class sizes for each model
Fig. 1
figure 1

Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC) and sample size adjusted Bayesian Information Criteria (ssaBIC) values for each class model. Lower values indicate better model fit

Fig. 2
figure 2

Plot of a item response probabilities for endorsing conduct problems and emotional symptoms, and b mean scores for subjective well-being items, for each class

Table 3 Final 4-class model, average responses per class

Testing the External Validity of Classes in the Final Model

The LCA covariate analysis tested the external validity of the final class solution by investigating the relations between classes and other factors that influence mental health. 351 (10.5%) cases with missing data on one or more of the covariates were deleted list-wise, leaving 2,989 children in the latent class regression analysis. Results showed that classes were distinct in their relation with predictor variables (see Table 4).

Table 4 Odds ratios and 95% confidence intervals showing the relationship between class and predictor variables

Compared to the complete mental health class, those in the emotional symptoms but content class were significantly more likely to be female, to have SEND, to live in more affluent neighbourhoods (indicated by lower IDACI scores), and to have lower prior attainment (maths scores). Those in the conduct problems but content class were significantly more likely to be male, have SEND, and report lower levels of school connectedness. Those in the vulnerable class were significantly more likely to experience familial poverty (indicated by higher uptake of FSM) and report lower levels of both social support and school connectedness.

Comparison of the two symptomatic but content groups indicated that those with high conduct problems were significantly more likely to be male and have lower levels of school connectedness. The vulnerable group differs from both symptomatic but content groups: they reported lower levels of social support and school connectedness, and were less likely to have SEND than those in the conduct problems but content group.

Discussion

This study used LCA to identify common patterns of mental health symptoms and subjective well-being in schoolchildren aged 8–9 years. The analysis indicated that children could be categorised into four common classes, defined by a unique pattern of symptoms and subjective well-being. The identified latent classes were (1) complete mental health (57% of sample), (2) vulnerable (13%), (3) conduct problems but content (12%), and (4) emotional symptoms but content (18%). Those findings provide support for the dual-factor model of mental health, suggesting that symptoms of distress and subjective well-being are distinct constructs rather than opposite ends of a single mental health continuum (Greenspoon & Saklofske, 2001). Children in the vulnerable class, and both symptomatic but content classes, showed patterns of symptoms and subjective well-being that would not be possible on such a continuum. The classes were reliably found across datasets and gender. In addition, latent class regression analysis indicated that classes were distinct and externally valid, as evidenced by their differential associations with sociodemographic and other factors.

It is notable that LCA, a data-driven, inductive approach, produced three of the classes logically indicated by the dual-factor model: complete mental health, vulnerable, and symptomatic but content (Antaramian et al., 2010; Suldo et al., 2011, 2016; Thalji, 2012). Further, the large complete mental health class and smaller vulnerable class were comparable in size to that found in other dual-factor studies (e.g. Antaramian et al., 2010; Compton, 2016; Kelly, Hills, Huebner, & McQuillin, 2012; Rose et al., 2017; Suldo & Shaffer, 2008; Suldo et al., 2011; Thalji, 2012).

A key difference between our findings and that of other dual-factor studies is that we identified two symptomatic but content groups rather than one, distinguished by the symptom domain. Further, the classes were found to have different sociodemographic and psychosocial predictors (i.e. gender and school connectedness), which attests to their distinctiveness and, perhaps, different aetiologies. Other research has suggested that children experience qualitative differences in mental health symptoms (Basten et al., 2013, 2016; McElroy, Shevlin, & Murphy, 2017), yet many traditional dual-factor studies have not differentiated symptoms (e.g. Suldo & Shaffer, 2008). Two studies that investigated internalising and externalising symptoms separately, along with levels of well-being (co-vitality) in adolescents (Kim et al., 2017; Rebelez Ernst, 2016), found that groups showed distinct relations with covariates, supporting our findings that symptom domain is an important component of mental health types.

Recent latent class, dual-factor research with adolescents identified a single symptomatic but content class, as opposed to the two identified presently (Moore et al., 2019a, b; Rose et al., 2017). Other LCA studies may not have identified two ‘symptomatic but content classes’ because they had smaller sample sizes (ranging from c.291-1170 participants), which made it difficult to identify small classes. Indeed, in our initial subsample analyses, information criteria generally favoured the three-class model, which only had one symptomatic but content group. When the larger combined dataset was used, the four-class model with two symptomatic but content classes was clearly indicated. Further research with large samples is therefore needed to investigate whether separate internalising and externalising symptomatic but content classes are replicated in older age groups.

The final subgroup predicted by the dual-factor model is the ‘troubled’ group, characterised by high symptoms and low subjective well-being. Previous studies have identified this group in adolescence (e.g. Antaramian et al., 2010; Moore et al., 2019a, b; Suldo & Shaffer, 2008) and childhood (Compton, 2016; Greenspoon & Saklofske, 2001). However, such a group was not identified through our analysis. Developmental research indicates that subjective well-being tends to remain high during the elementary school years and then declines around 10 years old (Casas & Gonzalez-Carrasco, 2019). Further, emotional symptoms and conduct problems are relatively uncommon in childhood and increase with age (Polanczyk, Salum, Sugaya, Caye, & Rohde, 2015). Owing to the low base-rate of children with low subjective well-being, in addition to a low base-rate of children with mental health difficulties, the numbers of children aged 8–9 years with simultaneously high symptoms and low subjective well-being is likely to be low. In our analyses we rejected latent class models which had classes that represented less than 5% of the sample because such groups can be unstable (Masyn, 2013). Therefore, the lack of a troubled group may be attributable to the age of the sample and statistical decisions made during the model selection process.

A strength of our study is that we replicated the LCA in four different subsamples. Mixture modelling approaches, like LCA, have been criticised for producing results that are sample specific (Bauer & Curran, 2004). Exploratory LCA, in particular, may be susceptible to this because it is a data-driven approach. The final four-class model was most consistent across all four samples and was the best fitting model when the data were combined. However, further research is required to determine whether those results are replicated in other samples, including those in other countries.

Carrying out LCA for males and females separately, in the first instance, also allowed for detailed investigation of the mental health classes across genders. Classes were structurally analogous for males and females, which is consistent with research investigating classes of particular mental health symptoms, such as obsessions and compulsions (Althoff, Rettew, Boomsma, & Hudziak, 2009), anxiety and depression (Wadsworth et al., 2001), and aggression (Baillargeon et al., 1999). Although classes were structurally similar, the proportion of males and females in each class differed significantly. In particular, females were significantly more likely to be in the ‘emotional symptoms but content’ class and males in the ‘conduct problems but content’ class. This finding is in line with research that indicates males are more likely to display conduct problems, whereas females are more likely to display emotional symptoms (Keiley, Bates, Dodge, & Pettit, 2000).

Another strength of the study was the examination of class validity. External validity was indicated by classes showing expected relations with other factors. For example, mental health risk factors, such as SEND status, low prior Maths attainment and having low perceived peer support and school connectedness, were associated with less adaptive mental health classes. Associations were consistent with other research in this field (e.g. McLeod & Kaiser, 2004; Monahan et al., 2010; Patalay & Fitzsimons, 2016). The only unexpected result was the association between membership of the ‘emotional symptoms but content’ class and increased neighbourhood affluence. Neighbourhood poverty is generally considered a risk factor for mental health (Boyle et al., 2019). Because this class was associated with being female, having lower prior academic attainment, and being in a more affluent area, it is possible that symptom expression is rooted in a sense of failing to live up to the high expectations of parents and teachers. Future research should explore that further.

Beyond those general expected associations, the study findings provided a more nuanced account of how sociodemographic and psychosocial factors relate to more complex patterns of mental health. For example, family poverty, low social support, and low school connectedness were more strongly associated with membership of the vulnerable class than any other class. Those children have lower than average well-being and burgeoning symptoms, and research suggests they may be at risk of developing more severe symptoms later in life (Wang, Zhang, & Wang, 2011). Although our cross-sectional study cannot be used to infer causality, findings suggest that addressing the negative effects of poverty and helping youth to develop good peer relationships and school connectedness may benefit this vulnerable group. However, further research is necessary to ascertain the reliability of our initial findings.

Limitations

There are several limitations of the current study. First, mental health and psychosocial data were collected concurrently, so causal direction cannot be inferred. In addition, we do not know whether the classes have predictive validity and longitudinal research is needed.

Second, mental health symptoms were teacher-reported. This is in line with previous studies that investigated the dual factor model of mental health in children (i.e. Compton, 2016; Greenspoon & Saklofske, 2001) and reflects the fact that many measures of child mental health are not available for self-report in this age group. However, while teachers may be good reporters of conduct problems, they are less qualified to report on children’s internal states (De Los Reyes, 2015; Papandrea & Winefield, 2011). To test the reliability of our findings when children self-report symptoms, future research should consider an alternative, age-appropriate self-report measure, such as the Me and My School questionnaire (Deighton et al., 2013).

Third, we identified peer support and school connectedness as correlates with class membership. Future research will need to examine other forms of social support, different components of school connectedness, or mediator and moderator effects, to explore those relations more fully.

Fourth, we acknowledge that our assessment of subjective well-being was somewhat limited. Typically, subjective well-being is defined as overall positive affect and life satisfaction (Lucas, 2016). A separate measure of life satisfaction was not available in the datasets used for our analysis. Measurement shortcomings are a limitation of many studies that use secondary data because the specific research questions were not the immediate focus of the research team collecting the data. It is possible that incorporating a more extensive measure of life satisfaction into the analysis would lead to the identification of different mental health classes. Where possible, future research should incorporate a measure of life satisfaction to investigate whether results are altered. A related consideration is that subjective well-being can be argued to include more than the appraisal of affect and life satisfaction, encompassing other aspects, such as social well-being. Such a broad definition of subjective well-being was not the focus of the current study and discussion of such conceptual discrepancies are beyond the scope of this article (see Lima & Morais, 2018; Lucas, 2016, for further discussion on this issue). However, we note that encompassing more aspects of well-being into a similar latent class model may lead to the identification of mental health and well-being classes that are less focussed on symptomology, which may be of substantive interest.

A further limitation regards treatment of items used in the latent class analysis. In order to simplify the latent class model so that it would converge on a final solution, SDQ symptom responses, which were originally on a 3-point scale, were dichotomised and KIDSCREEN well-being items, which were on a 5-point scale, were treated as continuous. Therefore, some information from the original scale was lost. Further, KIDSCREEN items were negatively skewed, with high ceiling effects, which can be problematic when treating ordinal variables as continuous (Bauer & Curran, 2003). Further research will need to test the reliability of these classes with measures that are less prone to skew or using ordinal variables where the analysis allows.

Finally, the generalisability of the results is subject to certain limitations: participants were recruited from schools that volunteered to take part in RCTs of universal interventions and may not be representative of children in the wider UK population. In addition, the sample could be further biased through the handling of missing data. Missing data were handled using the default settings in Mplus because there is no strong consensus on a more effective way to handle missing data on covariates (Heron, Croudace, Barker, & Tilling, 2015). Cases with missing data on predictor variables were deleted list-wise; the remaining sample size was still very large (n = 2989) and well powered, although it is possible list-wise deletion introduced bias.

Implications

Our study demonstrated the importance of simultaneously measuring internalising and externalising symptoms and subjective well-being when investigating children’s mental health. Most child mental health research is variable centred, examining the relation between a single aspect of mental health and other factors. However, it appears that patterns of mental health symptoms and subjective well-being define groups of children. Failing to consider those patterns of mental health could lead to unreliable findings, and the complex interaction between different symptoms and well-being would be lost. Therefore, we recommend that research investigating child mental health incorporates methods, such as latent class analysis, which can consider a range of aspects of mental health, in order to supplement more traditional variable centred approaches.

Similarly, when screening children’s mental health in elementary schools, a full range of symptoms and indicators of subjective well-being should be considered. This would allow schools to identify both symptomatic and vulnerable children, so that interventions can be appropriately targeted. Different groups may require different approaches. For example, those with conduct problems and emotional symptoms may require specific interventions or referrals to other mental health services; those with low symptoms and low well-being may benefit from lower intensity, preventative interventions in school.

Further, elementary schools often implement universal intervention programs to improve the mental health and well-being of their pupils (Patalay et al., 2017). However, despite children receiving the same intervention, different outcomes might be expected for particular subgroups of children (Greenberg & Abenavoli, 2017). For example, those with complete mental health may show less improvement because they are already showing good mental health; symptomatic individuals may only show slight improvement if they require more intensive interventions; and vulnerable individuals may show the greatest improvement over time if the intervention is preventative. Therefore, a suggestion for researchers and educators that evaluate the impact of universal mental health programmes is to investigate the effect of the intervention for each group. That would give a more detailed account of the effect of those interventions and prevent the loss of information which would occur through aggregating scores across the whole sample.

Finally, the findings demonstrate that subjective well-being is a key component of child mental health. This suggests the importance of interventions that focus on improving well-being as well as reducing symptoms. To date, a number of school-based positive psychology interventions have been developed that are associated with increased subjective well-being amongst children (e.g. Quinlan, Swain, Cameron, & Vella-Brodrick, 2015; Suldo et al., 2015; Wingate, Suldo, & Peterson, 2018). Those interventions focus on the building blocks of subjective well-being, such as developing peer and student–teacher relationships, as well as using character strengths-based interventions to evoke positive feelings. Our findings regarding the relational correlates of the identified mental health classes in the current study provide foundational support for such work.