Natural Language Processing and Schizophrenia: A Scoping Review of Uses and Challenges
Abstract
1. Introduction
2. Materials and Methods
2.1. Search Strategies
2.2. Study Eligibility
2.3. Data Extraction
2.4. Quality Assessment
3. Results
3.1. Description of Identified Studies
3.2. Diagnostic and Predictive Modeling
3.3. Specific Linguistic Phenomena
3.4. Speech and Communication Analysis
3.5. Social Media and Online Content Analysis
3.6. Clinical and Cognitive Assessment
3.7. Linguistic Feature Analysis
4. Discussion
Limitations
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Faden, J.; Citrome, L. Schizophrenia: One Name, Many Different Manifestations. Med. Clin. N. Am. 2023, 107, 61–72.
- Kahn, R.S.; Sommer, I.E.; Murray, R.M.; Meyer-Lindenberg, A.; Weinberger, D.R.; Cannon, T.D.; O’Donovan, M.; Correll, C.U.; Kane, J.M.; van Os, J.; et al. Schizophrenia. Nat. Rev. Dis. Primers 2015, 1, 15067.
- Keefe, R.S.; Harvey, P.D. Cognitive impairment in schizophrenia. In Handbook of Experimental Pharmacology; Springer: Berlin/Heidelberg, Germany, 2012; pp. 11–37.
- Rolland, B.; Jardri, R.; Amad, A.; Thomas, P.; Cottencin, O.; Bordet, R. Pharmacology of hallucinations: Several mechanisms for one single symptom? Biomed. Res. Int. 2014, 2014, 307106.
- Kumar, S.; Soren, S.; Chaudhury, S. Hallucinations: Etiology and clinical implications. Ind. Psychiatry J. 2009, 18, 119–126.
- Kesby, J.P.; Eyles, D.W.; McGrath, J.J.; Scott, J.G. Dopamine, psychosis and schizophrenia: The widening gap between basic and clinical neuroscience. Transl. Psychiatry 2018, 8, 30.
- Meyer, L.; Lakatos, P.; He, Y. Language Dysfunction in Schizophrenia: Assessing Neural Tracking to Characterize the Underlying Disorder(s)? Front. Neurosci. 2021, 15, 640502.
- De Boer, J.N.; Van Hoogdalem, M.; Mandl, R.C.W.; Brummelman, J.; Voppel, A.E.; Begemann, M.J.H.; Van Dellen, E.; Wijnen, F.N.K.; Sommer, I.E.C. Language in schizophrenia: Relation with diagnosis, symptomatology and white matter tracts. NPJ Schizophr. 2020, 6, 10.
- McCutcheon, R.A.; Keefe, R.S.E.; McGuire, P.K. Cognitive impairment in schizophrenia: Aetiology, pathophysiology, and treatment. Mol. Psychiatry 2023, 28, 1902–1918, Erratum in Mol. Psychiatry 2023, 28, 1919.
- Gejman, P.V.; Sanders, A.R.; Duan, J. The role of genetics in the etiology of schizophrenia. Psychiatr. Clin. N. Am. 2010, 33, 35–66.
- McCutcheon, R.A.; Krystal, J.H.; Howes, O.D. Dopamine and glutamate in schizophrenia: Biology, symptoms and treatment. World Psychiatry 2020, 19, 15–33.
- Karlsgodt, K.H.; Sun, D.; Cannon, T.D. Structural and Functional Brain Abnormalities in Schizophrenia. Curr. Dir. Psychol. Sci. 2010, 19, 226–231.
- Guo, X.; Li, J.; Wang, J.; Fan, X.; Hu, M.; Shen, Y.; Chen, H.; Zhao, J. Hippocampal and orbital inferior frontal gray matter volume abnormalities and cognitive deficit in treatment-naive, first-episode patients with schizophrenia. Schizophr. Res. 2014, 152, 339–343.
- Fatemi, S.H.; Folsom, T.D. The neurodevelopmental hypothesis of schizophrenia, revisited. Schizophr. Bull. 2009, 35, 528–548.
- Robinson, N.; Bergen, S.E. Environmental Risk Factors for Schizophrenia and Bipolar Disorder and Their Relationship to Genetic Risk: Current Knowledge and Future Directions. Front. Genet. 2021, 12, 686666.
- Lipner, E.; O’Brien, K.J.; Pike, M.R.; Ered, A.; Ellman, L.M. Environmental Risk Factors and Cognitive Outcomes in Psychosis: Pre-, Perinatal, and Early Life Adversity. Curr. Top. Behav. Neurosci. 2023, 63, 205–240.
- Ehlen, F.; Montag, C.; Leopold, K.; Heinz, A. Linguistic findings in persons with schizophrenia-a review of the current literature. Front. Psychol. 2023, 14, 1287706.
- Compton, M.T.; Ku, B.S.; Covington, M.A.; Metzger, C.; Hogoboom, A. Lexical Diversity and Other Linguistic Measures in Schizophrenia: Associations with Negative Symptoms and Neurocognitive Performance. J. Nerv. Ment. Dis. 2023, 211, 613–620.
- Ojeda, N.; Sánchez, P.; Peña, J.; Elizagárate, E.; Yoller, A.B.; Larumbe, J.; Gutiérrez, M.; Casais, L.; Ezcurra, J. Verbal fluency in schizophrenia: Does cognitive performance reflect the same underlying mechanisms in patients and healthy controls? J. Nerv. Ment. Dis. 2010, 198, 286–291.
- Hinzen, W.; Rosselló, J. The linguistics of schizophrenia: Thought disturbance as language pathology across positive symptoms. Front. Psychol. 2015, 6, 971.
- Parola, A.; Simonsen, A.; Lin, J.M.; Zhou, Y.; Wang, H.; Ubukata, S.; Koelkebeck, K.; Bliksted, V.; Fusaroli, R. Voice Patterns as Markers of Schizophrenia: Building a Cumulative Generalizable Approach Via a Cross-Linguistic and Meta-analysis Based Investigation. Schizophr. Bull. 2023, 49 (Suppl. 2), S125–S141.
- Corcoran, C.M.; Mittal, V.A.; Bearden, C.E.; Gur, R.E.; Hitczenko, K.; Bilgrami, Z.; Savic, A.; Cecchi, G.A.; Wolff, P. Language as a biomarker for psychosis: A natural language processing approach. Schizophr. Res. 2020, 226, 158–166.
- de Filippis, R.; Carbone, E.A.; Gaetano, R.; Bruni, A.; Pugliese, V.; Segura-Garcia, C.; De Fazio, P. Machine learning techniques in a structural and functional MRI diagnostic approach in schizophrenia: A systematic review. Neuropsychiatr. Dis. Treat. 2019, 15, 1605–1627.
- Gashkarimov, V.R.; Sultanova, R.I.; Efremov, I.S.; Asadullin, A.R. Machine learning techniques in diagnostics and prediction of the clinical features of schizophrenia: A narrative review. Consort. Psychiatr. 2023, 4, 43–53.
- Buchlak, Q.D.; Esmaili, N.; Bennett, C.; Farrokhi, F. Natural Language Processing Applications in the Clinical Neurosciences: A Machine Learning Augmented Systematic Review. Acta Neurochir. Suppl. 2022, 134, 277–289.
- Khurana, D.; Koli, A.; Khatter, K.; Singh, S. Natural language processing: State of the art, current trends and challenges. Multimed. Tools Appl. 2023, 82, 3713–3744.
- Crema, C.; Attardi, G.; Sartiano, D.; Redolfi, A. Natural language processing in clinical neuroscience and psychiatry: A review. Front. Psychiatry 2022, 13, 946387.
- Corcoran, C.M.; Carrillo, F.; Fernández-Slezak, D.; Bedi, G.; Klim, C.; Javitt, D.C.; Bearden, C.E.; Cecchi, G.A. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 2018, 17, 67–75.
- Malgaroli, M.; Hull, T.D.; Zech, J.M.; Althoff, T. Natural language processing for mental health interventions: A systematic review and research framework. Transl. Psychiatry 2023, 13, 309.
- Stang, A. Critical evaluation of the Newcastle-Ottawa scale for the assessment of the quality of nonrandomized studies in meta-analyses. Eur. J. Epidemiol. 2010, 25, 603–605.
- Higgins, J.P.; Altman, D.G.; Gøtzsche, P.C.; Jüni, P.; Moher, D.; Oxman, A.D.; Savović, J.; Schulz, K.F.; Weeks, L.; Sterne, J.A. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ 2011, 343, d5928.
- Ku, B.S.; Pauselli, L.; Covington, M.A.; Compton, M.T. Computational linguistic analysis applied to a semantic fluency task: A replication among first-episode psychosis patients with and without derailment and tangentiality. Psychiatry Res. 2021, 304, 114105.
- Parola, A.; Salvini, R.; Gabbatore, I.; Colle, L.; Berardinelli, L.; Bosco, F.M. Pragmatics, Theory of Mind and executive functions in schizophrenia: Disentangling the puzzle using machine learning. PLoS ONE 2020, 15, e0229603.
- Figueroa-Barra, A.; Del Aguila, D.; Cerda, M.; Gaspar, P.A.; Terissi, L.D.; Durán, M.; Valderrama, C. Automatic language analysis identifies and predicts schizophrenia in first-episode of psychosis. Schizophrenia 2022, 8, 53.
- Voppel, A.E.; de Boer, J.N.; Brederoo, S.G.; Schnack, H.G.; Sommer, I.E.C. Semantic and Acoustic Markers in Schizophrenia-Spectrum Disorders: A Combinatory Machine Learning Approach. Schizophr. Bull. 2023, 49 (Suppl. 2), S163–S171.
- Arevian, A.C.; Bone, D.; Malandrakis, N.; Martinez, V.R.; Wells, K.B.; Miklowitz, D.J.; Narayanan, S. Clinical state tracking in serious mental illness through computational analysis of speech. PLoS ONE 2020, 15, e0225695.
- Bedi, G.; Carrillo, F.; Cecchi, G.A.; Slezak, D.F.; Sigman, M.; Mota, N.B.; Ribeiro, S.; Javitt, D.C.; Copelli, M.; Corcoran, C.M. Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ Schizophr. 2015, 1, 15030.
- Jeong, L.; Lee, M.; Eyre, B.; Balagopalan, A.; Rudzicz, F.; Gabilondo, C. Exploring the Use of Natural Language Processing for Objective Assessment of Disorganized Speech in Schizophrenia. Psychiatr. Res. Clin. Pract. 2023, 5, 84–92.
- Rezaii, N.; Walker, E.; Wolff, P. A machine learning approach to predicting psychosis using semantic density and latent content analysis. NPJ Schizophr. 2019, 5, 9.
- Chan, C.C.; Norel, R.; Agurto, C.; Lysaker, P.H.; Myers, E.J.; Hazlett, E.A.; Corcoran, C.M.; Minor, K.S.; Cecchi, G.A. Emergence of Language Related to Self-experience and Agency in Autobiographical Narratives of Individuals with Schizophrenia. Schizophr. Bull. 2023, 49, 444–453.
- Lejeune, A.; Robaglia, B.M.; Walter, M.; Berrouiguet, S.; Lemey, C. Use of Social Media Data to Diagnose and Monitor Psychotic Disorders: Systematic Review. J. Med. Internet Res. 2022, 24, e36986.
- Bae, Y.J.; Shim, M.; Lee, W.H. Schizophrenia Detection Using Machine Learning Approach from Social Media Content. Sensors 2021, 21, 5924.
- Birnbaum, M.L.; Ernala, S.K.; Rizvi, A.F.; De Choudhury, M.; Kane, J.M. A Collaborative Approach to Identifying Social Media Markers of Schizophrenia by Employing Machine Learning and Clinical Appraisals. J. Med. Internet Res. 2017, 19, e289.
- Malik, K.; Widyarini, I.G.A.A.; Kaligis, F.; Kusumawardhani, A.; Astagiri Yusuf, P.; Krisnadhi, A.A.; Riandi, O.; Pujitresnani, A. Differences in syntactic and semantic analysis based on machine learning algorithms in prodromal psychosis and normal adolescents. Asian J. Psychiatr. 2023, 85, 103633.
- Parola, A.; Gabbatore, I.; Berardinelli, L.; Salvini, R.; Bosco, F.M. Multimodal assessment of communicative-pragmatic features in schizophrenia: A machine learning approach. NPJ Schizophr. 2021, 7, 28.
- Perlini, C.; Bellani, M.; Finos, L.; Lasalvia, A.; Bonetto, C.; Scocco, P.; D’Agostino, A.; Torresani, S.; Imbesi, M.; Bellini, F.; et al. Non literal language comprehension in a large sample of first episode psychosis patients in adulthood. Psychiatry Res. 2018, 260, 78–89.
- Minor, K.S.; Lundin, N.B.; Myers, E.J.; Fernández-Villardón, A.; Lysaker, P.H. Automated measures of speech content and speech organization in schizophrenia: Test-retest reliability and generalizability across demographic variables. Psychiatry Res. 2023, 320, 115048.
- Cohen, A.S.; Rodriguez, Z.; Warren, K.K.; Cowan, T.; Masucci, M.D.; Edvard Granrud, O.; Holmlund, T.B.; Chandler, C.; Foltz, P.W.; Strauss, G.P. Natural Language Processing and Psychosis: On the Need for Comprehensive Psychometric Evaluation. Schizophr. Bull. 2022, 48, 939–948.
- Gargano, G.; Caletti, E.; Perlini, C.; Turtulici, N.; Bellani, M.; Bonivento, C.; Garzitto, M.; Siri, F.M.; Longo, C.; Bonetto, C.; et al. Language production impairments in patients with a first episode of psychosis. PLoS ONE 2022, 17, e0272873.
- Iyortsuun, N.K.; Kim, S.H.; Jhon, M.; Yang, H.J.; Pant, S. A Review of Machine Learning and Deep Learning Approaches on Mental Health Diagnosis. Healthcare 2023, 11, 285.
- Tan, E.J.; Yelland, G.W.; Rossell, S.L. Characterising receptive language processing in schizophrenia using word and sentence tasks. Cogn. Neuropsychiatry 2016, 21, 14–31.
- Panesar, K.; Pérez Cabello de Alba, M.B. Natural language processing-driven framework for the early detection of language and cognitive decline. Lang Health 2023, 1, 20–35.
Authors | Population (Type of Participants, N) | Main Use of NLP | Outcomes | Limitations | Quality |
---|---|---|---|---|---|
Ku, B. S., et al. (2021) [32] | N = 197. Patients presenting with FEP. | Clinical and Cognitive Assessment | Patients with derailment produce significantly fewer words than those without derailment. First-episode psychosis patients with moderate-to-severe derailment have a lower Coherence-5 score (0.554) compared to those without derailment (0.570), with a small-to-medium effect size (d = 0.27). | The study has several limitations: SAPS is not a fully accurate or gold-standard method for evaluating FTD due to the subjectivity of the clinician administering the scale, and the three different CoVec output measures are inter-related, measuring the same phenomenon with slightly different approaches. | High |
Parola, A., et al. (2020) [33] | N = 67. A total of 32 individuals with SCZ and 35 HCs (Italy). | Clinical and Cognitive Assessment | The model’s sensitivity showed that all patients with SCZ were classified correctly, with high overall accuracy and good precision, resulting in very few false positives. Pragmatic linguistic ability was identified as the most important factor in distinguishing between SCZ patients and HCs. SCZ was associated with poor performance on tasks involving Theory of Mind, selective attention, extra-linguistic abilities, planning, and inhibition. There was a less clear association for paralinguistic abilities and cognitive flexibility, where patients showed a wider range of performance values. | Relatively small sample size. | High |
Figueroa-Barra, A., et al. (2022) [34] | N = 133 (HC = 49; FEP = 40; Chronic SCZ = 44). All exclusively Spanish-speaking subjects from Chile. | Diagnostic and Predictive Modeling | Using the top ten ranked variables, the model’s accuracy in differentiating between groups was 80.97% (HC vs. SCZ), 85.93% (HC vs. FEP + SCZ), and 91.11% (HC vs. FEP) with a random forest classifier. To evaluate FEP conversion to SCZ, accuracy was measured. Results were poor with only demographic information (43.33%) but improved with PANSS information (65.83%). PANSS allowed for a 67.5% prediction accuracy. Language-only provided a 75.83% accuracy. Combining all information and using the top ten features resulted in a 77.5% accuracy for predicting if an FEP patient would have a confirmed SCZ diagnosis. | Use of exclusively Chilean HCs, self-reported comorbidities like drug abuse, and differing demographic variables between healthy and psychotic subjects, which may introduce potential bias. There was no record of refusals at recruitment. The random forest model used for analysis has a simple and broad interpretation, and the study’s limited sample size may lead to overfitting. Additionally, the longitudinal analysis classes were unbalanced. | High |
Voppel, A. E., et al. (2023) [35] | N = 167. A total of 94 patients with SCZ spectrum disorder (SSD) and 73 HCs. | Diagnostic and Predictive Modeling | Acoustic classifier: 81% accuracy, 89% sensitivity, 70% specificity, AUC-ROC 0.82. Semantic classifier: 80% accuracy, 81% sensitivity, 78% specificity, AUC-ROC 0.83. Combined classifier: 85% accuracy, 92% sensitivity, 79% specificity, AUC-ROC 0.88. | Significant differences in years of education between groups, possible audio contamination from background noise, low test–retest validity for acoustic features, and the use of cross-validation to estimate the generalizability of the models. | High |
Arevian, A. C., et al. (2020) [36] | N = 47. Participants recruited from a community-based mental health clinic for adult patients with serious mental illness. | Diagnostic and Predictive Modeling | Using an individually trained algorithm, prediction models showed a high correlation (up to 0.78) between predicted and actual clinical states based on providers’ global assessment ratings. There was little correlation between individuals regarding which speech features correlated with their clinical state, suggesting that word choice patterns related to mental illness/wellness may be specific to individuals. Both population-based and individualized approaches can inform computational methods using behavioral markers. Statistically significant correlations were found between the model and actual scores for the summary, depression, and self-harm sub-scores of the BASIS-24, and the mental health sub-score of the SF-12. No significant correlations were found for four of the six BASIS-24 sub-scores or the physical health sub-score of the SF-12. | Variability and subjectivity in global assessment ratings due to clinician differences, a small sample size that does not account for participant characteristics, an inability to determine the strengths or weaknesses of features and algorithms for symptom-specific states or differences compared to healthy volunteers, and a predictive model that does not significantly predict the physical health subscale. | Moderate |
Bedi, G., et al. (2015) [37] | N = 34. Participants were help-seeking youths aged 14–27; referred by school, clinicians or self-referred through the Center of Prevention and Evaluation website. | Diagnostic and Predictive Modeling | Baseline speech recordings and transcriptions accurately predicted the transition to psychosis in a high-risk clinical group. Automated analysis outperformed clinical ratings, showing that automated speech analysis can enhance predictive accuracy beyond expert clinical opinion. | Small sample size. | High |
Jeong, L., et al. (2023) [38] | N = 7. Patients admitted to a psychiatric inpatient unit between 2019 and 2021, with a diagnosis of SCZ and psychosis as the main reason for admission. | Diagnostic and Predictive Modeling | Participants with severe symptoms of poverty of speech, content, and social inattentiveness showed reduced lexical richness and syntactic complexity, with lower Honoré’s statistic, shorter word lengths, more high-frequency words, shorter sentences and clauses, and fewer coordinations and prepositions. Those with higher derailment and pressure of speech used more words, clauses, and longer sentences but had lower type–token ratios and content density. Lower BERT next-sentence probability scores were linked to severe derailment, illogicality, and circumstantiality. Machine learning models predicted alogia, illogicality, poverty of speech, social inattentiveness, and global TLC scores with up to 82% accuracy (0.82 F1 score). Preliminary results show that NLP can predict symptom severity from speech records. | The study has several limitations: a small sample size, reliance on the attending psychiatrist’s subjective judgment as the gold standard, and the use of third-party manual transcription services to convert speech recordings into text. | High |
Rezaii, N., et al. (2019) [39] | N = 40. Participants of the North American Prodrome Longitudinal Study (NAPLS) at Emory University. A total of 30 from the second phase (NAPLS-2) and 10 from the third phase (NAPLS-3). | Diagnostic and Predictive Modeling | Semantic density predicted conversion to psychosis in 80% of cases (60% sensitivity, 100% specificity). Latent semantic content (VOICES) predicted conversion in 70% of cases (40% sensitivity, 100% specificity). Combining semantic density and VOICES predicted conversion in 93% of cases (86% sensitivity, 96% specificity). | The small number of participants, insufficient variety in neuropsychiatric disorders studied, and no inclusion of HCs weaken the generalizability and reliability of the results. | High |
Chan, C. C., et al. (2023) [40] | N = 257; 167 patients with SCZ or schizoaffective disorder, 90 HCs. | Linguistic Feature Analysis | Several markers related to self-experience emerged as top features differentiating SCZ from HCs. “Self-experience and Agency” was higher in SCZ. SCZ patients used a more negative emotional tone, spoke less about “Burden”, and used more negative emotional words. Language related to self-experience strongly correlates with clinical symptoms. | Text embedding only reveals the frequency of phrases, not their relationships; there were not enough unmedicated patients to evaluate the effect of medication properly; no validated measure of self-disturbance was used; comprehensive speech signals (including speech sounds and facial expressions) were not sufficiently evaluated; disturbances in self-experience are also prevalent in trauma, with a disproportionate number of veterans in the SCZ group; there was no data on the rates of trauma or comorbid disorders. | |
Lejeune, A., et al. (2022) [41] | A total of 7 studies included, published in the United States (5) and Korea (2) between 2015 and 2021. Sample sizes ranged from 51 to 265,396 participants. | Social Media and Online Content Analysis | Social media data may be utilized for a variety of purposes in the treatment of individuals with schizophrenia, such as post-first psychotic episode patient monitoring. | A small number of included studies, most of which did not use clinical diagnostic data, and limitations in both the methodology and the choice of machine learning algorithms. | Moderate |
Bae, Y. J., et al. (2021) [42] | Large corpus of social media posts collected from the Reddit website between September 2016 and September 2020. N = 13,156 (posts from Reddit sub-communities for SCZ); N = 247,569 (posts from non-mental-health-related subreddits). | Social Media and Online Content Analysis | Classification: A random forest machine learning model achieved a high accuracy of 96% (94% recall, 98% precision, 96% F1-score, and 0.97 AUC) in distinguishing between SCZ and control groups. Linguistic characteristics of SCZ: Posts from individuals with SCZ showed significant linguistic differences compared to control participants. SCZ posts had a lower word count (WC), fewer first-person singular (FPS) and third-person singular (TPS) pronouns, past tense, and positive emotion words. They had higher second-person (SP) pronouns, third-person plural (TPP) pronouns, impersonal pronouns, present tense, and negative emotion words. Topic detection and comparison: The SCZ subreddit focused more on diagnosis (diagnostic), symptoms (Sx), treatment (Tx), and the nature of the disorder. Control subreddits discussed family, friends, social relationships, and general topics more frequently. Increased use of symptom-related words and decreased occurrence of positive general topics characterized the language in the SCZ subreddit. | It focuses on identifying specific textual information in SCZ rather than examining various types of mental disorders, lacks evidence that SCZ subreddit users have an actual diagnosis, and users of the SCZ subreddit may not be representative of the broader population, introducing selection bias. Only commonly used machine learning classifiers were evaluated, and the findings may be limited to Reddit users and might not generalize to users of other platforms. | Moderate |
Birnbaum, M. L., et al. (2017) [43] | N = 292. A total of 146 Twitter users in the SCZ group and 146 Twitter users in the control group. | Social Media and Online Content Analysis | SCZ group: More frequent use of first-person pronouns and words related to biological processes (body, health), less frequent use of words related to “friends.” Subtle language changes may also be linked to SCZ. The classifier agreed with experts in removing inauthentic samples but was overly inclusive in labeling true SCZ cases. | The authors report various limitations: it is impossible to confirm a diagnosis of SCZ via Twitter alone, symptoms of psychosis are not limited to SCZ, access was restricted to public profiles, the classifier was developed using only linguistic variables, and the findings may be limited to Twitter users and may differ from users of other platforms. | High |
Malik, K., et al. (2023) [44] | N = 70. A total of 35 participants in the prodromal group (based on the Prodromal Questionnaire-Brief, Indonesian version); 35 participants in the healthy control group. Participants were aged 14–19 years. | Specific Linguistic Phenomena | Several features significantly distinguish prodromal psychosis adolescents from adolescents without prodromal psychosis, including decreased semantic coherence and word complexity in sentences. Syntactic analysis showed no significant difference in speech production between the two groups. There were differences in word-use frequency in the prodromal psychosis group compared to the group without prodromal psychosis. While not many syntactic and semantic features were statistically significant, linguistic trends similar to those in SCZ patients were found, suggesting the need for a separate model to improve prediction accuracy. The highest accuracy achieved by various classifiers was 57%, with a standard deviation of 3.2% using a random forest classifier. | The authors reported several limitations: a small sample size, potential selection bias from only including participants who responded, and the severity of prodromal psychosis symptoms possibly preventing some from responding. Substance use was eliminated as a confounding factor for adolescent psychosis risk. Prodromal subjects showed subtle speech disorganization, indicating a need for improved predictive models like the tangentiality prediction model. There were imbalances in variables, such as age, psychiatric history, and family history between prodromal and healthy control participants, as well as in the proportion of coherent and incoherent phrase segments analyzed. | High |
Parola, A., et al. (2021) [45] | N = 67. A total of 32 individuals with SCZ and 35 HCs (Italy). | Specific Linguistic Phenomena | The model achieved an accuracy of 0.821 (SD = 0.118), a sensitivity of 0.758 (SD = 0.285), a precision of 0.910 (SD = 0.151), a specificity of 0.900 (SD = 0.175), and an AUC of 0.894 (SD = 0.143). Linguistic irony was the most important factor for classifying individuals as either SCZ patients or HCs. | Sample is relatively small. Heterogeneity of clinical profiles of patients with SCZ disorder not considered. Pragmatic ability can be measured in different ways; communicative features found to be the most informative to classify participants would need to be confirmed in further studies across different pragmatic tasks and contexts. | Moderate |
Perlini, C., et al. (2018) [46] | N = 228. FEP AP = 60; FEP NAP = 168. FEP outpatients recruited from 117 public community mental health centers in Northern Italy. Control group = 70, with no DSM-IV axis I disorder. | Specific Linguistic Phenomena | After adjusting for cognitive measures (flexibility, working memory, IQ), only OPEN task results remained statistically significant, showing similar results for both patient groups compared to HCs. | No assessment of ToM in participants (partly related to comprehension of figurative speech). No observation of possible variations in participants’ speech over time, given the cross-sectional nature of the study. Effects of pharmacotherapy not taken into account. DSM-IV axis II disorders in the control group not assessed or excluded (e.g., schizotypal traits). Lack of a comprehensive evaluation of different pragmatic linguistic aspects (prosody, figures of speech, etc.). | High |
Minor, K. S., et al. (2023) [47] | N = 101 (for baseline to six months sample). N = 47 (for baseline to 1 year sample). N total = 148. All participants were enrolled in at least one of three studies at a Midwestern Veterans Affairs Medical Center (VA), with parent studies having taken place from 2004 to 2016. | Speech and Communication Analysis | Speech content indices generally met fair reliability standards, while speech organization indices were mostly fair to good. Changes in speech indices from baseline to six months rarely varied based on demographics. Speech indices showed the most differences in race, income, and education, but more convergence for age and gender. Regarding replicability, both speech content and organization indices generally did not meet the good test–retest reliability standards of other instruments. | Use of IPII may produce speech that is difficult to replicate, a small number of women participants complicates the analysis of gender effects, and a large sample size of older individuals makes it hard to differentiate between the effects of long-term antipsychotic use and the prolonged effects of the illness. | High |
Cohen, A. S., et al. (2022) [48] | N = 35. A total of 31 patients with a DSM-5 diagnosis of SCZ, and 4 patients with a DSM-5 diagnosis of bipolar disorder. | Speech and Communication Analysis | The integrated NLP-based measure of paranoia in this study was reliable over a week, showed good alignment with the criterion and related measures, and diverged from global measures of negative affect and psychopathology. It had higher criterion validity for white and female participants than for black and male participants. This highlights the need for thorough psychometric evaluations for NLP measures of psychosis, considering demographic biases to avoid missing critical issues. | Sample size is modest and demographically constrained, video data are incomplete for many participants, the sample consists exclusively of participants with serious mental issues, suggesting the need for a community sample with a wider spectrum of symptoms for future studies, and the epoch is limited to a week. | High |
Gargano, G., et al. (2022) [49] | N = 266. A total of 133 FEP (95 non-affective FEP; 38 affective FEP) and 133 HCs. | Speech and Communication Analysis | FEP patients show significant deficits in micro-level speech production, including intraphrasal discourse construction. They use fewer lexical fillers, have a lower speech rate, and shorter utterances compared to HCs. Their narratives have a lower percentage of syntactic completeness. Both affective and non-affective FEP patients exhibit significant impairments in speech rate and mean length of utterances compared to HCs, indicating that all psychotic patients have some impairment in productive aspects of phrasal construction. No significant difference was found between FEP-NA and FEP-A in language production. FEP patients performed worse than controls on neuropsychological tasks (verbal IQ, n-back, SOA). The model using only language production variables had a prediction accuracy of 76.36%. Machine learning results showed that GAF alone predicted FEP and HC groups with a 97.90% accuracy, while neuropsychological measures had a predictive power of 99%. | The study has several limitations: all of the participants were outpatients with only moderate impairment (GAF scores), no significant differences in language production impairments between FEP-A and FEP-NA were found, differences in sample sizes between FEP-A and FEP-NA (38 vs. 95), and the linguistic analysis method used was time-consuming, limiting its clinical applicability. | High |
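Many of the outcomes summarized in the table are reported as accuracy, sensitivity, specificity, precision, or F1 score, and one study also refers to type–token ratios. As a reading aid, the following Python sketch shows how these quantities are conventionally computed. It is only an illustration under stated assumptions: the confusion counts and the example sentence are hypothetical and are not taken from any reviewed study, the names `ConfusionCounts`, `classification_metrics`, and `type_token_ratio` are invented for this example, and the whitespace tokenizer is a deliberate simplification of what the reviewed studies may have used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary classifier counts, treating 'patient' as the positive class."""
    tp: int  # patients correctly classified as patients
    fn: int  # patients misclassified as controls
    tn: int  # controls correctly classified as controls
    fp: int  # controls misclassified as patients


def classification_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Return the standard metrics derived from a 2x2 confusion matrix."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "accuracy": (c.tp + c.tn) / total,
        "sensitivity": c.tp / (c.tp + c.fn),  # also called recall
        "specificity": c.tn / (c.tn + c.fp),
        "precision": c.tp / (c.tp + c.fp),
        "f1": 2 * c.tp / (2 * c.tp + c.fp + c.fn),
    }


def type_token_ratio(text: str) -> float:
    """Unique words divided by total words (naive whitespace tokenizer)."""
    tokens = text.lower().split()
    if not tokens:
        raise ValueError("text contains no tokens")
    return len(set(tokens)) / len(tokens)


if __name__ == "__main__":
    # Hypothetical counts: 50 patients and 50 controls.
    counts = ConfusionCounts(tp=40, fn=10, tn=45, fp=5)
    for name, value in classification_metrics(counts).items():
        print(f"{name}: {value:.3f}")
    # accuracy 0.850, sensitivity 0.800, specificity 0.900

    # Hypothetical sentence: 4 unique words out of 5.
    print(f"TTR: {type_token_ratio('the cat saw the dog'):.2f}")  # 0.80
```

Because accuracy depends on the class balance of the sample while sensitivity and specificity do not, figures from studies with very different group sizes (for example, the Reddit corpus versus the small clinical samples) are easier to compare on sensitivity and specificity than on accuracy alone.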
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Deneault, A.; Dumais, A.; Désilets, M.; Hudon, A. Natural Language Processing and Schizophrenia: A Scoping Review of Uses and Challenges. J. Pers. Med. 2024, 14, 744. https://doi.org/10.3390/jpm14070744