Abstract
We present data demonstrating how brain health may be assessed by applying data-mining and text analytics to patient language. Three brain-based disorders are investigated - Alzheimer’s Disease, cognitive impairment and clinical depression. Prior studies identify particular language characteristics associated with these disorders. Our data show computer-based pattern recognition can distinguish language samples from individuals with and without these conditions. Binary classification accuracies range from 73% to 97% depending on details of the classification task. Text classification accuracy is known to improve substantially as training data approaches web-scale. Such a web scale dataset seems inevitable given the ubiquity of social computing and its language intensive nature. Given this context, we claim that the classification accuracy levels obtained in our experiments are significant findings for the fields of web intelligence and applied brain informatics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Peintner, B., Jarrold, W., Vergyri, D., Richey, C., Gorno Tempini, M., Ogar, J.: Learning Diagnostic Models Using Speech and Language Measures. In: 30th Annual International IEEE EMBS Conference, Vancouver, British Columbia, Canada, August 20-24 (2008)
Gottschalk, L.A., Bechtel, R.J., Maguire, G.A., Katz, M.L., Levinson, D.M., Harrington, D.E., et al.: Computer detection of cognitive impairment and associated neuropsychiatric dimensions from the content analysis of verbal samples. American Journal of Drug and Alcohol Abuse 28, 653–670 (2002)
Toutanova, K., Klein, D., Manning, C., Singer, Y.: Feature-rich part- of-speech tagging with a cyclic dependency network. In: Proceedings of HLT-NAACL, pp. 252–259 (2003)
Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic Inquiry and Word Count: LIWC 2001. Erlbaum Publishers, Mahwah (2001)
Brown, C., Snodgrass, T., Kemper, S.J., Herman, R., Covington, M.A.: Automatic measurement of propositional idea density from part-of-speech tagging. Behavior Research Methods 40(2), 540–545 (2008)
Snowdon, D.A., Kemper, S.J., Mortimer, J.A., Greiner, L.H., Wekstein, D.R., Mackesbery, W.R.: Linguistic ability in early life and cognitive function and Alzheimer’s disease in late life: Finds from the Nun Study. Journal of the American Medical Association 275, 528–532 (1996)
Thomas, C., Keselj, V., Cercone, N., Rockwood, K., Asp, E.: Automatic detection and rating of dementia of Alzheimer type through lexical analysis of spontaneous speech. In: 2005 IEEE International Conference on Mechatronics and Automation, vol. 3, pp. 1569–1574 (2005)
Wilson, S.M., Henry, M.L., Besbris, M., Ogar, J.M., Dronkers, N.F., Jarrold, W., Miller, B.L., Gorno-Tempini, M.L.: Connected speech production in three variants of primary progressive aphasia. Brain, Advance Access published on June 11 (2010), doi: 10.1093/brain/awq129
Rosenman, R.H., et al.: A Predictive Study of Coronary Heart Disease: The Western Collaborative Groups Study. Journal of the American Medical Association 189, 15–22 (1964)
Eslinger, P.J., Damasio, A.R., Benton, A.L.: The Iowa Screening Battery for Mental Decline. Department of Neurology (Div of Behavioral Neurology), University of Iowa (1984)
Witten, I.H., Frank, E.: Data mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Beck, A.T.: The Evolution of the Cognitive Model of Depression and Its Neurobiological Correlates. Am. J. Psychiatry 165, 969–977 (2008)
World Health Organization: The global burden of disease: 2004 update, Part 3, Disease incidence, prevalence and disability (2004)
Stirman, S.W., Pennebaker, J.W.: Word use in poetry of suicidal and non-suicidal poets. Psychosomatic Medicine 63, 517–522 (2001)
Rude, S.S., Gortner, E.M., Pennebaker, J.W.: Language use of depressed and depression-vulnerable college students. Cognition and Emotion 18, 112–133 (2004)
Mehl, M.R.: The lay assessment of sub-clinical depression in daily life. Psychological Assessment 18, 340–345 (2006)
Zhong, N., Liu, J., Yao, Y.: In Search of the Wisdom Web. Computer, 27–31 (November 2002)
Banko, M., Brill, E.: Scaling to Very Very Large Corpora for Natural Language Disambiguation. In: ACL 2001, pp.26–33 (2001)
Sun, R., Peterson, T.: Multi-agent reinforcement learning: weighting and partitioning. Neural Networks 12, 727–753 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jarrold, W.L., Peintner, B., Yeh, E., Krasnow, R., Javitz, H.S., Swan, G.E. (2010). Language Analytics for Assessing Brain Health: Cognitive Impairment, Depression and Pre-symptomatic Alzheimer’s Disease. In: Yao, Y., Sun, R., Poggio, T., Liu, J., Zhong, N., Huang, J. (eds) Brain Informatics. BI 2010. Lecture Notes in Computer Science(), vol 6334. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15314-3_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-15314-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15313-6
Online ISBN: 978-3-642-15314-3
eBook Packages: Computer ScienceComputer Science (R0)