Abstract
In recent years, there has been an increasing interest in digital humanities. This interest is justified by the development of natural language processing tools and the emergence of digitized text collections of documents in different fields of knowledge, for example, literature, art, philosophy, and history. In this paper, we applied unsupervised topic modeling to the Bulletin of Opposition, the journal of Soviet opposition published by Trotskyists in Paris from 1929 to 1941, to analyze the main trends in the Russian opposition-leaning media. We identified topic classes using models based on Latent Dirichlet Allocation and examined Dynamic Topic Models as a tool to single out the main issues of interest for historical research. Applying topic modeling and statistical methods, we proposed an approach to Retrospective Event Detection that was evaluated on a human-annotated set of historical news items. The present study may help to improve event detection on smaller text corpora.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 113–120. ACM (2006)
Loukachevitch, N., Mischenko, N.: Evaluation of approaches for most frequent sense identification in Russian. In: van der Aalst, W.M.P., et al. (eds.) AIST 2018. LNCS, vol. 11179, pp. 99–110. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-11027-7_10
Jelodar, H., et al.: Latent Dirichlet Allocation (LDA) and topic modeling: models, applications, a survey. Multimedia Tools Appl. 78, 1–43 (2018)
Chen, J., Shang, Q., Xiong, H.: Hot events detection for chinese microblogs based on the TH-LDA model. In: Proceedings of the 2018 International Conference on Transportation & Logistics, Information & Communication, Smart City, TLICSC 2018. Atlantis Press (2018)
Keane, N., Yee, C., Zhou, L.: Using topic modeling and similarity thresholds to detect events. In: Proceedings of the 3rd Workshop on EVENTS at the NAACL-HLT 2015, pp. 34–42 (2015)
Gupta, M., Gupta, P.: Research and implementation of event extraction from Twitter using LDA and scoring function. Int. J. Inf. Technol. 11(2), 365–371 (2019)
Ge, B., et al.: Microblog topic mining based on a combined TF-IDF and LDA topic model. In: Automatic Control, Mechatronics and Industrial Engineering: Proceedings of the International Conference on Automatic Control, Mechatronics and Industrial Engineering, ACMIE 2018, Suzhou, China, 29–31 October 2018, p. 291. CRC Press (2019)
\(\ll \)The Bulletin of Opposition\(\gg \). https://www.1917.com/Marxism/Trotsky/BO/Main.html. Accessed 13 Feb 2019
Morphological analyzer pymorphy2. https://radimrehurek.com/gensim/index.html. Accessed 13 Feb 2019
Gensim. Topic modeling for humans. https://radimrehurek.com/gensim/index.html. Accessed 13 Feb 2019
https://github.com/oldaandozerskaya/Bulletin_of_opposition. Accessed 03 June 2019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Glazkova, A., Kruzhinov, V., Sokova, Z. (2019). Dynamic Topic Models for Retrospective Event Detection: A Study on Soviet Opposition-Leaning Media. In: van der Aalst, W., et al. Analysis of Images, Social Networks and Texts. AIST 2019. Lecture Notes in Computer Science(), vol 11832. Springer, Cham. https://doi.org/10.1007/978-3-030-37334-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-37334-4_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37333-7
Online ISBN: 978-3-030-37334-4
eBook Packages: Computer ScienceComputer Science (R0)