Abstract
Topic Models are representations of the given text. Topic models are unsupervised in nature because they totally depend on word distributions. Few generative topic models obtain the topics proportionate to the richness of the given text. Applying deep learning for identifying the worthy distributions in a rich text is quite helpful for generation of quality topics. This paper discusses the idea of learning deep topics of interest from scientific research articles. The learning is handled using deep stacked auto-encoder with three hidden layer stack coupled with generative topic models. The deep learning framework is explored excluding and including back-propagation and is tested upon both LDA and HDP as foundational topic model. Experiments conducted over the data-set of research articles from top bio-medical journals reveal better topic coherence for learning deep topics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lau, J. H., Collier, N., Baldwin, T.: On-line trend analysis with topic models:\# twitter trends detection topic model online. Proceedings of COLING 2012, 1519-1534 (2012)
Blei, David M.: Probabilistic topic models. Communications of the ACM, 55(4):77–84(2012)
Blei, David M., Andrew Y. Ng, and Michael I.Jordan.: Latent Dirichlet allocation, Journal of Machine earning Research, 3:993–1022(2003)
Paisley, John, Chong, Wang, Blei, DM, Michael Jordan.: Nested hierarchical Dirichlet processes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 2, pp. 256-270 (2015)
MuthuSelvi, G, Mahalakshmi, GS, Sendhilkumar, S, Vijayakumar, Yongxin Zhu, Victor Chang.: Sustainable Computing Based Deep Learning Framework for Writing Research Manuscripts, IEEE Transactions on Sustainable Computing, ISSN: 2377-3782, (2018)
Richang, Hong, Luming, Zhang, Chao, Zhang, Roger 2016, ‘Zimmermann:Flickr Circles: Aesthetic Tendency Discovery by Multi-View Regularized Topic Modeling, IEEE Trans. Multimedia, vol.18,no.8, pp.1555-1567.
Röder, Michael, Andreas Both, Alexander Hinneburg.: Exploring the space of topic coherence measures. Proc. of 8th ACM international conference on Web search and data mining. ACM (2015)
Rubin, TN, Chambers, A, Smyth, P, Steyvers M.: Statistical Topic Models for Multi-Label Document Classification, Machine learning, vol. 88, no. 1, pp. 157-208 (2012)
Ting Hua, Xuchao Zhang, Wei Wang, Chang-Tien Lu, Naren Ramakrishnan.: Automatical Storyline Generation with Help from Twitter. In Proc. of 25th ACM International on Conference on Information and Knowledge Management (CIKM '16). ACM, New York, NY, USA, 2383-2388 (2010)
Tuan-Anh Hoang, Ee-Peng.: Lim: Modeling Topics and Behavior of Microbloggers: An Integrated Approach’, ACMTIST, vol. 8, no. 3, pp. 44:1-44:37 (2017)
Tuan-Anh Hoang, Ee-Peng Lim.: Microblogging Content Propagation Modeling Using Topic-Specific Behavioral Factors, IEEE Trans. Knowl. Data Eng. vol. 28, no. 9, pp. 2407-2422 (2016)
Wang, H, Wu, F, Li, X, Tang, S, Shao, J, Zhuang, Y.: Jointly discovering fine-grained and coarse-grained sentiments via topic modeling, Proceedings of the 22nd ACM international conference on Multimedia, pp. 913–916 (2014)
Wei Liang, Zixian Lu, Qun Jin, YonghuaXiong.: Min Wu:Modeling and Analyzing of Research Topic Evolution Associated with Social Networks of Researchers, IJDST, vol. 7, no. 3, pp. 42-62 (2016)
Wei Zhang, Jiayu Zhuang, Xi Yong, Jiankou Li, Wei Chen, Zhemin.: Li:Personalized topic modeling for recommending user-generated content’, Frontiers of IT & EE, vol. 18, no. 5, pp. 708-718 (2017)
William Speier, Michael K Ong, Corey W Arnold.: Using phrases and document metadata to improve topic modeling of clinical reports, Journal of Biomedical Informatics, vol. 61, pp. 260-266 (2016)
Wong, Melvin, Bilal Farooq, Guillaume-Alexandre Bilodeau.: Discriminative conditional restricted Boltzmann machine for discrete choice and latent variable modelling’, arXiv preprint arXiv: 1706.00505 (2017)
Hofmann, T.: Probabilistic latent semantic indexing, Article presented at the Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pp.14 (1999)
Papadimitriou, CH, Tamaki, H, Raghavan, P, Vempala, S.: Latent semantic indexing: A probabilistic analysis. Article presented at the Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, pp.12 (1998)
Zhen Hai, Gao Cong, Kuiyu Chang, Peng Cheng, Chunyan Miao.: Analyzing Sentiments in One Go: A Supervised Joint Topic Modeling Approach. IEEE Trans. Knowl. Data Eng. vol. 29, no. 6, pp. 1172-1185 (2017)
Zhou, Q., Yang, N., Wei, F., Huang, S., Zhou, M., Zhao, T.: Neural Document Summarization by Jointly Learning to Score and Select Sentences. arXiv preprint arXiv:1807.02305(2018)
Zongda Wu, Li Lei, Guiling Li, Hui Huang, Chengren Zheng, Enhong Chen, Guandong Xu.: A topic modeling based approach to novel document automatic summarization’, Expert Syst. Appl. vol. 84, pp. 12-23 (2017)
Chien, Jen-Tzung.: Hierarchical Pitman-Yor-Dirichlet language model, IEEE Transactions on Audio, Speech, and Language Processing, vol. 23, no. 8, pp.1259-1272 (2015)
Pratanwanich, N, Lio, P.: Who Wrote This? Textual Modeling with Authorship Attribution in Big Data, In the Proceedings of the 2014 IEEE International Conference on Data Mining Workshop, vol. 16, no. 4, pp. 645-652 (2014)
Arifin, A. Z., Abdullah, M. Z., Rosyadi, A. W., Ulumi, D. I., Wahib, A., Sholikah, R. W.: Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging for Multi-Document Summarization. TELKOMNIKA, 16(2), 843-851 (2018)
Mahalakshmi, G. S., Siva, R., Sendhilkumar, S.: On The Expressive Power of Scientific Manuscripts. IEEE Transactions on Emerging Topics in Computing (2018)
David Mimno, Hanna M. Wallach, Edmund Talley, Miriam Leenders, Andrew McCallum.: Optimizing semantic coherence in topic models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 262-272 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mahalakshmi, G.S., Hemadharsana, S., Muthuselvi, G., Sendhilkumar, S. (2020). Learning Deep Topics of Interest. In: Smys, S., Iliyasu, A.M., Bestak, R., Shi, F. (eds) New Trends in Computational Vision and Bio-inspired Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-41862-5_156
Download citation
DOI: https://doi.org/10.1007/978-3-030-41862-5_156
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-41861-8
Online ISBN: 978-3-030-41862-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)