[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Abstract

Topic Models are representations of the given text. Topic models are unsupervised in nature because they totally depend on word distributions. Few generative topic models obtain the topics proportionate to the richness of the given text. Applying deep learning for identifying the worthy distributions in a rich text is quite helpful for generation of quality topics. This paper discusses the idea of learning deep topics of interest from scientific research articles. The learning is handled using deep stacked auto-encoder with three hidden layer stack coupled with generative topic models. The deep learning framework is explored excluding and including back-propagation and is tested upon both LDA and HDP as foundational topic model. Experiments conducted over the data-set of research articles from top bio-medical journals reveal better topic coherence for learning deep topics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 143.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 179.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
GBP 179.99
Price includes VAT (United Kingdom)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Lau, J. H., Collier, N., Baldwin, T.: On-line trend analysis with topic models:\# twitter trends detection topic model online. Proceedings of COLING 2012, 1519-1534 (2012)

    Google Scholar 

  2. Blei, David M.: Probabilistic topic models. Communications of the ACM, 55(4):77–84(2012)

    Google Scholar 

  3. Blei, David M., Andrew Y. Ng, and Michael I.Jordan.: Latent Dirichlet allocation, Journal of Machine earning Research, 3:993–1022(2003)

    Google Scholar 

  4. Paisley, John, Chong, Wang, Blei, DM, Michael Jordan.: Nested hierarchical Dirichlet processes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 2, pp. 256-270 (2015)

    Google Scholar 

  5. MuthuSelvi, G, Mahalakshmi, GS, Sendhilkumar, S, Vijayakumar, Yongxin Zhu, Victor Chang.: Sustainable Computing Based Deep Learning Framework for Writing Research Manuscripts, IEEE Transactions on Sustainable Computing, ISSN: 2377-3782, (2018)

    Google Scholar 

  6. Richang, Hong, Luming, Zhang, Chao, Zhang, Roger 2016, ‘Zimmermann:Flickr Circles: Aesthetic Tendency Discovery by Multi-View Regularized Topic Modeling, IEEE Trans. Multimedia, vol.18,no.8, pp.1555-1567.

    Google Scholar 

  7. Röder, Michael, Andreas Both, Alexander Hinneburg.: Exploring the space of topic coherence measures. Proc. of 8th ACM international conference on Web search and data mining. ACM (2015)

    Google Scholar 

  8. Rubin, TN, Chambers, A, Smyth, P, Steyvers M.: Statistical Topic Models for Multi-Label Document Classification, Machine learning, vol. 88, no. 1, pp. 157-208 (2012)

    Article  MathSciNet  Google Scholar 

  9. Ting Hua, Xuchao Zhang, Wei Wang, Chang-Tien Lu, Naren Ramakrishnan.: Automatical Storyline Generation with Help from Twitter. In Proc. of 25th ACM International on Conference on Information and Knowledge Management (CIKM '16). ACM, New York, NY, USA, 2383-2388 (2010)

    Google Scholar 

  10. Tuan-Anh Hoang, Ee-Peng.: Lim: Modeling Topics and Behavior of Microbloggers: An Integrated Approach’, ACMTIST, vol. 8, no. 3, pp. 44:1-44:37 (2017)

    Google Scholar 

  11. Tuan-Anh Hoang, Ee-Peng Lim.: Microblogging Content Propagation Modeling Using Topic-Specific Behavioral Factors, IEEE Trans. Knowl. Data Eng. vol. 28, no. 9, pp. 2407-2422 (2016)

    Google Scholar 

  12. Wang, H, Wu, F, Li, X, Tang, S, Shao, J, Zhuang, Y.: Jointly discovering fine-grained and coarse-grained sentiments via topic modeling, Proceedings of the 22nd ACM international conference on Multimedia, pp. 913–916 (2014)

    Google Scholar 

  13. Wei Liang, Zixian Lu, Qun Jin, YonghuaXiong.: Min Wu:Modeling and Analyzing of Research Topic Evolution Associated with Social Networks of Researchers, IJDST, vol. 7, no. 3, pp. 42-62 (2016)

    Google Scholar 

  14. Wei Zhang, Jiayu Zhuang, Xi Yong, Jiankou Li, Wei Chen, Zhemin.: Li:Personalized topic modeling for recommending user-generated content’, Frontiers of IT & EE, vol. 18, no. 5, pp. 708-718 (2017)

    Google Scholar 

  15. William Speier, Michael K Ong, Corey W Arnold.: Using phrases and document metadata to improve topic modeling of clinical reports, Journal of Biomedical Informatics, vol. 61, pp. 260-266 (2016)

    Google Scholar 

  16. Wong, Melvin, Bilal Farooq, Guillaume-Alexandre Bilodeau.: Discriminative conditional restricted Boltzmann machine for discrete choice and latent variable modelling’, arXiv preprint arXiv: 1706.00505 (2017)

    Google Scholar 

  17. Hofmann, T.: Probabilistic latent semantic indexing, Article presented at the Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pp.14 (1999)

    Google Scholar 

  18. Papadimitriou, CH, Tamaki, H, Raghavan, P, Vempala, S.: Latent semantic indexing: A probabilistic analysis. Article presented at the Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, pp.12 (1998)

    Google Scholar 

  19. Zhen Hai, Gao Cong, Kuiyu Chang, Peng Cheng, Chunyan Miao.: Analyzing Sentiments in One Go: A Supervised Joint Topic Modeling Approach. IEEE Trans. Knowl. Data Eng. vol. 29, no. 6, pp. 1172-1185 (2017)

    Google Scholar 

  20. Zhou, Q., Yang, N., Wei, F., Huang, S., Zhou, M., Zhao, T.: Neural Document Summarization by Jointly Learning to Score and Select Sentences. arXiv preprint arXiv:1807.02305(2018)

    Google Scholar 

  21. Zongda Wu, Li Lei, Guiling Li, Hui Huang, Chengren Zheng, Enhong Chen, Guandong Xu.: A topic modeling based approach to novel document automatic summarization’, Expert Syst. Appl. vol. 84, pp. 12-23 (2017)

    Google Scholar 

  22. Chien, Jen-Tzung.: Hierarchical Pitman-Yor-Dirichlet language model, IEEE Transactions on Audio, Speech, and Language Processing, vol. 23, no. 8, pp.1259-1272 (2015)

    Google Scholar 

  23. Pratanwanich, N, Lio, P.: Who Wrote This? Textual Modeling with Authorship Attribution in Big Data, In the Proceedings of the 2014 IEEE International Conference on Data Mining Workshop, vol. 16, no. 4, pp. 645-652 (2014)

    Google Scholar 

  24. Arifin, A. Z., Abdullah, M. Z., Rosyadi, A. W., Ulumi, D. I., Wahib, A., Sholikah, R. W.: Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging for Multi-Document Summarization. TELKOMNIKA, 16(2), 843-851 (2018)

    Article  Google Scholar 

  25. Mahalakshmi, G. S., Siva, R., Sendhilkumar, S.: On The Expressive Power of Scientific Manuscripts. IEEE Transactions on Emerging Topics in Computing (2018)

    Google Scholar 

  26. David Mimno, Hanna M. Wallach, Edmund Talley, Miriam Leenders, Andrew McCallum.: Optimizing semantic coherence in topic models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 262-272 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to G. S. Mahalakshmi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Mahalakshmi, G.S., Hemadharsana, S., Muthuselvi, G., Sendhilkumar, S. (2020). Learning Deep Topics of Interest. In: Smys, S., Iliyasu, A.M., Bestak, R., Shi, F. (eds) New Trends in Computational Vision and Bio-inspired Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-41862-5_156

Download citation

Publish with us

Policies and ethics