[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/2390524.2390599dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
research-article
Free access

Finding bursty topics from microblogs

Published: 08 July 2012 Publication History

Abstract

Microblogs such as Twitter reflect the general public's reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs because compared with other text streams such as news articles and scientific publications, microblog posts are particularly diverse and noisy. To find topics that have bursty patterns on microblogs, we propose a topic model that simultaneously captures two observations: (1) posts published around the same time are more likely to have the same topic, and (2) posts published by the same user are more likely to have the same topic. The former helps find event-driven posts while the latter helps identify and filter out "personal" posts. Our experiments on a large Twitter dataset show that there are more meaningful and unique bursty topics in the top-ranked results returned by our model than an LDA baseline and two degenerate variations of our model. We also show some case studies that demonstrate the importance of considering both the temporal information and users' personal interests for bursty topic detection from microblogs.

References

[1]
Amr Ahmed and Eric P. Xing. 2008. Dynamic non-parametric mixture models and the recurrent Chinese restaurant process: with applications to evolutionary clustering. In Proceedings of the SIAM International Conference on Data Mining, pages 219--230.
[2]
Amr Ahmed and Eric P. Xing. 2010. Timeline: A dynamic hierarchical Dirichlet process model for recovering birth/death and evolution of topics in text stream. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence, pages 20--29.
[3]
David M. Blei and John D. Lafferty. 2006. Dynamic topic models. In Proceedings of the 23rd International Conference on Machine Learning.
[4]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research, 3: 993--1022.
[5]
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Philip S. Yu, and Hongjun Lu. 2005. Parameter free bursty events detection in text streams. In Proceedings of the 31st International Conference on Very Large Data Bases, pages 181--192.
[6]
Amit Gruber, Michal Rosen-Zvi, and Yair Weiss. 2007. Hidden topic Markov model. In Proceedings of the International Conference on Artificial Intelligence and Statistics.
[7]
Liangjie Hong, Byron Dom, Siva Gurumurthy, and Kostas Tsioutsiouliklis. 2011. A time-dependent topic model for multiple text streams. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 832--840.
[8]
Alexander Ihler, Jon Hutchins, and Padhraic Smyth. 2006. Adaptive event detection with time-varying poisson processes. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 207--216.
[9]
Jon Kleinberg. 2002. Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 91--101.
[10]
Tomonari Masada, Daiji Fukagawa, Atsuhiro Takasu, Tsuyoshi Hamada, Yuichiro Shibata, and Kiyoshi Oguri. 2009. Dynamic hyperparameter optimization for bayesian topical trend analysis. In Proceedings of the 18th ACM Conference on Information and knowledge management, pages 1831--1834.
[11]
Ramesh M. Nallapati, Susan Ditmore, John D. Lafferty, and Kin Ung. 2007. Multiscale topic tomography. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 520--529.
[12]
Michal Rosen-Zvi, Thomas Griffiths, Mark Steyvers, and Padhraic Smyth. 2004. The author-topic model for authors and documents. In Proceedings of the 20th conference on Uncertainty in artificial intelligence, pages 487--494.
[13]
Xuerui Wang and Andrew McCallum. 2006. Topics over time: a non-Markov continuous-time model of topical trends. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 424--433.
[14]
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, and Richard Sproat. 2007. Mining correlated bursty topic patterns from coordinated text streams. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 784--793.
[15]
Chong Wang, David M. Blei, and David Heckerman. 2008. Continuous time dynamic topic models. In Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence, pages 579--586.
[16]
Jianshu Weng and Francis Lee. 2011. Event detection in Twitter. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media.
[17]
Wayne Xin Zhao, Jing Jiang, Jianshu Weng, Jing He, Ee-Peng Lim, Hongfei Yan, and Xiaoming Li. 2011. Comparing twitter and traditional media using topic models. In Proceedings of the 33rd European conference on Advances in information retrieval, pages 338--349.

Cited By

View all
  • (2019)Latent Dirichlet allocation (LDA) and topic modelingMultimedia Tools and Applications10.1007/s11042-018-6894-478:11(15169-15211)Online publication date: 1-Jun-2019
  • (2018)Time-Varying Dynamic Topic ModelJournal of Global Information Management10.4018/JGIM.201801010626:1(104-119)Online publication date: 1-Jan-2018
  • (2018)Insights from the Long-TailProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271706(297-306)Online publication date: 17-Oct-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '12: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
July 2012
1100 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 08 July 2012

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)68
  • Downloads (Last 6 weeks)4
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Latent Dirichlet allocation (LDA) and topic modelingMultimedia Tools and Applications10.1007/s11042-018-6894-478:11(15169-15211)Online publication date: 1-Jun-2019
  • (2018)Time-Varying Dynamic Topic ModelJournal of Global Information Management10.4018/JGIM.201801010626:1(104-119)Online publication date: 1-Jan-2018
  • (2018)Insights from the Long-TailProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271706(297-306)Online publication date: 17-Oct-2018
  • (2018)Social Network Monitoring for Bursty Cascade DetectionACM Transactions on Knowledge Discovery from Data10.1145/317804812:4(1-24)Online publication date: 16-Apr-2018
  • (2018)Detecting global and local topics via mining twitter dataNeurocomputing10.1016/j.neucom.2017.07.056273:C(120-132)Online publication date: 17-Jan-2018
  • (2018)Detecting bursts in sentiment-aware topics from social mediaKnowledge-Based Systems10.1016/j.knosys.2017.11.007141:C(44-54)Online publication date: 1-Feb-2018
  • (2017)Mining Twitter features for event summarization and ratingProceedings of the International Conference on Web Intelligence10.1145/3106426.3106487(615-622)Online publication date: 23-Aug-2017
  • (2017)Inferring Dynamic User Interests in Streams of Short Texts for User ClusteringACM Transactions on Information Systems10.1145/307260636:1(1-37)Online publication date: 17-Jul-2017
  • (2017)A probabilistic method for emerging topic tracking in Microblog streamWorld Wide Web10.1007/s11280-016-0390-420:2(325-350)Online publication date: 1-Mar-2017
  • (2016)PercimoProceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.5555/3192424.3192474(271-278)Online publication date: 18-Aug-2016
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media