Tweet Timeline Generation via Graph-Based Dynamic Greedy Clustering

Feifan Fan¹⁹,
Runwei Qiang¹⁹,
Chao Lv¹⁹,
Wayne Xin Zhao²⁰ &
Jianwu Yang¹⁹

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9460)

Included in the following conference series:

AIRS

Abstract

When issuing a query on a microblogging service, a user typically receives an archive of tweets as part of a retrospective piece on the impact of social media. To make the retrieved tweets easier to understand, it is useful to produce a summarized timeline for the given topic. However, tweet timeline generation is challenging because of the noisy and temporal nature of microblogs. In this paper, we propose a graph-based dynamic greedy clustering approach that considers the coverage, relevance and novelty of the tweet timeline. First, we learn tweet embedding representations in order to construct a tweet semantic graph. Based on this graph, we estimate the coverage of a timeline from the graph connectivity. Furthermore, we integrate a noise-elimination component that removes noisy tweets using lexical and semantic features based on relevance and novelty. Experimental results on public Text Retrieval Conference (TREC) Twitter corpora demonstrate the effectiveness of the proposed approach.
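
The abstract gives only a high-level outline, so the following is a minimal sketch of one way a graph-based greedy selection over tweet embeddings could look, not the procedure from the paper. Everything in it is an assumption made for illustration: the names `build_graph` and `greedy_timeline`, the cosine-similarity threshold `sim_threshold`, the relevance cut-off `min_relevance` standing in for noise elimination, and the tie-breaking rule. The embeddings and relevance scores are taken as given inputs, and the "dynamic" aspect of the proposed approach is not modeled.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def build_graph(vectors, sim_threshold):
    """Connect two tweets when their embeddings are similar enough (assumed rule)."""
    adj = {i: set() for i in range(len(vectors))}
    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if cosine(vectors[i], vectors[j]) >= sim_threshold:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def greedy_timeline(vectors, relevance, sim_threshold=0.8, min_relevance=0.0):
    """Greedily pick tweets that cover the most not-yet-covered neighbours.

    Tweets below `min_relevance` are dropped first (a crude stand-in for
    noise elimination). Picking a tweet marks it and its neighbours as
    covered, so near-duplicates are never selected (novelty).
    Returns the indices of the selected tweets in selection order.
    """
    adj = build_graph(vectors, sim_threshold)
    uncovered = {i for i in range(len(vectors)) if relevance[i] >= min_relevance}
    timeline = []
    while uncovered:
        # Coverage first, then relevance, then the lower index as a tie-break.
        best = max(uncovered, key=lambda i: (len(adj[i] & uncovered), relevance[i], -i))
        timeline.append(best)
        uncovered -= adj[best] | {best}
    return timeline


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings": two near-duplicate pairs and one outlier.
    vectors = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [0.7, 0.7]]
    relevance = [0.9, 0.8, 0.7, 0.6, 0.05]
    print(greedy_timeline(vectors, relevance, sim_threshold=0.8, min_relevance=0.1))
```

In this toy input each near-duplicate pair forms one connected component of the graph, so one tweet per pair is selected, and the low-relevance outlier is filtered out before selection. Sorting the selected indices by posting time would turn the result into a chronological timeline.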

Notes

1. https://github.com/lintool/twitter-tools

Acknowledgments

The work reported in this paper is supported by the National Natural Science Foundation of China Grant 61370116. We thank anonymous reviewers for their beneficial comments.

Author information

Authors and Affiliations

Institute of Computer Science and Technology, Peking University, Beijing, 100871, China
Feifan Fan, Runwei Qiang, Chao Lv & Jianwu Yang
School of Information, Renmin University of China, Beijing, China
Wayne Xin Zhao

Corresponding author

Correspondence to Jianwu Yang.

Editor information

Editors and Affiliations

Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia
Guido Zuccon
Brisbane, Queensland, Australia
Shlomo Geva
University of Tsukuba, Ibaraki, Japan
Hideo Joho
RMIT University, Melbourne, Australia
Falk Scholer
School of Computer Engineering, Nanyang Technological University, Singapore, Singapore
Aixin Sun
Tianjin University, China
Peng Zhang

About this paper

Cite this paper

Fan, F., Qiang, R., Lv, C., Zhao, W.X., Yang, J. (2015). Tweet Timeline Generation via Graph-Based Dynamic Greedy Clustering. In: Zuccon, G., Geva, S., Joho, H., Scholer, F., Sun, A., Zhang, P. (eds) Information Retrieval Technology. AIRS 2015. Lecture Notes in Computer Science, vol 9460. Springer, Cham. https://doi.org/10.1007/978-3-319-28940-3_24

DOI: https://doi.org/10.1007/978-3-319-28940-3_24
Published: 22 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28939-7
Online ISBN: 978-3-319-28940-3
eBook Packages: Computer Science, Computer Science (R0)
