[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2487788.2488172acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Understanding the diversity of tweets in the time of outbreaks

Published: 13 May 2013 Publication History

Abstract

A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.

References

[1]
E. Aramaki, S. Maskawa, and M. Morita. Twitter catches the flu: Detecting influenza epidemics using twitter. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP '11, 2011.
[2]
N. Collier. What's Unusual in Online Disease Outbreak News? Journal of Biomedical Semantics, 1(1):2, 2010.
[3]
N. Collier and S. Doan. Syndromic classification of twitter messages. CoRR, abs/1110.3094, 2011.
[4]
N. Collier, N. T. Son, and N. M. Nguyen. Omg u got flu? analysis of shared health messages for bio-surveillance. CoRR, abs/1110.3089, 2011.
[5]
A. Culotta. Towards detecting influenza epidemics by analyzing twitter messages. In Proceedings of the First Workshop on Social Media Analytics, SOMA '10, 2010.
[6]
F. Deng, S. Siersdorfer, and S. Zerr. Efficient jaccard-based diversity analysis of large document collections. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, 2012.
[7]
E. Diaz-Aviles, A. Stewart, E. Velasco, K. Denecke, and W. Nejdl. Epidemic intelligence for the crowd, by the crowd. In International AAAI Conference on Weblogs and Social Media, ICWSM '12, 2012.
[8]
A. Dong, R. Zhang, P. Kolari, J. Bai, F. Diaz, Y. Chang, Z. Zheng, and H. Zha. Time is of the essence: improving recency ranking using twitter data. In Proceedings of the 19th international conference on World wide web, WWW '10, 2010.
[9]
M. Efron. Information search and retrieval in microblogs. Journal of the American Society for Information Science and Technology (JASIST), 62(6):996--1008, 2011.
[10]
J. Gomide, A. Veloso, W. Meira, V. Almeida, F. Benevenuto, F. Ferraz, and M. Teixeira. Dengue surveillance based on a computational model of spatio-temporal locality of twitter. In Proceedings of the 3rd ACM Web Science Conference, WebSci '11, 2011.
[11]
N. Kanhabua, S. Romano, and A. Stewart. Identifying relevant temporal expressions for real-world events. In Proceedings of the SIGIR 2012 Workshop on Time-aware Information Access (TAIA '12), 2012.
[12]
N. Kanhabua, S. Romano, A. Stewart, and W. Nejdl. Supporting temporal analytics for health-related events in microblogs. In Proceedings of the 21st ACM international conference on Information and knowledge management, CIKM '12, 2012.
[13]
R. Nagmoti, A. Teredesai, and M. De Cock. Ranking approaches for microblog search. In Proceedings of 2010 IEEE/WIC/ACM International Conference on Web Intelligence, WI '10, 2010.
[14]
B. O'Connor, R. Balasubramanyan, B. R. Routledge, and N. A. Smith. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the Fourth International Conference on Weblogs and Social Media, ICWSM '10, 2010.
[15]
M. J. Paul and M. Dredze. You are what you tweet: Analyzing twitter for public health. In Proceedings of the Fifth International Conference on Weblogs and Social Media, ICWSM '11, 2011.
[16]
E. J. Ruiz, V. Hristidis, C. Castillo, A. Gionis, and A. Jaimes. Correlating financial time series with micro-blogging activity. In Proceedings of the fifth ACM international conference on Web search and data mining, WSDM '12, 2012.
[17]
T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: real-time event detection by social sensors. In Proceedings of the 19th international conference on World wide web, WWW '10, 2010.
[18]
M. Sofean, A. Stewart, K. Denecke, and M. Smith. Medical case-driven classification of microblogs: Characteristics and annotation. In Proceedings of SIGHIT International Health Informatics Symposium, IHI '12, 2012.
[19]
J. Strötgen and M. Gertz. Heideltime: High quality rule-based extraction and normalization of temporal expressions. In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval '10), 2010.
[20]
M. Szomszor, P. Kostkova, and E. de Quincey.#swineflu: Twitter predicts swine flu outbreak in 2009. In Proceedings of the Third International Conference on Electronic Healthcare, eHealth '10, 2010.
[21]
G. Tsatsaronis, I. Varlamis, and K. Nørvåg. Semafor: semantic document indexing using semantic forests. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, 2012.
[22]
A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe. Predicting elections with twitter: What 140 characters reveal about political sentiment. In Proceedings of the Fourth International Conference on Weblogs and Social Media, ICWSM '10, 2010.
[23]
S. Vieweg, A. L. Hughes, K. Starbird, and L. Palen. Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI '10, 2010.
[24]
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques, Second Edition. Morgan Kaufmann, 2005.

Cited By

View all
  • (2023)Machine Learning Framework for Analyzing Disaster-Tweets2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS)10.1109/ICISCoIS56541.2023.10100450(55-60)Online publication date: 9-Feb-2023
  • (2023)Dtweet: Disaster Tweet Analysis Using Deep Learning TechniquesSoft Computing for Security Applications10.1007/978-981-99-3608-3_23(331-343)Online publication date: 20-Jul-2023
  • (2022)Why does the president tweet this? Discovering reasons and contexts for politicians’ tweets from news articlesInformation Processing and Management: an International Journal10.1016/j.ipm.2022.10289259:3Online publication date: 1-May-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web
May 2013
1636 pages
ISBN:9781450320382
DOI:10.1145/2487788

Sponsors

  • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
  • CGIBR: Comite Gestor da Internet no Brazil

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. event detection
  2. outbreak events
  3. temporal diversity
  4. twitter
  5. web observatory

Qualifiers

  • Research-article

Conference

WWW '13
Sponsor:
  • NICBR
  • CGIBR
WWW '13: 22nd International World Wide Web Conference
May 13 - 17, 2013
Rio de Janeiro, Brazil

Acceptance Rates

WWW '13 Companion Paper Acceptance Rate 831 of 1,250 submissions, 66%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Machine Learning Framework for Analyzing Disaster-Tweets2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS)10.1109/ICISCoIS56541.2023.10100450(55-60)Online publication date: 9-Feb-2023
  • (2023)Dtweet: Disaster Tweet Analysis Using Deep Learning TechniquesSoft Computing for Security Applications10.1007/978-981-99-3608-3_23(331-343)Online publication date: 20-Jul-2023
  • (2022)Why does the president tweet this? Discovering reasons and contexts for politicians’ tweets from news articlesInformation Processing and Management: an International Journal10.1016/j.ipm.2022.10289259:3Online publication date: 1-May-2022
  • (2021)Multilingual Text Classification from Twitter during Emergencies2021 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE50685.2021.9427581(1-6)Online publication date: 10-Jan-2021
  • (2020)Learning from Tweets: Opportunities and Challenges to Inform Policy Making During Dengue EpidemicProceedings of the ACM on Human-Computer Interaction10.1145/33928754:CSCW1(1-27)Online publication date: 29-May-2020
  • (2019)Identifying Protective Health Behaviors on Twitter: Observational Study of Travel Advisories and Zika VirusJournal of Medical Internet Research10.2196/1309021:5(e13090)Online publication date: 13-May-2019
  • (2019)Using social media to geo-target emergency management effortsProceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management10.1145/3356998.3365769(1-4)Online publication date: 5-Nov-2019
  • (2019)Identifying disaster-related tweets and their semantic, spatial and temporal context using deep learning, natural language processing and spatial analysis: a case study of Hurricane IrmaInternational Journal of Digital Earth10.1080/17538947.2018.156321912:11(1205-1229)Online publication date: 3-Jan-2019
  • (2019)Embedding and predicting the event at early stageWorld Wide Web10.1007/s11280-018-0545-622:3(1055-1074)Online publication date: 1-May-2019
  • (2018)Addressing Selection Bias in Event Studies with General-Purpose Social Media PanelsJournal of Data and Information Quality10.1145/318504810:1(1-24)Online publication date: 29-May-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media