More Web Proxy on the site http://driver.im/

research-article

"Picture the scene...";: Visually Summarising Social Media Events

Authors:

Philip J. McParlane,

Andrew James McMinn,

Joemon M. JoseAuthors Info & Claims

CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management

Pages 1459 - 1468

https://doi.org/10.1145/2661829.2661923

Published: 03 November 2014 Publication History

Abstract

Due to the advent of social media and web 2.0, we are faced with a deluge of information; recently, research efforts have focused on filtering out noisy, irrelevant information items from social media streams and in particular have attempted to automatically identify and summarise events. However, due to the heterogeneous nature of such social media streams, these efforts have not reached fruition. In this paper, we investigate how images can be used as a source for summarising events. Existing approaches have considered only textual summaries which are often poorly written, in a different language and slow to digest. Alternatively, images are "worth 1,000 words" and are able to quickly and easily convey an idea or scene. Since images in social media can also be noisy, irrelevant and repetitive, we propose new techniques for their automatic selection, ranking and presentation. We evaluate our approach on a recently created social media event data set containing 365k tweets and 50 events, for which we extend by collecting 625k related images. By conducting two crowdsourced evaluations, we firstly show how our approach overcomes the problems of automatically collecting relevant and diverse images from noisy microblog data, before highlighting the advantages of multimedia summarisation over text based approaches.

References

[1]

C. C. Aggarwal and K. Subbian. Event detection in social streams. In SIAM DM, 2012.

[2]

J. Allan, V. Lavrenko, and H. Jin. First story detection in TDT is hard. In ACM CIKM, 2000.

Digital Library

[3]

F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida. Detecting spammers on twitter. In CEAS, volume 6, 2010.

[4]

D. Chakrabarti and K. Punera. Event summarization using tweets. In AAAI ICWSM, 2011.

[5]

C. H. Chang, M. Kayed, M. Girgis, and K. Shaalan. A survey of web information extraction systems. In IEEE KDE, 2006.

Digital Library

[6]

O. Chum, J. Philbin, and A. Zisserman. Near duplicate image detection: min-hash and tf-idf weighting. In BMVC, 2008.

[7]

C. Clarke, N. Craswell, and I. Soboroff. Preliminary report on the trec 2009 web track. In TREC notebook, 2009.

[8]

C. L. Clarke, M. Kolla, G. V. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon. Novelty and diversity in information retrieval evaluation. In ACM SIGIR, 2008.

Digital Library

[9]

L. K. D. A. Shamma and E. Churchill. Statler: Summarizing media through short-message services. In ACM CSCW, 2010.

[10]

M. Del Fabro and L. Böszörmenyi. Summarization and presentation of real-life events using community-contributed content. In ACM MM, 2012.

Digital Library

[11]

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In IEEE CVPR, 2009.

[12]

Y. Duan, L. Jiang, T. Qin, M. Zhou, and H.-Y. Shum. An empirical study on learning to rank of tweets. In COLING, 2010.

Digital Library

[13]

C. Eickhoff and A. P. Vries. Increasing cheat robustness of crowdsourcing tasks. In Inf. Retr., volume 16, 2013.

Digital Library

[14]

O. Etzioni, M. Banko, S. Soderland, and D. S. Weld. Open information extraction from the web. In Commun. ACM, volume 51, 2008.

Digital Library

[15]

T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, and M. Dredze. Annotating named entities in twitter data with crowdsourcing. In NAACL HLT, 2010.

Digital Library

[16]

J. J. Foo and R. Sinha. Pruning sift for scalable near-duplicate image matching. In ACM ADC, 2007.

Digital Library

[17]

J. J. Foo, J. Zobel, R. Sinha, and S. M. M. Tahaghoghi. Detection of near-duplicate images for web search. In ACM CIVR, 2007.

Digital Library

[18]

M. Hirth, T. Hoßfeld, and P. Tran-Gia. Cheat-detection mechanisms for crowdsourcing. In Technical Report 474, University of Würzburg, 2010.

[19]

J. Hong and C. F. Baker. How good is the crowd at "real" wsd? In ACL LAW V, 2011.

Digital Library

[20]

A. P. D. V. J. Vuurens and C. Eickhoff. How much spam can you take? an analysis of crowdsourcing results to increase accuracy. In ACM SIGIR Workshop on Crowdsourcing for Information Retrieval, 2011.

[21]

D. Klein and C. D. Manning. Accurate unlexicalized parsing. In Annual Meeting on ACL, 2003.

Digital Library

[22]

B. Y.-L. Kuo, T. Hentrich, B. M. . Good, and M. D. Wilkinson. Tag clouds for summarizing web search results. In ACM WWW, 2007.

Digital Library

[23]

R. Long, H. Wang, Y. Chen, O. Jin, and Y. Yu. Towards effective event detection, tracking and summarization on microblog data. In WAIM, 2011.

Digital Library

[24]

D. G. Lowe. Distinctive image features from scale-invariant keypoints. In IJCV, volume 60, 2004.

Digital Library

[25]

A. Marcus, M. S. Bernstein, O. Badar, D. R. Karger, S. Madden, and R. C. Miller. Twitinfo: aggregating and visualizing microblogs for event exploration. In ACM SIGCHI, 2011.

Digital Library

[26]

A. J. McMinn, Y. Moshfeghi, and J. M. Jose. Building a large-scale corpus for evaluating event detection on twitter. In ACM CIKM, 2013.

Digital Library

[27]

P. J. McParlane and J. Jose. Exploiting twitter and wikipedia for the annotation of event images. In ACM SIGIR, 2014.

Digital Library

[28]

J. Nichols, J. Mahmud, and C. Drews. Summarizing sporting events using twitter. In ACM IUI, 2012.

Digital Library

[29]

S. Nowak and S. Rüger. How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation. In ACM MIR, 2010.

Digital Library

[30]

S. Ravikumar, R. Balakrishnan, and S. Kambhampati. Ranking tweets considering trust and relevance. In ACM IIWeb, 2012.

Digital Library

[31]

R. Rivest. The md5 message-digest algorithm. 1992.

[32]

M. Sahuguet and B. Huet. Socially motivated multimedia topic timeline summarization. In ACM SAM, 2013.

Digital Library

[33]

T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: real-time event detection by social sensors. In ACM WWW, 2010.

Digital Library

[34]

P. Salembier and T. Sikora. Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley & Sons, Inc., 2002.

Digital Library

[35]

J. Sankaranarayanan, H. Samet, B. E. Teitler, M. D. Lieberman, and J. Sperling. Twitterstand: news in tweets. In ACM SIGSPATIAL, 2009.

Digital Library

[36]

B. P. Sharifi, D. I. Inouye, and J. K. Kalita. Summarization of twitter microblogs. In BCS Computer Journal, 2013.

[37]

Z. Tang, Y. Dai, and X. Zhang. Perceptual hashing for color images using invariant moments. In Appl. Math, volume 6, 2012.

[38]

F. Wang and M. yen Kan. Npic: Hierarchical synthetic image classification using image search and generic features. In ACM CIVR, 2006.

Digital Library

[39]

J. Weng and B.-S. Lee. Event detection in twitter. In AAAI ICWSM, 2011.

[40]

J. Wood, J. Dykes, A. Slingsby, and K. Clarke. Interactive visual exploration of a large spatio-temporal dataset: Reflections on a geovisualization mashup. In IEEE TVCG, 2007.

Digital Library

[41]

A. Zubiaga, D. Spina, E. Amigó, and J. Gonzalo. Towards real-time summarization of scheduled events from twitter streams. In ACM HT, 2012.

Digital Library

Cited By

Yu ZTang ZLiang XZhang HSun RZhang X(2024)A novel image hashing with low-rank sparse matrix decomposition and feature distanceThe Visual Computer10.1007/s00371-024-03517-wOnline publication date: 13-Jun-2024
https://doi.org/10.1007/s00371-024-03517-w
Liang XTang ZHuang ZZhang XZhang S(2023)Efficient Hashing Method Using 2D-2D PCA for Image Copy DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313118835:4(3765-3778)Online publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1109/TKDE.2021.3131188
Aloufi SSaddik A(2022)MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting EventsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/346277718:1(1-25)Online publication date: 27-Jan-2022
https://dl.acm.org/doi/10.1145/3462777
Show More Cited By

Index Terms

"Picture the scene...";: Visually Summarising Social Media Events
1. Information systems
  1. Information retrieval

Recommendations

Celebrity's self-disclosure on Twitter and parasocial relationships

This study investigated how celebrities' self-disclosure on personal social media accounts, particularly Twitter, affects fans' perceptions. An online survey was utilized among a sample of 429 celebrity followers on Twitter. Results demonstrated that ...
Inspecting interactions: online news media synergies in social media
ASONAM '18: Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

The rising popularity of social media has radically changed the way news content is propagated, including interactive attempts with new dimensions. To date, traditional news media such as newspapers, television and radio have already adapted their ...
The emerging viewertariat in South Korea

Social networking sites (SNSs) represent Web 2.0 platforms or networking tools through which users can freely exchange ideas, opinions, experiences, and viewpoints and thus have considerable influence on the formation of political discourse. Despite the ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management

November 2014

2152 pages

ISBN:9781450325981

DOI:10.1145/2661829

General Chairs:
Jianzhong Li
Harbin Inst. of Technology
,
X. Sean Wang
Fudan University
,
Program Chairs:
Minos Garofalakis
Technical University of Crete, Greece
,
Ian Soboroff
National Institute of Standards, USA
,
Torsten Suel
New York University, USA
,
Min Wang
Google Research, USA

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Seventh Framework Programme

Conference

CIKM '14

Sponsor:

CIKM '14: 2014 ACM Conference on Information and Knowledge Management

November 3 - 7, 2014

Shanghai, China

Acceptance Rates

CIKM '14 Paper Acceptance Rate 175 of 838 submissions, 21%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

17
Total Citations
View Citations
385
Total Downloads

Downloads (Last 12 months)23
Downloads (Last 6 weeks)1

Reflects downloads up to 01 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Yu ZTang ZLiang XZhang HSun RZhang X(2024)A novel image hashing with low-rank sparse matrix decomposition and feature distanceThe Visual Computer10.1007/s00371-024-03517-wOnline publication date: 13-Jun-2024
https://doi.org/10.1007/s00371-024-03517-w
Liang XTang ZHuang ZZhang XZhang S(2023)Efficient Hashing Method Using 2D-2D PCA for Image Copy DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313118835:4(3765-3778)Online publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1109/TKDE.2021.3131188
Aloufi SSaddik A(2022)MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting EventsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/346277718:1(1-25)Online publication date: 27-Jan-2022
https://dl.acm.org/doi/10.1145/3462777
Liang XTang ZLi SYu CZhang X(2022)A novel hashing scheme via image feature map and 2D PCAIET Image Processing10.1049/ipr2.1255516:12(3225-3236)Online publication date: 14-Jun-2022
https://doi.org/10.1049/ipr2.12555
Marcelino GSemedo DMourão ABlasi SMagalhães JMrak MShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)Assisting News Media Editors with Cohesive Visual StorylinesProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475476(3257-3265)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475476
Tang ZYu ZLi ZYu CZhang X(2020)Robust Image Hashing with Low-Rank Representation and Ring PartitionWireless Communications & Mobile Computing10.1155/2020/88704672020Online publication date: 1-Jan-2020
https://dl.acm.org/doi/10.1155/2020/8870467
Marcelino GSemedo DMourão ABlasi SMrak MMagalhaes JEl Saddik ADel Bimbo AZhang ZHauptmann ACandan KBertini MXie LWei X(2019)A Benchmark of Visual Storytelling in Social MediaProceedings of the 2019 on International Conference on Multimedia Retrieval10.1145/3323873.3325047(324-328)Online publication date: 5-Jun-2019
https://dl.acm.org/doi/10.1145/3323873.3325047
Bischke BBorth DDengel AVrochidis SHuet BChang EKompatsiaris I(2019)Large‐Scale Social Multimedia AnalysisBig Data Analytics for Large‐Scale Multimedia Search10.1002/9781119376996.ch6(157-181)Online publication date: 15-Mar-2019
https://doi.org/10.1002/9781119376996.ch6
Marcelino GPinto RMagalhães JAizawa KLew MSatoh S(2018)Ranking News-Quality MultimediaProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206053(10-18)Online publication date: 5-Jun-2018
https://dl.acm.org/doi/10.1145/3206025.3206053
Ramesh Kashyap Avon der Weth CCheng ZKankanhalli MAkkermans HFontaine KVermeulen IHouben GWeber M(2018)EPICURE - Aspect-based Multimodal Review SummarizationProceedings of the 10th ACM Conference on Web Science10.1145/3201064.3202917(365-369)Online publication date: 15-May-2018
https://dl.acm.org/doi/10.1145/3201064.3202917
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents