[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2661829.2661923acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

"Picture the scene...";: Visually Summarising Social Media Events

Published: 03 November 2014 Publication History

Abstract

Due to the advent of social media and web 2.0, we are faced with a deluge of information; recently, research efforts have focused on filtering out noisy, irrelevant information items from social media streams and in particular have attempted to automatically identify and summarise events. However, due to the heterogeneous nature of such social media streams, these efforts have not reached fruition. In this paper, we investigate how images can be used as a source for summarising events. Existing approaches have considered only textual summaries which are often poorly written, in a different language and slow to digest. Alternatively, images are "worth 1,000 words" and are able to quickly and easily convey an idea or scene. Since images in social media can also be noisy, irrelevant and repetitive, we propose new techniques for their automatic selection, ranking and presentation. We evaluate our approach on a recently created social media event data set containing 365k tweets and 50 events, for which we extend by collecting 625k related images. By conducting two crowdsourced evaluations, we firstly show how our approach overcomes the problems of automatically collecting relevant and diverse images from noisy microblog data, before highlighting the advantages of multimedia summarisation over text based approaches.

References

[1]
C. C. Aggarwal and K. Subbian. Event detection in social streams. In SIAM DM, 2012.
[2]
J. Allan, V. Lavrenko, and H. Jin. First story detection in TDT is hard. In ACM CIKM, 2000.
[3]
F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida. Detecting spammers on twitter. In CEAS, volume 6, 2010.
[4]
D. Chakrabarti and K. Punera. Event summarization using tweets. In AAAI ICWSM, 2011.
[5]
C. H. Chang, M. Kayed, M. Girgis, and K. Shaalan. A survey of web information extraction systems. In IEEE KDE, 2006.
[6]
O. Chum, J. Philbin, and A. Zisserman. Near duplicate image detection: min-hash and tf-idf weighting. In BMVC, 2008.
[7]
C. Clarke, N. Craswell, and I. Soboroff. Preliminary report on the trec 2009 web track. In TREC notebook, 2009.
[8]
C. L. Clarke, M. Kolla, G. V. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon. Novelty and diversity in information retrieval evaluation. In ACM SIGIR, 2008.
[9]
L. K. D. A. Shamma and E. Churchill. Statler: Summarizing media through short-message services. In ACM CSCW, 2010.
[10]
M. Del Fabro and L. Böszörmenyi. Summarization and presentation of real-life events using community-contributed content. In ACM MM, 2012.
[11]
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In IEEE CVPR, 2009.
[12]
Y. Duan, L. Jiang, T. Qin, M. Zhou, and H.-Y. Shum. An empirical study on learning to rank of tweets. In COLING, 2010.
[13]
C. Eickhoff and A. P. Vries. Increasing cheat robustness of crowdsourcing tasks. In Inf. Retr., volume 16, 2013.
[14]
O. Etzioni, M. Banko, S. Soderland, and D. S. Weld. Open information extraction from the web. In Commun. ACM, volume 51, 2008.
[15]
T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, and M. Dredze. Annotating named entities in twitter data with crowdsourcing. In NAACL HLT, 2010.
[16]
J. J. Foo and R. Sinha. Pruning sift for scalable near-duplicate image matching. In ACM ADC, 2007.
[17]
J. J. Foo, J. Zobel, R. Sinha, and S. M. M. Tahaghoghi. Detection of near-duplicate images for web search. In ACM CIVR, 2007.
[18]
M. Hirth, T. Hoßfeld, and P. Tran-Gia. Cheat-detection mechanisms for crowdsourcing. In Technical Report 474, University of Würzburg, 2010.
[19]
J. Hong and C. F. Baker. How good is the crowd at "real" wsd? In ACL LAW V, 2011.
[20]
A. P. D. V. J. Vuurens and C. Eickhoff. How much spam can you take? an analysis of crowdsourcing results to increase accuracy. In ACM SIGIR Workshop on Crowdsourcing for Information Retrieval, 2011.
[21]
D. Klein and C. D. Manning. Accurate unlexicalized parsing. In Annual Meeting on ACL, 2003.
[22]
B. Y.-L. Kuo, T. Hentrich, B. M. . Good, and M. D. Wilkinson. Tag clouds for summarizing web search results. In ACM WWW, 2007.
[23]
R. Long, H. Wang, Y. Chen, O. Jin, and Y. Yu. Towards effective event detection, tracking and summarization on microblog data. In WAIM, 2011.
[24]
D. G. Lowe. Distinctive image features from scale-invariant keypoints. In IJCV, volume 60, 2004.
[25]
A. Marcus, M. S. Bernstein, O. Badar, D. R. Karger, S. Madden, and R. C. Miller. Twitinfo: aggregating and visualizing microblogs for event exploration. In ACM SIGCHI, 2011.
[26]
A. J. McMinn, Y. Moshfeghi, and J. M. Jose. Building a large-scale corpus for evaluating event detection on twitter. In ACM CIKM, 2013.
[27]
P. J. McParlane and J. Jose. Exploiting twitter and wikipedia for the annotation of event images. In ACM SIGIR, 2014.
[28]
J. Nichols, J. Mahmud, and C. Drews. Summarizing sporting events using twitter. In ACM IUI, 2012.
[29]
S. Nowak and S. Rüger. How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation. In ACM MIR, 2010.
[30]
S. Ravikumar, R. Balakrishnan, and S. Kambhampati. Ranking tweets considering trust and relevance. In ACM IIWeb, 2012.
[31]
R. Rivest. The md5 message-digest algorithm. 1992.
[32]
M. Sahuguet and B. Huet. Socially motivated multimedia topic timeline summarization. In ACM SAM, 2013.
[33]
T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: real-time event detection by social sensors. In ACM WWW, 2010.
[34]
P. Salembier and T. Sikora. Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley & Sons, Inc., 2002.
[35]
J. Sankaranarayanan, H. Samet, B. E. Teitler, M. D. Lieberman, and J. Sperling. Twitterstand: news in tweets. In ACM SIGSPATIAL, 2009.
[36]
B. P. Sharifi, D. I. Inouye, and J. K. Kalita. Summarization of twitter microblogs. In BCS Computer Journal, 2013.
[37]
Z. Tang, Y. Dai, and X. Zhang. Perceptual hashing for color images using invariant moments. In Appl. Math, volume 6, 2012.
[38]
F. Wang and M. yen Kan. Npic: Hierarchical synthetic image classification using image search and generic features. In ACM CIVR, 2006.
[39]
J. Weng and B.-S. Lee. Event detection in twitter. In AAAI ICWSM, 2011.
[40]
J. Wood, J. Dykes, A. Slingsby, and K. Clarke. Interactive visual exploration of a large spatio-temporal dataset: Reflections on a geovisualization mashup. In IEEE TVCG, 2007.
[41]
A. Zubiaga, D. Spina, E. Amigó, and J. Gonzalo. Towards real-time summarization of scheduled events from twitter streams. In ACM HT, 2012.

Cited By

View all
  • (2024)A novel image hashing with low-rank sparse matrix decomposition and feature distanceThe Visual Computer10.1007/s00371-024-03517-wOnline publication date: 13-Jun-2024
  • (2023)Efficient Hashing Method Using 2D-2D PCA for Image Copy DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313118835:4(3765-3778)Online publication date: 1-Apr-2023
  • (2022)MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting EventsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/346277718:1(1-25)Online publication date: 27-Jan-2022
  • Show More Cited By

Index Terms

  1. "Picture the scene...";: Visually Summarising Social Media Events

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
    November 2014
    2152 pages
    ISBN:9781450325981
    DOI:10.1145/2661829
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. image diversification
    2. near duplicate detection
    3. social media
    4. twitter
    5. visual event summarisation

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    CIKM '14
    Sponsor:

    Acceptance Rates

    CIKM '14 Paper Acceptance Rate 175 of 838 submissions, 21%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A novel image hashing with low-rank sparse matrix decomposition and feature distanceThe Visual Computer10.1007/s00371-024-03517-wOnline publication date: 13-Jun-2024
    • (2023)Efficient Hashing Method Using 2D-2D PCA for Image Copy DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313118835:4(3765-3778)Online publication date: 1-Apr-2023
    • (2022)MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting EventsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/346277718:1(1-25)Online publication date: 27-Jan-2022
    • (2022)A novel hashing scheme via image feature map and 2D PCAIET Image Processing10.1049/ipr2.1255516:12(3225-3236)Online publication date: 14-Jun-2022
    • (2021)Assisting News Media Editors with Cohesive Visual StorylinesProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475476(3257-3265)Online publication date: 17-Oct-2021
    • (2020)Robust Image Hashing with Low-Rank Representation and Ring PartitionWireless Communications & Mobile Computing10.1155/2020/88704672020Online publication date: 1-Jan-2020
    • (2019)A Benchmark of Visual Storytelling in Social MediaProceedings of the 2019 on International Conference on Multimedia Retrieval10.1145/3323873.3325047(324-328)Online publication date: 5-Jun-2019
    • (2019)Large‐Scale Social Multimedia AnalysisBig Data Analytics for Large‐Scale Multimedia Search10.1002/9781119376996.ch6(157-181)Online publication date: 15-Mar-2019
    • (2018)Ranking News-Quality MultimediaProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206053(10-18)Online publication date: 5-Jun-2018
    • (2018)EPICURE - Aspect-based Multimodal Review SummarizationProceedings of the 10th ACM Conference on Web Science10.1145/3201064.3202917(365-369)Online publication date: 15-May-2018
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media