[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2647868.2655013acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
poster

Real-time summarization of user-generated videos based on semantic recognition

Published: 03 November 2014 Publication History

Abstract

User-generated contents play an important role in the Internet video-sharing activities. Techniques for summarizing the user-generated videos (UGVs) into short representative clips are useful in many applications. This paper introduces an approach for UGV summarization based on semantic recognition. Different from other types of videos like movies or broadcasting news, where the semantic contents may vary greatly across different shots, most UGVs have only a single long shot with relatively consistent high-level semantics. Therefore, a few semantically representative segments are generally sufficient for a UGV summary, which can be selected based on the distribution of semantic recognition scores. In addition, due to the poor shooting quality of many UGVs, factors such as camera shaking and lighting condition are also considered to achieve more pleasant summaries. Experiments on over 100 UGVs with both subjective and objective evaluations show that our approach clearly outperforms several alternative methods and is highly efficient. Using a regular laptop, it can produce a summary for a 2-minute video in just 10 seconds.

References

[1]
Y.-G. Jiang. SUPER: towards real-time event recognition in internet videos. In Proceedings of ICMR, 2012.
[2]
Y.-G. Jiang, G. Ye, S.-F. Chang, D. Ellis, and A. C. Loui. Consumer video understanding: A benchmark database and an evaluation of human and machine performance. In Proceedings of ICMR, 2011.
[3]
A. Khosla, R. Hamid, C.-J. Lin, and N. Sundaresan. Large-scale video summarization using web-image priors. In Proceedings of CVPR, 2013.
[4]
D. Liu, G. Hua, and T. Chen. A hierarchical visual model for video object summarization. IEEE TPAMI, 32(12):2178--2190, 2010.
[5]
Z. Lu and K. Grauman. Story-driven summarization for egocentric video. In Proceedings of CVPR, 2013.
[6]
Y. Luo and X. Tang. Photo and video quality evaluation: Focusing on the subject. In Proceedings of ECCV. 2008.
[7]
T. Mei, L.-X. Tang, J. Tang, and X.-S. Hua. Near-lossless semantic video summarization and its applications to video analysis. ACM TOMCCAP, 9(3):1--23, 2013.
[8]
C.-W. Ngo, Y.-F. Ma, and H.-J. Zhang. Video summarization and scene detection by graph modeling. IEEE TCSVT, 15(2):296--305, 2005.
[9]
M. Wang, R. Hong, G. Li, Z.-J. Zha, S. Yan, and T.-S. Chua. Event driven web video summarization by tag localization and key-shot identification. IEEE TMM, 14(4):975--985, 2012.
[10]
W.-Q. Yan and M. S. Kankanhalli. Detection and removal of lighting & shaking artifacts in home videos. In Proceedings of ACM Multimedia, 2002.
[11]
H. J. Zhang, J. Wu, D. Zhong, and S. W. Smoliar. An integrated system for content-based video retrieval and browsing. Pattern Recognition, 30(4):643--658, 1997.

Cited By

View all
  • (2024)Online Informative Sampling Using Semantic Features in Underwater EnvironmentsOCEANS 2024 - Singapore10.1109/OCEANS51537.2024.10682405(1-6)Online publication date: 15-Apr-2024
  • (2020)Query TwiceProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3414064(4023-4031)Online publication date: 12-Oct-2020
  • (2020)A Multi-Task Neural Approach for Emotion Attribution, Classification, and SummarizationIEEE Transactions on Multimedia10.1109/TMM.2019.292212922:1(148-159)Online publication date: 1-Jan-2020
  • Show More Cited By

Index Terms

  1. Real-time summarization of user-generated videos based on semantic recognition

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '14: Proceedings of the 22nd ACM international conference on Multimedia
    November 2014
    1310 pages
    ISBN:9781450330633
    DOI:10.1145/2647868
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. semantic recognition
    2. user-generated videos
    3. video summarization

    Qualifiers

    • Poster

    Funding Sources

    Conference

    MM '14
    Sponsor:
    MM '14: 2014 ACM Multimedia Conference
    November 3 - 7, 2014
    Florida, Orlando, USA

    Acceptance Rates

    MM '14 Paper Acceptance Rate 55 of 286 submissions, 19%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Online Informative Sampling Using Semantic Features in Underwater EnvironmentsOCEANS 2024 - Singapore10.1109/OCEANS51537.2024.10682405(1-6)Online publication date: 15-Apr-2024
    • (2020)Query TwiceProceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3414064(4023-4031)Online publication date: 12-Oct-2020
    • (2020)A Multi-Task Neural Approach for Emotion Attribution, Classification, and SummarizationIEEE Transactions on Multimedia10.1109/TMM.2019.292212922:1(148-159)Online publication date: 1-Jan-2020
    • (2019)Video SkimmingACM Computing Surveys10.1145/334771252:5(1-38)Online publication date: 13-Sep-2019
    • (2019)Stories That Big Danmaku Data Can Tell as a New MediaIEEE Access10.1109/ACCESS.2019.29090547(53509-53519)Online publication date: 2019
    • (2018)An Unsupervised Method to Extract Video Object via Complexity Awareness and Object Local PartsIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2017.268257828:7(1580-1594)Online publication date: Jul-2018
    • (2018)Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and SummarizationIEEE Transactions on Affective Computing10.1109/TAFFC.2016.26226909:2(255-270)Online publication date: 1-Apr-2018
    • (2018)Key-frame selection for automatic summarization of surveillance videosMachine Vision and Applications10.1007/s00138-018-0954-729:7(1101-1117)Online publication date: 1-Oct-2018
    • (2017)Frame-Transformer Emotion Classification NetworkProceedings of the 2017 ACM on International Conference on Multimedia Retrieval10.1145/3078971.3079030(78-83)Online publication date: 6-Jun-2017
    • (2017)Videography-Based Unconstrained Video AnalysisIEEE Transactions on Image Processing10.1109/TIP.2017.267880026:5(2261-2273)Online publication date: 1-May-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media