[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/319463.319654acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article
Free access

Video Manga: generating semantically meaningful video summaries

Published: 30 October 1999 Publication History

Abstract

This paper presents methods for automatically creating pictorial video summaries that resemble comic books. The relative importance of video segments is computed from their length and novelty. Image and audio analysis is used to automatically detect and emphasize meaningful events. Based on this importance measure, we choose relevant keyframes. Selected keyframes are sized by importance, and then efficiently packed into a pictorial summary. We present a quantitative measure of how well a summary captures the salient events in a video, and show how it can be used to improve our summaries. The result is a compact and visually pleasing summary that captures semantically important events, and is suitable for printing or Web access. Such a summary can be further enhanced by including text captions derived from OCR or other methods. We describe how the automatically generated summaries are used to simplify access to a large collection of videos.

References

[1]
Aigrain, P., Joly, P. and Longueville, V., "Medium Knowledge-Based Macro-Segmentation of Video into Sequences," Intelligent Multimedia Information Retrieval, AAAI Press/The MIT Press, pp. ! 59-173, 1997.
[2]
Arman, F., Depommier, R., Hsu, A. and Chiu, M.-Y., "Content-based Browsing of Video Sequences," in Proc. A CM Multimedia 94, San Francisco, October 1994, pp. 97-103.
[3]
Boreczky, J. and Rowe, L., "Comparison of Video Shot Boundary Detection Techniques," in Proc. SPIE Conference on Storage and Retrieval for Still Image and 'Video Databases IE, San Jose, CA, February, 1996, pp. t70-179.
[4]
Christal, M., Smith, M., Taylor,'C. and Winkler, D., "Evolving Video Skims into Useful Multimedia Abstractions," in Human Factors in Computing Systems, CHI 98 Conference Proceedings (Los Angeles, CA), New York: ACM, pp. 171-178, 1998.
[5]
Foote, J., Boreczky, J., Girgensohn, A. and Wilcox, L., "An Intelligent Media Browser using Automatic Multimodal Analysis," in Proc. A CM Multimedia '98, Bristol, England, pp. 375-380, 1988.
[6]
Foote, J., Boreczky, J., and Wilcox, L., "Finding Presentations in Recorded Meetings Using Audio and Video Features," in Proc. 1CASSP '99, Vol. 6, pp. 3045-3048, 1999.
[7]
Girgensohn, A. and Boreczky, J., "Time-Constrained Keyframe Selection Technique," in IEEE Multimedia Systems '99, IEEE Computer Society, Vol. 1, pp. 756- 761, 1999.
[8]
Girgensohn, A. and Foote, J., "Video Frame Classification Using Transform Coefficients," in Proc. ICASSP '99, Vol. 6, pp. 3045-3048, 1999.
[9]
Huang, Q., Liu, Z. and Rosenberg, A., "Automated Semantic Structure Reconstruction and Representation Generation for Broadcast News," in Proc. iS& T/SPIE Conference on Storage and Retrieval for Image and Video Databases VII, Vol. 3656, pp. 50-62, 1999.
[10]
ISO MPEG 7 Content Set, Item V20, "Korea's Pop Singers' Live Music Show", Korean Broadcasting System, 1998.
[11]
Pfeiffer, S., Lienhart, R., Fischer, S. and Effelsberg, W., "Abstracting digital movies automatically," in Journal of Visual Communication and Image Representation, 7(4), pp. 345-353, December 1996.
[12]
Rasmussen, E., Clustering Algorithms, In W. B. Frakes & R. Baeza-Yates (Eds.), Information Retrieval: Data Structures and Algorithms, Prentice Hall, pp. 419-442, 1992.
[13]
Shahraray, B. and Gibbon, D. C., "Automated Authoring of Hypermedia Documents of Video Programs," in Proc. A CM Multimedia 95, San Francisco, November, pp. 401-409, 1995.
[14]
Smith, M. and Kanade, T, "Video Skimming and Characterization through the Combination of image and Language Understanding Techniques," in Proc. Computer Vision and Pattern Recognition, pp. 775- 781, 1997.
[15]
Taniguchi, Y., Akutsu, A. and Tonomura, Y., "PanoramaExcerpts: Extracting and Packing Panoramas for Video Browsing," in Proc. A CM Multimedia 97, pp. 427-436, 1997.
[16]
Uchihashi, S. and Foote, J., "Summarizing Video Using a Shot Importance Measure and a Frame-Packing Algorithm," in Proc. ICASSP '99, Vol. 6, pp. 3041- 3044, 1999.
[17]
Yeo, B-L. and Yeung, M., "Classification, Simplification and Dynamic Visualization of Scene Transition Graphs for Video Browsing," in Proc. IS&T/SPIE Electronic Imaging 98: Storage and Retrieval for Image and Wdeo Databases VI.
[18]
Yeung, M. M., Yeo, B. L., Wolf, W. and Liu, B., "Video Browsing using Clustering and Scene Transitions on Compressed Sequences," in SPIE Vol. 2417 Multimedia Computing and Networking 1995, pp. 399-413, Feb. 1995.
[19]
Yeung, M. and Yeo, B-L., "Video Visualization for Compact Presentation and Fast Browsing of Pictorial Content," in IEEE Trans. Circuits and Svs. for Video Technology, Vol. 7, No. 5, pp. 771-785, Oct. 1997.
[20]
Yu, H., Clark, C., Malkin, R. and Waibel, A., "Experiments In Automatic Meeting Transcription Using JRTK," in Proc. ICASSP 98, pp. 921-924, 1998.
[21]
Zhang, H. J., Low, C. Y., Smoliar, S. W. and Wu, J. H., "Video Parsing, Retrieval and Browsing: An Integrated and Content-Based Solution," in Proc. A CM Multimedia 95, San Francisco, November 1995, pp. 15-24
[22]
Zhuang, Y., Rui, Y., Huang, T.S. and Mehrotra, S., "Adaptive Key Frame Extraction Using Unsupervised Clustering," in Proc. ICIP '98, Vol. I, pp. 866-870, 1998.

Cited By

View all
  • (2024)Sphere Window: Challenges and Opportunities of 360° Video in Collaborative Design Workshops.Proceedings of the 13th Nordic Conference on Human-Computer Interaction10.1145/3679318.3685407(1-13)Online publication date: 13-Oct-2024
  • (2024)Meeting Bridges: Designing Information Artifacts that Bridge from Synchronous Meetings to Asynchronous CollaborationProceedings of the ACM on Human-Computer Interaction10.1145/36373128:CSCW1(1-29)Online publication date: 26-Apr-2024
  • (2023)Query-based video summarization with multi-label classification networkMultimedia Tools and Applications10.1007/s11042-023-15126-182:24(37529-37549)Online publication date: 22-Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MULTIMEDIA '99: Proceedings of the seventh ACM international conference on Multimedia (Part 1)
October 1999
516 pages
ISBN:1581131518
DOI:10.1145/319463
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 1999

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. keyframe selection and layout
  2. video summarization and analysis

Qualifiers

  • Article

Conference

MM99: ACM Multimedia 1999
October 30 - November 5, 1999
Florida, Orlando, USA

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)192
  • Downloads (Last 6 weeks)33
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Sphere Window: Challenges and Opportunities of 360° Video in Collaborative Design Workshops.Proceedings of the 13th Nordic Conference on Human-Computer Interaction10.1145/3679318.3685407(1-13)Online publication date: 13-Oct-2024
  • (2024)Meeting Bridges: Designing Information Artifacts that Bridge from Synchronous Meetings to Asynchronous CollaborationProceedings of the ACM on Human-Computer Interaction10.1145/36373128:CSCW1(1-29)Online publication date: 26-Apr-2024
  • (2023)Query-based video summarization with multi-label classification networkMultimedia Tools and Applications10.1007/s11042-023-15126-182:24(37529-37549)Online publication date: 22-Mar-2023
  • (2022)Local Optimal-Oriented Pattern and Exponential Weighed-Jaya Optimization-Based Deep Convolutional Networks for Video SummarizationInternational Journal of Swarm Intelligence Research10.4018/IJSIR.30440313:3(1-21)Online publication date: 22-Jul-2022
  • (2022)Research on video summarization method based on convolutional neural networkInternational Conference on Neural Networks, Information, and Communication Engineering (NNICE 2022)10.1117/12.2639224(45)Online publication date: 15-Jul-2022
  • (2022)Interactive Data ComicsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.311484928:1(944-954)Online publication date: Jan-2022
  • (2021)VSumVis: Interactive Visual Understanding and Diagnosis of Video Summarization ModelACM Transactions on Intelligent Systems and Technology10.1145/345892812:4(1-28)Online publication date: 8-Jun-2021
  • (2020)Video Synopsis Based on Attention Mechanism and Local Transparent ProcessingIEEE Access10.1109/ACCESS.2020.2994613(1-1)Online publication date: 2020
  • (2019)Improving Early Navigation in Time-Lapse Video with Spread-Frame LoadingProceedings of the 2019 CHI Conference on Human Factors in Computing Systems10.1145/3290605.3300785(1-12)Online publication date: 2-May-2019
  • (2019)Generation of Personalized Video Summaries by Detecting Viewer’s Emotion using ElectroencephalographyJournal of Visual Communication and Image Representation10.1016/j.jvcir.2019.102672(102672)Online publication date: Oct-2019
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media