More Web Proxy on the site http://driver.im/

research-article

Dynamic video narratives

Authors:

Carlos D. Correa,

Kwan-Liu MaAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 29, Issue 4

Article No.: 88, Pages 1 - 9

https://doi.org/10.1145/1778765.1778825

Published: 26 July 2010 Publication History

Abstract

This paper presents a system for generating dynamic narratives from videos. These narratives are characterized for being compact, coherent and interactive, as inspired by principles of sequential art. Narratives depict the motion of one or several actors over time. Creating compact narratives is challenging as it is desired to combine the video frames in a way that reuses redundant backgrounds and depicts the stages of a motion. In addition, previous approaches focus on the generation of static summaries and can afford expensive image composition techniques. A dynamic narrative, on the other hand, must be played and skimmed in real-time, which imposes certain cost limitations in the video processing. In this paper, we define a novel process to compose foreground and background regions of video frames in a single interactive image using a series of spatio-temporal masks. These masks are created to improve the output of automatic video processing techniques such as image stitching and foreground segmentation. Unlike hand-drawn narratives, often limited to static representations, the proposed system allows users to explore the narrative dynamically and produce different representations of motion. We have built an authoring system that incorporates these methods and demonstrated successful results on a number of video clips. The authoring system can be used to create interactive posters of video clips, browse video in a compact manner or highlight a motion sequence in a movie.

Supplementary Material

JPG File (tp125-10.jpg)

Download
15.86 KB

Supplemental material. (088.zip)

Download
42.31 MB

MP4 File (tp125-10.mp4)

Download
63.12 MB

References

[1]

Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., and Cohen, M. 2004. Interactive digital photomontage. ACM Trans. Graph. 23, 3, 294--302.

Digital Library

[2]

Agarwala, A., Zheng, K. C., Pal, C., Agrawala, M., Cohen, M., Curless, B., Salesin, D., and Szeliski, R. 2005. Panoramic video textures. ACM Trans. Graph. 24, 3, 821--827.

Digital Library

[3]

Agarwala, A., Agrawala, M., Cohen, M., Salesin, D., and Szeliski, R. 2006. Photographing long scenes with multi-viewpoint panoramas. ACM Trans. Graph. 25, 3, 853--861.

Digital Library

[4]

Anderson, D. M. 1961. Elements of Design. Holt, Rinehart and Winston.

[5]

Aner, A., and Kender, J. R. 2002. Video summaries through mosaic-based shot and scene clustering. In ECCV '02: Proceedings of the 7th European Conference on Computer Vision-Part IV, 388--402.

Digital Library

[6]

Apple Corporation, 2009. iMovie. http://www.apple.com/ilife/imovie.

[7]

Assa, J., Caspi, Y., and Cohen-Or, D. 2005. Action synopsis: pose selection and illustration. ACM Trans. Graph. 24, 3, 667--676.

Digital Library

[8]

Barnes, C., Goldman, D. B., Shechtman, E., and Finkelstein, A. 2010. Video tapestries with continuous temporal zoom. ACM Transactions on Graphics 29, 3.

Digital Library

[9]

Bennett, E. P., and McMillan, L. 2007. Computational time-lapse video. In SIGGRAPH '07: ACM SIGGRAPH 2007 papers, 102.

Digital Library

[10]

Boreczky, J., Girgensohn, A., Golovchinsky, G., and Uchihashi, S. 2000. An interactive comic book presentation for exploring video. In CHI '00: Proc. SIGCHI conference on Human factors in computing systems, 185--192.

Digital Library

[11]

Boykov, Y., and Kolmogorov, V. 2004. An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. on Pattern Analysis and Machine Intelligence 26, 9, 1124--1137.

Digital Library

[12]

Brown, M., and Lowe, D. G. 2003. Recognising panoramas. In ICCV '03: Proc. Ninth IEEE International Conference on Computer Vision, 1218.

Digital Library

[13]

Caspi, Y., Axelrod, A., Matsushita, Y., and Gamliel, A. 2006. Dynamic stills and clip trailers. Vis. Comput. 22, 9, 642--652.

Digital Library

[14]

Chiu, P., Girgensohn, A., and Liu, Q. 2004. Stained-glass visualization for highly condensed video summaries. IEEE Conf. on Multimedia and Expo, 2004. ICME '04. 2004 3, 2059--2062.

[15]

Cutting, J. 2002. Representing motion in a static image: constraints and parallels in art, science, and popular culture. Perception 31, 1165--1193.

[16]

Eisner, W. 1985. Comics and Sequential Art. Poorhouse Press.

[17]

Forlines, C. 2008. Content aware video presentation on high-resolution displays. In AVI '08: Proceedings of the working conference on Advanced visual interfaces, 57--64.

Digital Library

[18]

Goldman, D. B., Curless, B., Salesin, D., and Seitz, S. M. 2006. Schematic storyboarding for video visualization and editing. ACM Trans. Graph. 25, 3, 862--871.

Digital Library

[19]

Granados, M., Seidel, H.-P., and Lensch, H. P. A. 2008. Background estimation from non-time sequence images. In GI '08: Proc. Graphics Interface 2008, 33--40.

Digital Library

[20]

Irani, M., and Anandan, P. 1998. Video indexing based on mosaic representations. Proc. of the IEEE 86, 5 (May), 905--921.

[21]

Kaewtrakulpong, P., and Bowden, R. 2001. An improved adaptive background mixture model for realtime tracking with shadow detection. In In Proc. 2nd European Workshop on Advanced Video Based Surveillance Systems, AVBS01, Kluwer Academic Publishers.

[22]

Kim, B., and Essa, I. 2005. Video-based nonphotorealistic and expressive illustration of motion. In CGI '05: Proc. Computer Graphics International 2005, 32--35.

Digital Library

[23]

Kwatra, V., Schödl, A., Essa, I., Turk, G., and Bobick, A. 2003. Graphcut textures: image and video synthesis using graph cuts. ACM Trans. Graph. 22, 3, 277--286.

Digital Library

[24]

Li, Y., Li, Y., Zhang, T., Zhang, T., Tretter, D., and Tretter, D. 2001. An overview of video abstraction techniques. Tech. rep., HP-2001-191, HP Laboratory.

[25]

Ma, Y.-F., Lu, L., Zhang, H.-J., and Li, M. 2002. A user attention model for video summarization. In Proc. tenth ACM international conference on Multimedia, 533--542.

Digital Library

[26]

McCloud, S. 1994. Understanding Comics. Perennial Currents.

[27]

Mei, T., Yang, B., Yang, S.-Q., and Hua, X.-S. 2009. Video collage: presenting a video sequence using a single image. The Visual Computer 25, 1, 39--51.

Digital Library

[28]

Pal, C., and Jojic, N. 2005. Interactive montages of sprites for indexing and summarizing security video. In CVPR '05: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, 1192.

Digital Library

[29]

Pritch, Y., Rav-Acha, A., and Peleg, S. 2008. Non-chronological video synopsis and indexing. IEEE Trans. Pattern Analysis and Machine Intelligence 30, 11, 1971--1984.

Digital Library

[30]

Rav-Acha, A., Pritch, Y., Lischinski, D., and Peleg, S. 2007. Dynamosaicing: Mosaicing of dynamic scenes. IEEE Trans. Pattern Anal. Mach. Intell. 29, 10, 1789--1801.

Digital Library

[31]

Rother, C., Kumar, S., Kolmogorov, V., and Blake, A. 2005. Digital tapestry. In CVPR '05: Proc. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1, 589--596.

Digital Library

[32]

Rother, C., Bordeaux, L., Hamadi, Y., and Blake, A. 2006. Autocollage. ACM Trans. Graph. 25, 3, 847--852.

Digital Library

[33]

Sawhney, H. S., and Ayer, S. 1996. Compact representations of videos through dominant and multiple motion estimation. IEEE Trans. Pattern Anal. Mach. Intell. 18, 8, 814--830.

Digital Library

[34]

Schmandt-Besserat, D. 2007. When Writing Met Art: From Symbol to Story. University of Texas Press.

[35]

Shum, H.-Y., and Szeliski, R. 1998. Construction and refinement of panoramic mosaics with global and local alignment. In ICCV '98: Proc. Sixth International Conference on Computer Vision, 953.

Digital Library

[36]

Simakov, D., Caspi, Y., Shechtman, E., and Irani, M. 2008. Summarizing visual data using bidirectional similarity. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1--8.

[37]

Taniguchi, Y., Akutsu, A., and Tonomura, Y. 1997. Panoramaexcerpts: extracting and packing panoramas for video browsing. In Proc. fifth ACM international conference on Multimedia, 427--436.

Digital Library

[38]

Teodosio, L., and Bender, W. 1993. Salient video stills: content and context preserved. In Proc. first ACM international conference on Multimedia, 39--46.

Digital Library

[39]

Tufte, E. R. 1990. Envisioning Information. Graphics Press, Cheshire, Connecticut.

Digital Library

[40]

Ueda, H., Miyatake, T., Sumino, S., and Nagasaka, A. 1993. Automatic structure visualization for video editing. In CHI '93: Proc. INTERACT '93 and CHI '93 conference on Human factors in computing systems, 137--141.

Digital Library

[41]

Wood, D. N., Finkelstein, A., Hughes, J. F., Thayer, C. E., and Salesin, D. H. 1997. Multiperspective panoramas for cel animation. In SIGGRAPH '97: Proc. 24th annual conference on Computer graphics and interactive techniques, 243--250.

Digital Library

[42]

Yang, B., Mei, T., Sun, L., Yang, S.-Q., and Hua, X.-S. 2008. Free-shaped video collage. In Lecture Notes in Computer Science, vol. 4903, 175--185.

Digital Library

[43]

Yeung, M., and Yeo, B.-L. 1997. Video visualization for compact presentation and fast browsing of pictorial content. IEEE Trans. on Circuits and Systems for Video Technology 7, 5 (Oct), 771--785.

Digital Library

Cited By

Chan CYuan CSun CChen H(2023)Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00712(7709-7719)Online publication date: 1-Oct-2023
https://doi.org/10.1109/ICCV51070.2023.00712
Wu MChiang YMusco C(2022)Streaming Approach to In Situ Selection of Key Time Steps for Time‐Varying Volume DataComputer Graphics Forum10.1111/cgf.1454241:3(309-320)Online publication date: 12-Aug-2022
https://doi.org/10.1111/cgf.14542
Li JLyu JSousa MBalakrishnan RTang AGrossman T(2021)Route Tapestries: Navigating 360° Virtual Tour Videos Using Slit-Scan VisualizationsThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474746(223-238)Online publication date: 10-Oct-2021
https://dl.acm.org/doi/10.1145/3472749.3474746
Show More Cited By

Index Terms

Dynamic video narratives
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Video summarization
  2. Computer graphics
    1. Graphics systems and interfaces

Recommendations

Dynamic video narratives
SIGGRAPH '10: ACM SIGGRAPH 2010 papers

This paper presents a system for generating dynamic narratives from videos. These narratives are characterized for being compact, coherent and interactive, as inspired by principles of sequential art. Narratives depict the motion of one or several ...
Interactive manipulation of large-scale crowd animation

Editing large-scale crowd animation is a daunting task due to the lack of an efficient manipulation method. This paper presents a novel cage-based editing method for large-scale crowd animation. The cage encloses animated characters and supports ...
Live Sketch: Video-driven Dynamic Deformation of Static Drawings
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems

Creating sketch animations using traditional tools requires special artistic skills, and is tedious even for trained professionals. To lower the barrier for creating sketch animations, we propose a new system, emphLive Sketch,</i> which allows ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 29, Issue 4

July 2010

942 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/1778765

Issue’s Table of Contents

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 July 2010

Published in TOG Volume 29, Issue 4

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

48
Total Citations
View Citations
1,873
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)1

Reflects downloads up to 14 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chan CYuan CSun CChen H(2023)Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00712(7709-7719)Online publication date: 1-Oct-2023
https://doi.org/10.1109/ICCV51070.2023.00712
Wu MChiang YMusco C(2022)Streaming Approach to In Situ Selection of Key Time Steps for Time‐Varying Volume DataComputer Graphics Forum10.1111/cgf.1454241:3(309-320)Online publication date: 12-Aug-2022
https://doi.org/10.1111/cgf.14542
Li JLyu JSousa MBalakrishnan RTang AGrossman T(2021)Route Tapestries: Navigating 360° Virtual Tour Videos Using Slit-Scan VisualizationsThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474746(223-238)Online publication date: 10-Oct-2021
https://dl.acm.org/doi/10.1145/3472749.3474746
Frey S(2020)Temporally Dense Exploration of Moving and Deforming ShapesComputer Graphics Forum10.1111/cgf.1409240:1(7-21)Online publication date: 7-Oct-2020
https://doi.org/10.1111/cgf.14092
Nie YLi ZZhang ZZhang QMa TSun H(2020)Collision-Free Video Synopsis Incorporating Object Speed and Size ChangesIEEE Transactions on Image Processing10.1109/TIP.2019.294254329(1465-1478)Online publication date: 2020
https://doi.org/10.1109/TIP.2019.2942543
Thomas SGupta SSubramanian V(2019)Context Driven Optimized Perceptual Video Summarization and RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2018.287318529:10(3132-3145)Online publication date: Oct-2019
https://doi.org/10.1109/TCSVT.2018.2873185
Wang MGuo SLiao MHe DChang JZhang J(2019)Action snapshot with single pose and viewpointThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-018-1479-935:4(507-520)Online publication date: 1-Apr-2019
https://dl.acm.org/doi/10.1007/s00371-018-1479-9
Tong CRoberts RBorgo RWalton SLaramee RWegba KLu AWang YQu HLuo QMa X(2018)Storytelling and Visualization: An Extended SurveyInformation10.3390/info90300659:3(65)Online publication date: 14-Mar-2018
https://doi.org/10.3390/info9030065
Chen GSander PNehab D(2018)The ReplateProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/32032051:1(1-14)Online publication date: 25-Jul-2018
https://dl.acm.org/doi/10.1145/3203205
Zhou BChiang Y(2018)Key Time Steps Selection for Large‐Scale Time‐Varying Volume Datasets Using an Information‐Theoretic StoryboardComputer Graphics Forum10.1111/cgf.1339937:3(37-49)Online publication date: 10-Jul-2018
https://doi.org/10.1111/cgf.13399
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents