[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Dynamic video narratives

Published: 26 July 2010 Publication History

Abstract

This paper presents a system for generating dynamic narratives from videos. These narratives are characterized for being compact, coherent and interactive, as inspired by principles of sequential art. Narratives depict the motion of one or several actors over time. Creating compact narratives is challenging as it is desired to combine the video frames in a way that reuses redundant backgrounds and depicts the stages of a motion. In addition, previous approaches focus on the generation of static summaries and can afford expensive image composition techniques. A dynamic narrative, on the other hand, must be played and skimmed in real-time, which imposes certain cost limitations in the video processing. In this paper, we define a novel process to compose foreground and background regions of video frames in a single interactive image using a series of spatio-temporal masks. These masks are created to improve the output of automatic video processing techniques such as image stitching and foreground segmentation. Unlike hand-drawn narratives, often limited to static representations, the proposed system allows users to explore the narrative dynamically and produce different representations of motion. We have built an authoring system that incorporates these methods and demonstrated successful results on a number of video clips. The authoring system can be used to create interactive posters of video clips, browse video in a compact manner or highlight a motion sequence in a movie.

Supplementary Material

JPG File (tp125-10.jpg)
Supplemental material. (088.zip)
MP4 File (tp125-10.mp4)

References

[1]
Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., and Cohen, M. 2004. Interactive digital photomontage. ACM Trans. Graph. 23, 3, 294--302.
[2]
Agarwala, A., Zheng, K. C., Pal, C., Agrawala, M., Cohen, M., Curless, B., Salesin, D., and Szeliski, R. 2005. Panoramic video textures. ACM Trans. Graph. 24, 3, 821--827.
[3]
Agarwala, A., Agrawala, M., Cohen, M., Salesin, D., and Szeliski, R. 2006. Photographing long scenes with multi-viewpoint panoramas. ACM Trans. Graph. 25, 3, 853--861.
[4]
Anderson, D. M. 1961. Elements of Design. Holt, Rinehart and Winston.
[5]
Aner, A., and Kender, J. R. 2002. Video summaries through mosaic-based shot and scene clustering. In ECCV '02: Proceedings of the 7th European Conference on Computer Vision-Part IV, 388--402.
[6]
Apple Corporation, 2009. iMovie. http://www.apple.com/ilife/imovie.
[7]
Assa, J., Caspi, Y., and Cohen-Or, D. 2005. Action synopsis: pose selection and illustration. ACM Trans. Graph. 24, 3, 667--676.
[8]
Barnes, C., Goldman, D. B., Shechtman, E., and Finkelstein, A. 2010. Video tapestries with continuous temporal zoom. ACM Transactions on Graphics 29, 3.
[9]
Bennett, E. P., and McMillan, L. 2007. Computational time-lapse video. In SIGGRAPH '07: ACM SIGGRAPH 2007 papers, 102.
[10]
Boreczky, J., Girgensohn, A., Golovchinsky, G., and Uchihashi, S. 2000. An interactive comic book presentation for exploring video. In CHI '00: Proc. SIGCHI conference on Human factors in computing systems, 185--192.
[11]
Boykov, Y., and Kolmogorov, V. 2004. An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. on Pattern Analysis and Machine Intelligence 26, 9, 1124--1137.
[12]
Brown, M., and Lowe, D. G. 2003. Recognising panoramas. In ICCV '03: Proc. Ninth IEEE International Conference on Computer Vision, 1218.
[13]
Caspi, Y., Axelrod, A., Matsushita, Y., and Gamliel, A. 2006. Dynamic stills and clip trailers. Vis. Comput. 22, 9, 642--652.
[14]
Chiu, P., Girgensohn, A., and Liu, Q. 2004. Stained-glass visualization for highly condensed video summaries. IEEE Conf. on Multimedia and Expo, 2004. ICME '04. 2004 3, 2059--2062.
[15]
Cutting, J. 2002. Representing motion in a static image: constraints and parallels in art, science, and popular culture. Perception 31, 1165--1193.
[16]
Eisner, W. 1985. Comics and Sequential Art. Poorhouse Press.
[17]
Forlines, C. 2008. Content aware video presentation on high-resolution displays. In AVI '08: Proceedings of the working conference on Advanced visual interfaces, 57--64.
[18]
Goldman, D. B., Curless, B., Salesin, D., and Seitz, S. M. 2006. Schematic storyboarding for video visualization and editing. ACM Trans. Graph. 25, 3, 862--871.
[19]
Granados, M., Seidel, H.-P., and Lensch, H. P. A. 2008. Background estimation from non-time sequence images. In GI '08: Proc. Graphics Interface 2008, 33--40.
[20]
Irani, M., and Anandan, P. 1998. Video indexing based on mosaic representations. Proc. of the IEEE 86, 5 (May), 905--921.
[21]
Kaewtrakulpong, P., and Bowden, R. 2001. An improved adaptive background mixture model for realtime tracking with shadow detection. In In Proc. 2nd European Workshop on Advanced Video Based Surveillance Systems, AVBS01, Kluwer Academic Publishers.
[22]
Kim, B., and Essa, I. 2005. Video-based nonphotorealistic and expressive illustration of motion. In CGI '05: Proc. Computer Graphics International 2005, 32--35.
[23]
Kwatra, V., Schödl, A., Essa, I., Turk, G., and Bobick, A. 2003. Graphcut textures: image and video synthesis using graph cuts. ACM Trans. Graph. 22, 3, 277--286.
[24]
Li, Y., Li, Y., Zhang, T., Zhang, T., Tretter, D., and Tretter, D. 2001. An overview of video abstraction techniques. Tech. rep., HP-2001-191, HP Laboratory.
[25]
Ma, Y.-F., Lu, L., Zhang, H.-J., and Li, M. 2002. A user attention model for video summarization. In Proc. tenth ACM international conference on Multimedia, 533--542.
[26]
McCloud, S. 1994. Understanding Comics. Perennial Currents.
[27]
Mei, T., Yang, B., Yang, S.-Q., and Hua, X.-S. 2009. Video collage: presenting a video sequence using a single image. The Visual Computer 25, 1, 39--51.
[28]
Pal, C., and Jojic, N. 2005. Interactive montages of sprites for indexing and summarizing security video. In CVPR '05: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, 1192.
[29]
Pritch, Y., Rav-Acha, A., and Peleg, S. 2008. Non-chronological video synopsis and indexing. IEEE Trans. Pattern Analysis and Machine Intelligence 30, 11, 1971--1984.
[30]
Rav-Acha, A., Pritch, Y., Lischinski, D., and Peleg, S. 2007. Dynamosaicing: Mosaicing of dynamic scenes. IEEE Trans. Pattern Anal. Mach. Intell. 29, 10, 1789--1801.
[31]
Rother, C., Kumar, S., Kolmogorov, V., and Blake, A. 2005. Digital tapestry. In CVPR '05: Proc. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1, 589--596.
[32]
Rother, C., Bordeaux, L., Hamadi, Y., and Blake, A. 2006. Autocollage. ACM Trans. Graph. 25, 3, 847--852.
[33]
Sawhney, H. S., and Ayer, S. 1996. Compact representations of videos through dominant and multiple motion estimation. IEEE Trans. Pattern Anal. Mach. Intell. 18, 8, 814--830.
[34]
Schmandt-Besserat, D. 2007. When Writing Met Art: From Symbol to Story. University of Texas Press.
[35]
Shum, H.-Y., and Szeliski, R. 1998. Construction and refinement of panoramic mosaics with global and local alignment. In ICCV '98: Proc. Sixth International Conference on Computer Vision, 953.
[36]
Simakov, D., Caspi, Y., Shechtman, E., and Irani, M. 2008. Summarizing visual data using bidirectional similarity. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1--8.
[37]
Taniguchi, Y., Akutsu, A., and Tonomura, Y. 1997. Panoramaexcerpts: extracting and packing panoramas for video browsing. In Proc. fifth ACM international conference on Multimedia, 427--436.
[38]
Teodosio, L., and Bender, W. 1993. Salient video stills: content and context preserved. In Proc. first ACM international conference on Multimedia, 39--46.
[39]
Tufte, E. R. 1990. Envisioning Information. Graphics Press, Cheshire, Connecticut.
[40]
Ueda, H., Miyatake, T., Sumino, S., and Nagasaka, A. 1993. Automatic structure visualization for video editing. In CHI '93: Proc. INTERACT '93 and CHI '93 conference on Human factors in computing systems, 137--141.
[41]
Wood, D. N., Finkelstein, A., Hughes, J. F., Thayer, C. E., and Salesin, D. H. 1997. Multiperspective panoramas for cel animation. In SIGGRAPH '97: Proc. 24th annual conference on Computer graphics and interactive techniques, 243--250.
[42]
Yang, B., Mei, T., Sun, L., Yang, S.-Q., and Hua, X.-S. 2008. Free-shaped video collage. In Lecture Notes in Computer Science, vol. 4903, 175--185.
[43]
Yeung, M., and Yeo, B.-L. 1997. Video visualization for compact presentation and fast browsing of pictorial content. IEEE Trans. on Circuits and Systems for Video Technology 7, 5 (Oct), 771--785.

Cited By

View all
  • (2023)Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00712(7709-7719)Online publication date: 1-Oct-2023
  • (2022)Streaming Approach to In Situ Selection of Key Time Steps for Time‐Varying Volume DataComputer Graphics Forum10.1111/cgf.1454241:3(309-320)Online publication date: 12-Aug-2022
  • (2021)Route Tapestries: Navigating 360° Virtual Tour Videos Using Slit-Scan VisualizationsThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474746(223-238)Online publication date: 10-Oct-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics
ACM Transactions on Graphics  Volume 29, Issue 4
July 2010
942 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/1778765
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 July 2010
Published in TOG Volume 29, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph-cut optimization
  2. image compositing
  3. interactive editing
  4. motion extraction
  5. video exploration

Qualifiers

  • Research-article

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00712(7709-7719)Online publication date: 1-Oct-2023
  • (2022)Streaming Approach to In Situ Selection of Key Time Steps for Time‐Varying Volume DataComputer Graphics Forum10.1111/cgf.1454241:3(309-320)Online publication date: 12-Aug-2022
  • (2021)Route Tapestries: Navigating 360° Virtual Tour Videos Using Slit-Scan VisualizationsThe 34th Annual ACM Symposium on User Interface Software and Technology10.1145/3472749.3474746(223-238)Online publication date: 10-Oct-2021
  • (2020)Temporally Dense Exploration of Moving and Deforming ShapesComputer Graphics Forum10.1111/cgf.1409240:1(7-21)Online publication date: 7-Oct-2020
  • (2020)Collision-Free Video Synopsis Incorporating Object Speed and Size ChangesIEEE Transactions on Image Processing10.1109/TIP.2019.294254329(1465-1478)Online publication date: 2020
  • (2019)Context Driven Optimized Perceptual Video Summarization and RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2018.287318529:10(3132-3145)Online publication date: Oct-2019
  • (2019)Action snapshot with single pose and viewpointThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-018-1479-935:4(507-520)Online publication date: 1-Apr-2019
  • (2018)Storytelling and Visualization: An Extended SurveyInformation10.3390/info90300659:3(65)Online publication date: 14-Mar-2018
  • (2018)The ReplateProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/32032051:1(1-14)Online publication date: 25-Jul-2018
  • (2018)Key Time Steps Selection for Large‐Scale Time‐Varying Volume Datasets Using an Information‐Theoretic StoryboardComputer Graphics Forum10.1111/cgf.1339937:3(37-49)Online publication date: 10-Jul-2018
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media