[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1178677.1178710acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Matching slides to presentation videos using SIFT and scene background matching

Published: 26 October 2006 Publication History

Abstract

We present a general approach for automatically matching electronic slides to videos of corresponding presentations for use in distance learning and video proceedings of conferences. We deal with a large variety of videos, various frame compositions and color balances, arbitrary slides sequence and with dynamic cameras switching, pan, tilt and zoom. To achieve high accuracy, we develop a two-phases process with unsupervised scene background modelling. In the first phase, scale invariant feature transform (SIFT) keypoints are applied to frame to slide matching, under constraint projective transformation (constraint homography) using a random sample consensus (RANSAC). Successful first-phase matches are then used to automatically build a scene background model. In the second phase the background model is applied to the remaining unmatched frames to boost the matching performance for difficult cases such as wide field of view camera shots where the slide shows as a small portion of the frame. We also show that color correction is helpful when color-related similarity measures are used for identifying slides. We provide detailed quantitative experimentation results characterizing the effect of each part of our approach. The results show that our approach is robust and achieves high performance on matching slides to a number of videos with different styles.

References

[1]
G. D. Abowd, C. G. Atkeson, A. Feinstein, C. E. Hmelo, R. Kooper,S.Long, N. N. Sawhney,and M.Tani. Teaching and learning as multimedia authoring:The classroom 2000 project. In ACM Multimedia pages 187--198, 1996.
[2]
A. Amir, G. Ashour, and S. Srinivasan. Automatic generation of conference video proceedings. In Journal of Visual Communication and Image Representation, JVCI Special Issue on Multimedia Databases pages 467--488, 2004.
[3]
A. Behera, D. Lalanne, and R. Ingold. Looking at projected documents: Event detection document identification., 2004.
[4]
N. Christianini and J. Shawe. Support Vector machines and other kernel-based learning method Cambridge University Press, 2002.
[5]
B. Erol, J. J. Hull, and D. Lee. Linking multimedia presentations with their symbolic source documents: algorithm and applications. In ACM Multimedia pages 498--507, 2003.
[6]
S. Fathima, T. Mahmood. Indexing for topics in videos using foils. In IEEE Conference on Computer Vision and Pattern Recognition pages II: 312--319, 2000.
[7]
A. Fischler, M. and C. Bolles, R. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography.Comm. of the ACM 24:381--395, 1981.
[8]
R. M. Haralick and G. S. Linda.Computer and Robot Vision, Volume II Addison-Wesley, 1992.
[9]
R. Hartley and A. Zisserman. Multiple view and geometry in computer vision Cambridge University Press, 2002.
[10]
T. Liu, R. Hjelsvold, and R. Kender, J. Analysis and enhancement of videos of electronic slide presentations.IEEE International Conference on Multimedia and Expo (ICME) 2002.
[11]
D. Lowe. Distinctive image features from scale-invariant keypoints.International Journal of Computer Vision pages 91--110, 2004.
[12]
S. Mukhopadhyay and B.Smith. Passive capture and structuring of lectures.In ACM Multimedia (1)pages 477--487, 1999.
[13]
G. Pass, R. Zabih, and J. Miller. Comparing images using color coherence vectors. In ACM Multimedia pages 65--73, 1996.
[14]
L. A. Rowe and J. M. Gonzelez. Bmrc lecture browsers. In http://bmrc.berkekey.edu/frame/projects/lb/index.html
[15]
F. Wang, C.-W. Ngo, and T.-C. Pong. Synchronization of lecture videos and electronic slides by video text analysis. In ACM Multimedia pages 315--318, 2003.

Cited By

View all
  • (2024)SwapVid: Integrating Video Viewing and Document Exploration with Direct ManipulationProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642515(1-13)Online publication date: 11-May-2024
  • (2017)Sparse Time-Varying Graphs for Slide Transition Detection in Lecture VideosImage and Graphics10.1007/978-3-319-71607-7_50(567-576)Online publication date: 30-Dec-2017
  • (2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
  • Show More Cited By

Index Terms

  1. Matching slides to presentation videos using SIFT and scene background matching

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval
    October 2006
    344 pages
    ISBN:1595934952
    DOI:10.1145/1178677
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. RANSAC
    2. SIFT keypoints
    3. color correction
    4. electronic slides
    5. looseness +1 distance learning
    6. presentation videos
    7. video indexing

    Qualifiers

    • Article

    Conference

    MM06
    MM06: The 14th ACM International Conference on Multimedia 2006
    October 26 - 27, 2006
    California, Santa Barbara, USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 04 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)SwapVid: Integrating Video Viewing and Document Exploration with Direct ManipulationProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642515(1-13)Online publication date: 11-May-2024
    • (2017)Sparse Time-Varying Graphs for Slide Transition Detection in Lecture VideosImage and Graphics10.1007/978-3-319-71607-7_50(567-576)Online publication date: 30-Dec-2017
    • (2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
    • (2017)Adaptive News Video UploadingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_7(205-234)Online publication date: 1-Sep-2017
    • (2017)Lecture Video SegmentationMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_6(173-203)Online publication date: 1-Sep-2017
    • (2017)Soundtrack Recommendation for UGVsMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_5(139-171)Online publication date: 1-Sep-2017
    • (2017)Tag Recommendation and RankingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_4(101-138)Online publication date: 1-Sep-2017
    • (2017)Event UnderstandingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_3(59-99)Online publication date: 1-Sep-2017
    • (2017)Literature ReviewMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_2(31-57)Online publication date: 1-Sep-2017
    • (2017)IntroductionMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_1(1-30)Online publication date: 1-Sep-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media