research-article

Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

Authors:

I. Trancoso

IEEE Transactions on Circuits and Systems for Video Technology, Volume 21, Issue 8

Pages 1163 - 1177

https://doi.org/10.1109/TCSVT.2011.2138830

Published: 01 August 2011

Abstract

In this paper, a novel approach to video temporal decomposition into semantic units, termed scenes, is presented. In contrast to previous temporal segmentation approaches that employ mostly low-level visual or audiovisual features, we introduce a technique that jointly exploits low-level and high-level features automatically extracted from the visual and the auditory channel. This technique is built upon the well-known method of the scene transition graph (STG), first by introducing a new STG approximation that features reduced computational cost, and then by extending the unimodal STG-based temporal segmentation technique to a method for multimodal scene segmentation. The latter exploits, among others, the results of a large number of TRECVID-type trained visual concept detectors and audio event detectors, and is based on a probabilistic merging process that combines multiple individual STGs while at the same time diminishing the need for selecting and fine-tuning several STG construction parameters. The proposed approach is evaluated on three test datasets, comprising TRECVID documentary films, movies, and news-related videos, respectively. The experimental results demonstrate the improved performance of the proposed approach in comparison to other unimodal and multimodal techniques of the relevant literature and highlight the contribution of high-level audiovisual features toward improved video segmentation to scenes.
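The probabilistic merging idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's formulation: it assumes each individual STG has already produced a set of scene-boundary indices, estimates a per-boundary probability as the fraction of STGs that agree on it, and combines modalities by a weighted sum. The function names, the modality weights, and the threshold are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Set


def boundary_probabilities(stg_results: List[Set[int]], num_shots: int) -> List[float]:
    """Fraction of STGs that mark each shot boundary as a scene boundary.

    Boundary i lies between shot i and shot i + 1, so a video with
    num_shots shots has num_shots - 1 candidate boundaries.
    """
    if not stg_results:
        raise ValueError("at least one STG result is required")
    counts = [0] * (num_shots - 1)
    for boundaries in stg_results:
        for b in boundaries:
            if not 0 <= b < num_shots - 1:
                raise ValueError(f"boundary index {b} out of range")
            counts[b] += 1
    return [c / len(stg_results) for c in counts]


def merge_modalities(
    per_modality: Dict[str, List[Set[int]]],
    weights: Dict[str, float],
    num_shots: int,
    threshold: float = 0.5,
) -> List[int]:
    """Weighted combination of per-modality boundary probabilities.

    Assumption: a normalized linear combination followed by a fixed
    threshold; both are illustrative, not taken from the paper.
    """
    total = sum(weights[name] for name in per_modality)
    combined = [0.0] * (num_shots - 1)
    for name, results in per_modality.items():
        probs = boundary_probabilities(results, num_shots)
        w = weights[name] / total
        for i, p in enumerate(probs):
            combined[i] += w * p
    return [i for i, p in enumerate(combined) if p >= threshold]


if __name__ == "__main__":
    # Hypothetical outputs of several STGs built with different parameters.
    visual = [{1, 3}, {1}, {1, 3, 4}]
    audio = [{1}, {1, 2}]
    scenes = merge_modalities(
        {"visual": visual, "audio": audio},
        {"visual": 0.6, "audio": 0.4},
        num_shots=6,
        threshold=0.35,
    )
    print(scenes)
```

Voting across many STGs built with different parameter values is one way to reduce the dependence on any single parameter choice, which is the motivation the abstract gives for the merging step.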

ISSN: 1051-8215

Publisher

IEEE Press


