[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1631272.1631295acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Scalable detection of partial near-duplicate videos by visual-temporal consistency

Published: 19 October 2009 Publication History

Abstract

Following the exponential growth of social media, there now exist huge repositories of videos online. Among the huge volumes of videos, there exist large numbers of near-duplicate videos. Most existing techniques either focus on the fast retrieval of full copies or near-duplicates, or consider localization in a heuristic manner. This paper considers the scalable detection and localization of partial near-duplicate videos by jointly considering visual similarity and temporal consistency. Temporal constraints are embedded into a network structure as directed edges. Through the structure, partial alignment is novelly converted into a network flow problem where highly efficient solutions exist. To precisely decide the boundaries of the overlapping segments, pair-wise constraints generated from keypoint matching can be added to the network to iteratively refine the localization result. We demonstrate the effectiveness of partial alignment for three different tasks. The first task links partial segments in full-length movies to videos crawled from YouTube. The second task performs fast web video search, while the third performs near-duplicate shot and copy detection. The experimental result demonstrates the effectiveness and efficiency of the proposed method compared to state-of-the-art techniques.

References

[1]
R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Reading, Massachusetts, 1993.
[2]
O. Chum, J. Philbin, M. Isard, and A. Zisserman. Scalable near identical image and shot detection. CIVR, 2007.
[3]
I. Dohring and R. Lienhart. Mining tv broadcasts for recurring video sequences. CIVR, 2009.
[4]
W. Dong, Z. Wang, M. Charikar, and K. Li. Efficiently matching sets of features with random histograms. ACM Multimedia, 2008.
[5]
D. Goldfarb and Z. Jin. An o(nm)-time network simplex algorithm for the shortest path problem. Operations Research, 47(3):445--448, 1999.
[6]
Hampapur, A. Bolle, M. Rudolf, and K.-H. Hyun. Comparison of sequence matching techniques for video copy detection. SPIE: Storage and Retrieval for Media Databases, 2001.
[7]
W. H. Hsu, L. S. Kennedy, and S.-F. Chang. Video search reranking through random walk over document-level context graph. ACM Multimedia, 2007.
[8]
X.-S. Hua and H.-J. Z. X. Chen. Robust video signature based on ordinal measure. ICIP, 2004.
[9]
H. Jegou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. ECCV, 2008.
[10]
Y.-G. Jiang and C.-W. Ngo. Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval. CVIU, 113(3):405--414, March 2009.
[11]
J.Law-To, A. Joly, and N. Boujemaa. Muscle-vcd-2007: a live benchmark for video copy detection. http://www.-rocq.inria.fr/media/civr/bench, 2007.
[12]
Y. Ke and R. Sukthankar. Pca-sift: A more distinctive representation for local image descriptors. CVPR, 2004.
[13]
Y. Ke, R. Sukthankar, and L. Huston. Efficient near duplicate detection and sub-image retrieval. ACM Multimedia, 2004.
[14]
L. Kennedy and S.-F. Chang. Internet image archaeology: Automatically tracing the manipulation history of photographs on the web. ACM Multimedia, 2008.
[15]
H. Kim, J. Lee, H. Liu, and D. Lee. Video linkage: Group based copied video detection. CIVR, 2008.
[16]
J. Law-To, O. Buisson, V. Gouet-Brunet, and N. Boujemaa. Robust voting algorithm based on labels of behavior for video copy detection. ACM Multimedia, 2006.
[17]
D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, 2004.
[18]
K. Milolajczyk and C. Schmid. Scale and affine invariant interest point detectors. IJCV, 60(1):63--86, 2004.
[19]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. ACM MIR, 2006.
[20]
H.-K. Tan, X. Wu, C.-W. Ngo, and W.-L. Zhao. Accelerating near-duplicate video matching by combining visual similarity and alignment distortion. ACM Multimedia, 2008.
[21]
K. Wagstaff, C. Cardie, S. Rogers, and S. Schroedl. Constrained k-means clustering with background knowledge. ICML, 2001.
[22]
X. Wu, A. G. Hauptman, and C.-W. Ngo. Practical elimination of near-duplicates from web search video. ACM Multimedia, 2007.
[23]
X. Wu, C.-W. Ngo, A. G. Hauptmann, and H.-K. Tan. Real-time near-duplicate elimination for web video search with content and context. TMM, 11(2):196--207, Feb. 2009.
[24]
X. Wu, M. Takimoto, S. Sato, and J. Adachi. Scene duplicate detection based on the pattern of discontinuities in feature point trajectories. ACM Multimedia, 2008.
[25]
D. Xu and S.-F. Chang. Video event recognition using kernel methods with multilevel temporal alignment. TPAMI, 30(11):1985--1997, 2008.
[26]
D. Zhang and S.-F. Chang. Detecting image near duplicate by stochastic attribute relational graph matching with learning. ACM Multimedia, 2004.
[27]
W.-L. Zhao and C.-W. Ngo. Scale-rotation invariant pattern entropy for keypoint--based near-duplicate detection. TIP, 18(2):412--423, 2009.
[28]
W.-L. Zhao, C.-W. Ngo, H.-K. Tan, and X. Wu. Near duplicate keyframe identificiation with interest point matching and pattern learning. TMM, 9(5):213--238, 2007.

Cited By

View all
  • (2024)Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video RetrievalProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681110(3828-3837)Online publication date: 28-Oct-2024
  • (2024)Enhancing Single-Frame Supervision for Better Temporal Action LocalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.338852130:6(2903-2915)Online publication date: Jun-2024
  • (2024)Research on Short Video Copyright Protection by Video Similarity Learning2024 International Conference on Interactive Intelligent Systems and Techniques (IIST)10.1109/IIST62526.2024.00072(530-535)Online publication date: 4-Mar-2024
  • Show More Cited By

Index Terms

  1. Scalable detection of partial near-duplicate videos by visual-temporal consistency

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '09: Proceedings of the 17th ACM international conference on Multimedia
    October 2009
    1202 pages
    ISBN:9781605586083
    DOI:10.1145/1631272
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 October 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. network flow
    2. partial near-duplicate
    3. temporal graph

    Qualifiers

    • Research-article

    Conference

    MM09
    Sponsor:
    MM09: ACM Multimedia Conference
    October 19 - 24, 2009
    Beijing, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)28
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 18 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video RetrievalProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681110(3828-3837)Online publication date: 28-Oct-2024
    • (2024)Enhancing Single-Frame Supervision for Better Temporal Action LocalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.338852130:6(2903-2915)Online publication date: Jun-2024
    • (2024)Research on Short Video Copyright Protection by Video Similarity Learning2024 International Conference on Interactive Intelligent Systems and Techniques (IIST)10.1109/IIST62526.2024.00072(530-535)Online publication date: 4-Mar-2024
    • (2024)The 2023 video similarity dataset and challengeComputer Vision and Image Understanding10.1016/j.cviu.2024.103997243(103997)Online publication date: Jun-2024
    • (2023)VERD: Emergence of Product-Based Video E-Commerce Retrieval Dataset from User’s PerspectiveSensors10.3390/s2301051323:1(513)Online publication date: 3-Jan-2023
    • (2023)Video Retrieval for Everyday Scenes With Common ObjectsProceedings of the 2023 ACM International Conference on Multimedia Retrieval10.1145/3591106.3592239(565-570)Online publication date: 12-Jun-2023
    • (2023)Video Infringement Detection via Feature Disentanglement and Mutual Information MaximizationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612002(144-152)Online publication date: 26-Oct-2023
    • (2023)A Near-Duplicate Video Cleaning Method Based on AFENet Adaptive Clustering2023 8th International Conference on Intelligent Computing and Signal Processing (ICSP)10.1109/ICSP58490.2023.10248727(689-695)Online publication date: 21-Apr-2023
    • (2023)3D-CSL: Self-Supervised 3D Context Similarity Learning for Near-Duplicate Video Retrieval2023 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP49359.2023.10222915(2880-2884)Online publication date: 8-Oct-2023
    • (2023)Self-Supervised Video Similarity Learning2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW59228.2023.00504(4756-4766)Online publication date: Jun-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media