Stars
CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
This repository contains the main baselines introduced in WSSTG (ACL 2019).
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
This repository provides the dataset introduced by our WSSTG paper
Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining
Continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, and video moment retrieval.
My Reading Lists of Deep Learning and Natural Language Processing
Official PyTorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"
Assistant tools for attention visualization in deep learning
Video Visual Relation Detection (VidVRD) tracklet generation. Also for the ACM MM Visual Relation Understanding Grand Challenge
Signature verification package for learning representations from signature data and training user-dependent classifiers.
Recent Transformer-based CV and related works.
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
PyTorch-based implementation of Faster R-CNN
Inverse Discriminative Networks for Handwritten Signature Verification
A list of Human-Object Interaction Learning.
Implementation of "Pose-aware Multi-level Feature Network for Human Object Interaction Detection" (ICCV 2019 Oral)
PyTorch implementation of CartoonGAN (CVPR 2018)