Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2024
Cross-Scenario Unknown-Aware Face Anti-Spoofing With Evidential Semantic Consistency Learning
IEEE Transactions on Information Forensics and Security (TIFS), Volume 19Pages 3093–3108https://doi.org/10.1109/TIFS.2024.3356234In recent years, domain adaptation techniques have been widely used to adapt face anti-spoofing models to a cross-scenario target domain. Most previous methods assume that the Presentation Attack Instruments (PAIs) in such cross-scenario target domains ...
- research-articleOctober 2023
Chain-of-Look Prompting for Verb-centric Surgical Triplet Recognition in Endoscopic Videos
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 5007–5016https://doi.org/10.1145/3581783.3611898Surgical triplet recognition aims to recognize surgical activities as triplets (i.e.,<instrument, verb, target >), which provides fine-grained information essential for surgical scene understanding. Existing methods for surgical triplet recognition rely ...
- research-articleApril 2023
Improving Product Search with Season-Aware Query-Product Semantic Similarity
WWW '23 Companion: Companion Proceedings of the ACM Web Conference 2023Pages 864–868https://doi.org/10.1145/3543873.3587625Product search for online shopping should be season-aware, i.e., presenting seasonally relevant products to customers. In this paper, we propose a simple yet effective solution to improve seasonal relevance in product search by incorporating seasonality ...
- research-articleJanuary 2021
3D Object Representation Learning: A Set-to-Set Matching Perspective
IEEE Transactions on Image Processing (TIP), Volume 30Pages 2168–2179https://doi.org/10.1109/TIP.2021.3049968In this paper, we tackle the 3D object representation learning from the perspective of set-to-set matching. Given two 3D objects, calculating their similarity is formulated as the problem of set-to-set similarity measurement between two set of local ...
-
- research-articleOctober 2020
HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation
MM '20: Proceedings of the 28th ACM International Conference on MultimediaPages 3136–3145https://doi.org/10.1145/3394171.3413775As we use our hands frequently in daily activities, the analysis of hand-object interactions plays a critical role to many multimedia understanding and interaction applications. Different from conventional 3D hand-only and object-only pose estimation, ...
- research-articleSeptember 2020
Product Quantization Network for Fast Visual Search
International Journal of Computer Vision (IJCV), Volume 128, Issue 8-9Pages 2325–2343https://doi.org/10.1007/s11263-020-01326-xAbstractProduct quantization has been widely used in fast image retrieval due to its effectiveness of coding high-dimensional visual features. By constructing the approximation function, we extend the hard-assignment quantization to soft-assignment ...
- research-articleFebruary 2020
Detecting spatiotemporal irregularities in videos via a 3D convolutional autoencoder
Journal of Visual Communication and Image Representation (JVCIR), Volume 67, Issue Chttps://doi.org/10.1016/j.jvcir.2019.102747AbstractSpatiotemporal irregularities (i.e., the uncommon appearance and motion patterns) in videos are difficult to detect, as they are usually not well defined and appear rarely in videos. We tackle this problem by learning normal patterns ...
- research-articleMay 2019
Boosting Positive and Unlabeled Learning for Anomaly Detection With Multi-Features
IEEE Transactions on Multimedia (TOM), Volume 21, Issue 5Pages 1332–1344https://doi.org/10.1109/TMM.2018.2871421One of the key challenges of machine learning-based anomaly detection relies on the difficulty of obtaining anomaly data for training, which is usually rare, diversely distributed, and difficult to collect. To address this challenge, we formulate anomaly ...
- research-articleFebruary 2018
Tensorized projection for high-dimensional binary embedding
Embedding high-dimensional visual features (d-dimensional) to binary codes (b-dimensional) has shown advantages in various vision tasks such as object recognition and image retrieval. Meanwhile, recent works have demonstrated that to fully utilize the ...
- research-articleFebruary 2018
Distributed composite quantization
Approximate nearest neighbor (ANN) search is a fundamental problem in computer vision, machine learning and information retrieval. Recently, quantization-based methods have drawn a lot of attention due to their superior accuracy and comparable efficiency ...
- ArticleAugust 2017
Is my object in this video? reconstruction-based object search in videos
IJCAI'17: Proceedings of the 26th International Joint Conference on Artificial IntelligencePages 4551–4557This paper addresses the problem of videolevel object instance search, which aims to retrieve the videos in the database that contain a given query object instance. Without prior knowledge about "when" and "where" an object of interest may appear in a ...
- research-articleJanuary 2016
Object Instance Search in Videos via Spatio-Temporal Trajectory Discovery
IEEE Transactions on Multimedia (TOM), Volume 18, Issue 1Pages 116–127https://doi.org/10.1109/TMM.2015.2500734Given a specific object as query, object instance search aims to not only retrieve the images or frames that contain the query, but also locate all its occurrences. In this work, we explore the use of spatio-temporal cues to improve the quality of object ...
- research-articleSeptember 2015
Fast object instance search in videos from one example
2015 IEEE International Conference on Image Processing (ICIP)Pages 4381–4385https://doi.org/10.1109/ICIP.2015.7351634We present an efficient approach to search for and locate all occurrences of a specific object in large video volumes, given a single query example. Locations of object occurrences are returned as spatio-temporal trajectories in the 3D video volume. ...
- research-articleJune 2015
Randomized Spatial Context for Object Search
IEEE Transactions on Image Processing (TIP), Volume 24, Issue 6Pages 1748–1762https://doi.org/10.1109/TIP.2015.2405337Searching visual objects in large image or video data sets is a challenging problem, because it requires efficient matching and accurate localization of query objects that often occupy a small part of an image. Although spatial context has been shown to ...
- research-articleAugust 2013
Robust Part-Based Hand Gesture Recognition Using Kinect Sensor
IEEE Transactions on Multimedia (TOM), Volume 15, Issue 5Pages 1110–1120https://doi.org/10.1109/TMM.2013.2246148The recently developed depth sensors, e.g., the Kinect sensor, have provided new opportunities for human-computer interaction (HCI). Although great progress has been made by leveraging the Kinect sensor, e.g., in human body tracking, face recognition ...
- abstractOctober 2012
Rapid object search engine for contextual advertisement
MM '12: Proceedings of the 20th ACM international conference on MultimediaPages 1275–1276https://doi.org/10.1145/2393347.2396439Visual object search, with the goal to find and locate the target object in large image or video collections, is of great interest for many applications and hence has received intensive attentions in recent years. In this demo, we present a spatial ...
- demonstrationNovember 2011
Robust hand gesture recognition with kinect sensor
MM '11: Proceedings of the 19th ACM international conference on MultimediaPages 759–760https://doi.org/10.1145/2072298.2072443Hand gesture based Human-Computer-Interaction (HCI) is one of the most natural and intuitive ways to communicate between people and machines, since it closely mimics how human interact with each other. In this demo, we present a hand gesture recognition ...
- short-paperOctober 2010
Interactive visual object search through mutual information maximization
MM '10: Proceedings of the 18th ACM international conference on MultimediaPages 1147–1150https://doi.org/10.1145/1873951.1874172Searching for small objects (e.g., logos) in images is a critical yet challenging problem. It becomes more difficult when target objects differ significantly from the query object due to changes in scale, viewpoint or style, not to mention partial ...
- research-articleNovember 2008
Mining Recurring Events Through Forest Growing
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 18, Issue 11Pages 1597–1607https://doi.org/10.1109/TCSVT.2008.2005616Recurring events are short temporal patterns that consist of multiple instances in the target database. Without any a priori knowledge of the recurring events, in terms of their lengths, temporal locations, the total number of such events, and possible ...