Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleFebruary 2025
A Metric-Based Detection System for Large Language Model Texts
ACM Transactions on Management Information Systems (TMIS), Volume 16, Issue 1Article No.: 8, Pages 1–19https://doi.org/10.1145/3704739More efforts are being put into improving the capabilities of Large Language Models (LLM) than into dealing with their implications. Current LLMs are able to generate high-quality texts seemingly indistinguishable from those written by human experts. ...
- research-articleOctober 2024
Two Weakly Supervised Approaches for Role Classification of Soccer Players
MMSports '24: Proceedings of the 7th ACM International Workshop on Multimedia Content Analysis in SportsPages 81–89https://doi.org/10.1145/3689061.3689072Role classification of players according to their uniform or playing kit in unseen soccer game scenes remains a challenging problem. While multiple methods have being proposed for this task, both handcrafted and deep learning methods have been designed ...
- research-articleOctober 2024
Knowledge Augmentation for Distillation: A General and Effective Approach to Enhance Knowledge Distillation
EMCLR'24: Proceedings of the 1st International Workshop on Efficient Multimedia Computing under LimitedPages 23–31https://doi.org/10.1145/3688863.3689569Knowledge Distillation (KD), which extracts knowledge from a well-performed large neural network (a.k.a teacher network) to guide the training of a small network (a.k.a student network), has emerged as a promising approach for transfer learning and model ...
- research-articleOctober 2024
STIR: Siamese Transformer for Image Retrieval Postprocessing
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge ManagementPages 4852–4857https://doi.org/10.1145/3627673.3680075Current metric learning approaches for image retrieval are usually based on learning a space of informative latent representations where simple approaches such as the cosine distance would work well. Recent state of the art methods such as HypViT move to ...
- ArticleNovember 2024
Delving Deeper Into Clean Samples for Combating Noisy Labels
AbstractReal-world datasets usually contain inevitable noisy labels, which will cause deep networks to overfit inaccurate information and yield degenerated performance. Previous literature typically focuses on sample selection to alleviate noisy labels. ...
-
- ArticleAugust 2024
Cross-Domain Few-Shot Fine-Grained Classification Based on Local-Global Semantic Consistency and Earth Mover’s Distance
Advanced Intelligent Computing Technology and ApplicationsPages 286–297https://doi.org/10.1007/978-981-97-5594-3_24AbstractIn recent years, few-shot classification algorithms based on metric learning have gained significant attention in the field. However, in the context of cross-domain few-shot classification tasks, their performance still requires further ...
- research-articleJuly 2024
Creating Ensembles of Classifiers through UMDA for Aerial Scene Classification
- Fabio Augusto Faria,
- Luiz Henrique Buris,
- Luis Augusto Martins Pereira,
- Fabio Augusto Menocci Cappabianco
GECCO '24: Proceedings of the Genetic and Evolutionary Computation ConferencePages 1228–1236https://doi.org/10.1145/3638529.3653998Aerial scene classification in remote sensing presents a significant challenge due to high intra-class variability and the different scales and orientations of the objects within dataset images. While deep learning architectures are commonly used for ...
- short-paperMay 2024
Enabling Pre-Shock State Detection using Electrogram Signals from Implantable Cardioverter-Defibrillators
WWW '24: Companion Proceedings of the ACM Web Conference 2024Pages 1138–1141https://doi.org/10.1145/3589335.3651450Identifying electrical signatures preceding a ventricular arrhythmia from the implantable cardioverter-defibrillators (ICDs) can help predict an upcoming ICD shock. To achieve this, we first deployed a large-scale study (N=326) to continuously monitor ...
- research-articleMay 2024
A Visual Embedding for the Supervised Image based on self-attention
CACML '24: Proceedings of the 2024 3rd Asia Conference on Algorithms, Computing and Machine LearningPages 23–28https://doi.org/10.1145/3654823.3654828Deep learning has proven itself to be a successful set of models for learning useful semantic representations of data. In this paper, we explore the similar information and semantic information on vector space of images with self-attention mechanism. For ...
- research-articleMarch 2024
IncMSR: An Incremental Learning Approach for Multi-Scenario Recommendation
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 939–948https://doi.org/10.1145/3616855.3635828For better performance and less resource consumption, multi-scenario recommendation (MSR) is proposed to train a unified model to serve all scenarios by leveraging data from multiple scenarios. Current works in MSR focus on designing effective networks ...
- research-articleJanuary 2024
A coarse-to-fine unsupervised domain adaptation method based on metric learning
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 46, Issue 1Pages 3013–3027https://doi.org/10.3233/JIFS-235912Domain adaptation solves the challenge of inadequate labeled samples in the target domain by leveraging the knowledge learned from the labeled source domain. Most existing approaches aim to reduce the domain shift by performing some coarse alignments ...
- research-articleDecember 2023
Hierarchical Learning and Dummy Triplet Loss for Efficient Deepfake Detection
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 3Article No.: 89, Pages 1–18https://doi.org/10.1145/3626101The advancement of generative models has made it easier to create highly realistic Deepfake videos. This accessibility has led to a surge in research on Deepfake detection to mitigate potential misuse. Typically, Deepfake detection models utilize binary ...
- research-articleDecember 2023
Learning feature alignment and dual correlation for few‐shot image classification
CAAI Transactions on Intelligence Technology (CIT2), Volume 9, Issue 2Pages 303–318https://doi.org/10.1049/cit2.12273AbstractFew‐shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the ...
- ArticleDecember 2023
Intra-variance Guided Metric Learning for Face Forgery Detection
AbstractSince facial manipulation technology has raised serious concerns, facial forgery detection has also attracted increasing attention. Although recent work has made good achievements, the detection of unseen fake faces is still a big challenge. In ...
- research-articleOctober 2023
Preserving Local and Global Information: An Effective Metric-based Subspace Clustering
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 3619–3627https://doi.org/10.1145/3581783.3612235Subspace clustering, which recoveries the subspace representation in the form of an affinity graph, has drawn tons of attention due to its effectiveness in various clustering tasks. However, existing subspace clustering methods are usually fed with raw ...
- research-articleOctober 2023
Causal Intervention for Sparse-View Gait Recognition
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 77–85https://doi.org/10.1145/3581783.3612124Gait recognition aims at identifying individuals by unique walking patterns at a long distance. However, prevailing methods suffer from a large degradation when applied to large-scale surveillance systems. We find a significant cause of this issue is ...
- research-articleOctober 2023
Style-Controllable Generalized Person Re-identification
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 7912–7921https://doi.org/10.1145/3581783.3611802Domain generalizable person Re-identification is a challenging and realistic task. It requires a model to train on multi-source domains and then generalizes well on unseen target domains. Existing approaches typically mix images from different domains in ...
- ArticleDecember 2023
Prototype Rectification with Region-Wise Foreground Enhancement for Few-Shot Classification
AbstractFew-Shot Classification is a challenging problem as it uses only very few labeled examples to assign labels for query samples. The Prototypical Network effectively addresses the issue by matching the nearest mean-based prototype for each query ...
- ArticleDecember 2023
Shared Nearest Neighbor Calibration for Few-Shot Classification
AbstractFew-shot classification aims to classify query samples using very few labeled examples. Most existing methods follow the Prototypical Network to classify query samples by matching them to the nearest centroid. However, scarce labeled examples tend ...