Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
Exploiting Pre-Trained Language Models for Black-Box Attack against Knowledge Graph Embeddings
ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 19, Issue 1Article No.: 1, Pages 1–14https://doi.org/10.1145/3688850Despite the emerging research on adversarial attacks against knowledge graph embedding (KGE) models, most of them focus on white-box attack settings. However, white-box attacks are difficult to apply in practice compared to black-box attacks since they ...
- research-articleNovember 2024
Multi-scale feature extraction and fusion with attention interaction for RGB-T tracking
AbstractRGB-T single-object tracking aims to track objects utilizing both RGB images and thermal infrared(TIR) images. Though the siamese-based RGB-T tracker shows its advantage in tracking speed, its accuracy still cannot be compared with other state-of-...
Highlights- feature extraction use multi-scale strategy and channel spatial fusion strategy.
- Introduce a self-attention interaction module to facilitate fast convergence.
- Our proposed method achieved the best performance compared to other ...
- research-articleNovember 2024
Meta-collaborative comparison for effective cross-domain few-shot learning
AbstractRecent advancements in cross-domain few-shot learning (CD-FSL) primarily focus on learning to compare global representations between query and support images for classification. However, due to the notorious cross-domain semantic gap, the ideal ...
Highlights- We present a novel framework for CDFSL.
- The proposed method has strong generalization ability.
- The proposed method achieves state-of-the-art performance.
- research-articleNovember 2024
Neural quantile optimization for edge–cloud networking
Computer Networks: The International Journal of Computer and Telecommunications Networking (CNTW), Volume 253, Issue Chttps://doi.org/10.1016/j.comnet.2024.110713AbstractWe seek the best traffic allocation scheme for the edge–cloud networking subject to SD-WAN architecture and burstable billing. First, we formulate a family of quantile-based integer programming problems for a fixed network topology with random ...
- research-articleNovember 2024
Multi-dimensional attention-aided transposed ConvBiLSTM network for hyperspectral image super-resolution
Computer Vision and Image Understanding (CVIU), Volume 248, Issue Chttps://doi.org/10.1016/j.cviu.2024.104096AbstractHyperspectral (HS) image always suffers from the deficiency of low spatial resolution, compared with conventional optical image types, which has limited its further applications in remote sensing areas. Therefore, HS image super-resolution (SR) ...
Highlights- A transposed convolutional bi-directional LSTM SR network is constructed for HS image.
- The network aims at modeling the spatial–sequential correlated features of HS bands.
- Multi-dimensional attention mechanism (MDAM) is proposed.
-
- research-articleOctober 2024
Dual-path Collaborative Generation Network for Emotional Video Captioning
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 496–505https://doi.org/10.1145/3664647.3681603Emotional Video Captioning (EVC) is an emerging task that aims to describe factual content with the intrinsic emotions expressed in videos. The essential of the EVC task is to effectively perceive subtle and ambiguous visual emotional cues during the ...
- research-articleOctober 2024
STAR-VP: Improving Long-term Viewport Prediction in 360° Videos via Space-aligned and Time-varying Fusion
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 5556–5565https://doi.org/10.1145/3664647.3681268Accurate long-term viewport prediction in tile-based 360° video adaptive streaming helps pre-download tiles for a further future, thus establishing a longer buffer to cope with network fluctuations. Long-term viewport motion is mainly influenced by ...
- research-articleOctober 2024
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 4572–4580https://doi.org/10.1145/3664647.3681062High frame-rate~(HFR) videos of action recognition improve fine-grained expression while reducing the spatio-temporal relation and motion information density. Thus, large amounts of video samples are continuously required for traditional data-driven ...
- research-articleOctober 2024
SSL: A Self-similarity Loss for Improving Generative Image Super-resolution
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 3189–3198https://doi.org/10.1145/3664647.3680874Generative adversarial networks (GAN) and generative diffusion models (DM) have been widely used in real-world image super-resolution (Real-ISR) to enhance the image perceptual quality. However, these generative models are prone to generating visual ...
- research-articleOctober 2024
Video Anomaly Detection via Progressive Learning of Multiple Proxy Tasks
- Menghao Zhang,
- Jingyu Wang,
- Qi Qi,
- Pengfei Ren,
- Haifeng Sun,
- Zirui Zhuang,
- Huazheng Wang,
- Lei Zhang,
- Jianxin Liao
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 4719–4728https://doi.org/10.1145/3664647.3680871Learning multiple proxy tasks is a popular training strategy in semi-supervised video anomaly detection. However, the traditional method of learning multiple proxy tasks simultaneously is prone to suboptimal solutions, and simply executing multiple proxy ...
- research-articleOctober 2024
Subspace-Contrastive Multi-View Clustering
ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 18, Issue 9Article No.: 211, Pages 1–35https://doi.org/10.1145/3674839Most multi-view clustering methods based on shallow models are limited in sound nonlinear information perception capability, or fail to effectively exploit complementary information hidden in different views. To tackle these issues, we propose a novel ...
- tutorialOctober 2024
Unifying Spectral and Spatial Graph Neural Networks
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge ManagementPages 5511–5513https://doi.org/10.1145/3627673.3679088In recent years, Graph Neural Networks (GNNs) have attracted considerable attention. However, the rapid emergence of diverse GNN models, each grounded in different theoretical foundations, complicates the model selection process, as these models are not ...
- research-articleOctober 2024
FedRFC: Federated Learning with Recursive Fuzzy Clustering for improved non-IID data training
Future Generation Computer Systems (FGCS), Volume 160, Issue CPages 835–843https://doi.org/10.1016/j.future.2024.06.049AbstractIn contemporary times, artificial intelligence is extensively applied across domains, concurrently raising concerns about privacy breaches. In response, federated learning has emerged as a promising solution that allows multiple parties to ...
Highlights- Propose a fuzzy clustering-based FL framework for the under utilization of data.
- Propose a recursive bi-partitioning clustering algorithm for client partitioning.
- Design a model fusion strategy to enhance the performance of ...
- research-articleOctober 2024
Fuzzy preference matroids rough sets for approximate guided representation in transformer
Expert Systems with Applications: An International Journal (EXWA), Volume 255, Issue PBhttps://doi.org/10.1016/j.eswa.2024.124592AbstractRecently, the transformer has exhibited remarkable performance across various applications, primarily owing to its exceptional capability in capturing global information through the attention mechanism. Nevertheless, the dot-product within ...
Highlights- The concept of fuzzy preference matroids rough set is introduced.
- A novel approximate guided representation method is developed.
- Constructed a plug-and-play transformer block based on the proposed method.
- The effectiveness and ...
- research-articleOctober 2024
Optimizing vehicle edge computing task offloading at intersections: a fuzzy decision-making approach
AbstractDue to the rapid development of the Internet of Vehicles (IoV), the combination of IoV and edge computing, known as vehicle edge computing (VEC), has received considerable attention from both academia and industry. However, task offloading in ...
- research-articleOctober 2024
VRCopilot: Authoring 3D Layouts with Generative AI Models in VR
UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 96, Pages 1–13https://doi.org/10.1145/3654777.3676451Immersive authoring provides an intuitive medium for users to create 3D scenes via direct manipulation in Virtual Reality (VR). Recent advances in generative AI have enabled the automatic creation of realistic 3D layouts. However, it is unclear how ...
- ArticleOctober 2024
MetaUNETR: Rethinking Token Mixer Encoding for Efficient Multi-organ Segmentation
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024Pages 446–455https://doi.org/10.1007/978-3-031-72114-4_43AbstractThe Transformer architecture and versatile CNN backbones have led to advanced progress in sequence modeling and dense prediction tasks. A critical development is the incorporation of different token mixing modules such as ConvNeXt, Swin ...
- ArticleOctober 2024
LIDIA: Precise Liver Tumor Diagnosis on Multi-Phase Contrast-Enhanced CT via Iterative Fusion and Asymmetric Contrastive Learning
- Wei Huang,
- Wei Liu,
- Xiaoming Zhang,
- Xiaoli Yin,
- Xu Han,
- Chunli Li,
- Yuan Gao,
- Yu Shi,
- Le Lu,
- Ling Zhang,
- Lei Zhang,
- Ke Yan
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024Pages 394–404https://doi.org/10.1007/978-3-031-72114-4_38AbstractThe early detection and precise diagnosis of liver tumors are tasks of critical clinical value, yet they pose significant challenges due to the high heterogeneity and variability of liver tumors. In this work, a precise LIver tumor DIAgnosis ...
- research-articleNovember 2024
RIS-NOMA communications over Nakagami-m fading with imperfect successive interference cancellation
AbstractConsidering imperfect successive interference cancellation (SIC) for non-orthogonal multiple access (NOMA) communications, this work studies the cooperative reconfigurable intelligent surface (RIS)- and relay-assisted system under Nakagami-m ...