Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleFebruary 2025
A benchmark dataset and semantics-guided detection network for spatial–temporal human actions in urban driving scenes
AbstractIn real urban driving scenes, human actions are very complex and have the characteristic of multiple concurrent actions. It has a great significance to detect human actions in urban traffic scenes for auxiliary or autonomous driving systems. In ...
Highlights- The TITAN-Human Action dataset is built for action detection in urban driving scenes.
- We propose a novel semantics-guided network for spatial-temporal action detection.
- The experimental results show that the proposed SGDNet ...
- ArticleDecember 2024
- research-articleDecember 2024
Class Incremental Learning via Semantic Information Mapping and Background Information Calibrating
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 12_Part_2Pages 13373–13385https://doi.org/10.1109/TCSVT.2024.3447066Replaying episodic memory hippocampus-based is a promising class incremental learning (CIL) method, and it must address the problem of catastrophic forgetting. However, most current studies have ignored background information provided by the ...
- research-articleDecember 2024
Dehazing & Reasoning YOLO: Prior knowledge-guided network for object detection in foggy weather
AbstractFast and accurate object detection in foggy weather is crucial for visual tasks such as autonomous driving and video surveillance. Existing methods typically preprocess images with enhancement techniques before the object detector, so that the ...
Highlights- We design a Restoration Subnetwork Module (RSM) based on the atmospheric scattering model and three Adaptive Feature Fusion Modules (AFFM) for encouraging the network to learn more discriminative features from foggy images.
- We ...
- research-articleNovember 2024
Learning Local-Global Representation for Scribble-Based RGB-D Salient Object Detection via Transformer
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 11_Part_2Pages 11592–11604https://doi.org/10.1109/TCSVT.2024.3424651Manual scribbles have been introduced to RGB-D Salient Object Detection (SOD) as a credible indicator for salient regions and backgrounds, helping to strike a balance between detection accuracy and labeling efficiency. Previous works address this task by ...
-
- research-articleNovember 2024
Detection of color phenotype in strawberry germplasm resources based on field robot and semantic segmentation
- Ningyuan Yang,
- Zhenyu Huang,
- Yong He,
- Wenfei Xiao,
- Hong Yu,
- Lihua Qian,
- Yixin Xu,
- Yimin Tao,
- Ping Lyu,
- Xiaohan Lyu,
- Xuping Feng
Computers and Electronics in Agriculture (COEA), Volume 226, Issue Chttps://doi.org/10.1016/j.compag.2024.109464Highlights- Semantic segmentation of strawberries using the modified Segment Anything Model.
- Class weight resolves severe sample category imbalance.
- CLAHE algorithm reduces the natural light impact in agricultural image processing.
- ...
Strawberry holds significant economic value, but the laborious and time-consuming process of evaluating phenotypic traits in numerous germplasm resources during breeding poses a challenge. Prior studies relied on manual image collection within a ...
- research-articleOctober 2024
VSG<sup>3</sup>A<sup>2</sup>: A Genetic Algorithm-Based Virtual Sample Generation Approach Using Information Gain and Acceptance-Rejection Sampling
IEEE Transactions on Evolutionary Computation (TEC), Volume 28, Issue 5Pages 1514–1528https://doi.org/10.1109/TEVC.2023.3298703Virtual sample generation (VSG) is an important technology for dealing with small sample learning in some industries. Using evolutionary computation algorithms to solve VSG is a promising way. However, two issues remain unaddressed in the existing VSG ...
- ArticleSeptember 2024
Alignment-Enhanced Network for Temporal Language Grounding in Videos
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 177–192https://doi.org/10.1007/978-3-031-72338-4_13AbstractTemporal language grounding in videos aims to ground one video segment in an untrimmed video based on a given sentence query. The main challenge in this task lies in how to align the video and textual modalities effectively. Most existing methods ...
- research-articleSeptember 2024
STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation
AbstractExisting two-stage methods for 3D Human Pose Estimation often use 2D poses as input, which are then lifted to obtain 3D representations. This typically involves frame-by-frame estimation, whether in 2D or 3D, resulting in high computational ...
Highlights- TDFR recovers dense temporal frames from sparsely ones provided by a 2D pose estimator.
- TDFR facilitates 10x increase in estimation speed while maintaining accuracy.
- STAF module adaptively fuses through both spatial and temporal ...
- research-articleOctober 2024
Technological Dialogue and Intelligent Generation: An Empirical Study of Primary Chinese Teachers Using Generative AI for Lesson Preparation
IECT '24: Proceedings of the 2024 International Conference on Intelligent Education and Computer TechnologyPages 606–611https://doi.org/10.1145/3687311.3687419The purpose of this study is to explore the application of generative artificial intelligence (AI) in Chinese literacy teaching in the first grade of primary school and its influence on teachers' practical knowledge construction and teaching innovation. ...
- research-articleJune 2024
Dynamic Properties and Chaos Control of a High Dimensional Double Rotor Model
Automatic Control and Computer Sciences (ACCS), Volume 58, Issue 3Pages 227–236https://doi.org/10.3103/S0146411624700123AbstractIn this paper, a high dimensional double rotor model is proposed. We establish its dynamic equations, and simply it into a four-dimensional mapping form. The bifurcations of the double rotor mapping under different control parameters are ...
- research-articleJune 2024
Correlation-Guided Semantic Consistency Network for Visible-Infrared Person Re-Identification
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 6Pages 4503–4515https://doi.org/10.1109/TCSVT.2023.3340225Visible-infrared person re-identification (VI-ReID) has raised more attention in night-time surveillance applications due to the struggle to capture valid appearance information under poor illumination conditions via visible cameras. Existing works ...
- ArticleJuly 2024
Lung Cancer Risk Prediction Model Trained with Multi-source Data
AbstractRecent research about lung cancer risk prediction model require the data for predicting as same as the data for training whether based on single-source data or multi-source data. Both of them either cannot fully use collected multi-source data to ...
- research-articleMay 2024
Mask-guided discriminative feature network for occluded person re-identification
Journal of Visual Communication and Image Representation (JVCIR), Volume 101, Issue Chttps://doi.org/10.1016/j.jvcir.2024.104178AbstractIn recent years, although research on person re-identification (ReID) has made significant progress, occluded person ReID remains a major challenge. In real-world scenes, persons are often occluded by various obstacles such as vehicles, umbrellas,...
Highlights- Designed Mask-guided Discriminative Feature Enhancement and Fusion (MDFEF) module to balance global and local information, suppressing occlusion noise.
- Proposed end-to-end Mask-guided Discriminative Feature Network (MDFNet) for ...
- research-articleMay 2024
Knowledge graph embedding based on dynamic adaptive atrous convolution and attention mechanism for link prediction
Information Processing and Management: an International Journal (IPRM), Volume 61, Issue 3https://doi.org/10.1016/j.ipm.2024.103642AbstractKnowledge graph embedding (KGE) is essential for various applications, particularly in link prediction and other downstream tasks. While existing convolutional neural network (CNN)-based methods have been effective, they face challenges in ...
Highlights- We construct a new dynamic adaptive atrous convolutional neural network.
- Convolutional kernel can dynamically learn multidimensional attentions to inputs.
- Model can expand receptive field and avoid losses from pooling and ...
- research-articleMay 2024
A multi-granularity hierarchical network for long- and short-term forecasting on multivariate time series data
AbstractMultivariate time series forecasting is a significant research problem in many fields such as economics, finance and transportation, where simultaneous long- and short-term forecasting is required. However, current techniques are typically ...
Highlights- It is for long- and short-term forecasting on multivariate time series data.
- It is a multi-granular hierarchy network using idea of granularity division.
- The external relationships and internal correlation are introduced and mined.
- research-articleMarch 2024
Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data
- Xuhai Xu,
- Bingsheng Yao,
- Yuanzhe Dong,
- Saadia Gabriel,
- Hong Yu,
- James Hendler,
- Marzyeh Ghassemi,
- Anind K. Dey,
- Dakuo Wang
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Volume 8, Issue 1Article No.: 31, Pages 1–32https://doi.org/10.1145/3643540Advances in large language models (LLMs) have empowered a variety of applications. However, there is still a significant gap in research when it comes to understanding and enhancing the capabilities of LLMs in the field of mental health. In this work, we ...
- research-articleFebruary 2024
Depression detection via capsule networks with contrastive learning
- Han Liu,
- Changya Li,
- Xiaotong Zhang,
- Feng Zhang,
- Wei Wang,
- Fenglong Ma,
- Hongyang Chen,
- Hong Yu,
- Xianchao Zhang
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 2480, Pages 22231–22239https://doi.org/10.1609/aaai.v38i20.30228Depression detection is a challenging and crucial task in psychological illness diagnosis. Utilizing online user posts to predict whether a user suffers from depression seems an effective and promising direction. However, existing methods suffer from ...