Author: Yu, Hong : Search

221 Results for: Author: Yu, Hong

Searched The ACM Guide to Computing Literature (3,855,380 records)

Showing 1 - 20 of 221 Results

research-article
February 2025
A benchmark dataset and semantics-guided detection network for spatial–temporal human actions in urban driving scenes
Pattern Recognition (PATT), Volume 158, Issue C, https://doi.org/10.1016/j.patcog.2024.111035
Abstract
In real urban driving scenes, human actions are very complex and often involve multiple concurrent actions. Detecting human actions in urban traffic scenes is of great significance for auxiliary or autonomous driving systems. In ...
Highlights

The TITAN-Human Action dataset is built for action detection in urban driving scenes.
We propose a novel semantics-guided network for spatial-temporal action detection.
The experimental results show that the proposed SGDNet ...
Total Citations: 0
Article
December 2024
Dual Branch Non-Autoregressive Image Captioning
- Yuanqiu Liu,
- Hong Yu,
- Hui Li,
- Xin Han,
- Han Liu
Pattern Recognition, Pages 325–340, https://doi.org/10.1007/978-3-031-78456-9_21
Abstract
Image captioning is a typical task in multimodal learning. Many existing image captioning models rely on autoregressive paradigms, causing notable delays in inference and impacting practical applications. While non-autoregressive methods ...
Total Citations: 0
research-article
December 2024
Class Incremental Learning via Semantic Information Mapping and Background Information Calibrating
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 12_Part_2, Pages 13373–13385, https://doi.org/10.1109/TCSVT.2024.3447066
Hippocampus-based episodic memory replay is a promising class incremental learning (CIL) method, and it must address the problem of catastrophic forgetting. However, most current studies have ignored background information provided by the ...
Total Citations: 0
research-article
December 2024
Dehazing & Reasoning YOLO: Prior knowledge-guided network for object detection in foggy weather
Pattern Recognition (PATT), Volume 156, Issue C, https://doi.org/10.1016/j.patcog.2024.110756
Abstract
Fast and accurate object detection in foggy weather is crucial for visual tasks such as autonomous driving and video surveillance. Existing methods typically preprocess images with enhancement techniques before the object detector, so that the ...
Highlights

We design a Restoration Subnetwork Module (RSM) based on the atmospheric scattering model and three Adaptive Feature Fusion Modules (AFFM) for encouraging the network to learn more discriminative features from foggy images.
We ...
Total Citations: 0
research-article
November 2024
Learning Local-Global Representation for Scribble-Based RGB-D Salient Object Detection via Transformer
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 11_Part_2, Pages 11592–11604, https://doi.org/10.1109/TCSVT.2024.3424651
Manual scribbles have been introduced to RGB-D Salient Object Detection (SOD) as a credible indicator for salient regions and backgrounds, helping to strike a balance between detection accuracy and labeling efficiency. Previous works address this task by ...
Total Citations: 0
erratum
November 2024
Corrigendum to “STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation” [Journal of Image and Vision Computing volume 149 (2024) 105142]
- Feng Hao,
- Fujin Zhong,
- Yunhe Wang,
- Hong Yu,
- Jun Hu,
- Yan Yang
Image and Vision Computing (IAVC), Volume 151, Issue C, https://doi.org/10.1016/j.imavis.2024.105305
Total Citations: 0
research-article
November 2024
Detection of color phenotype in strawberry germplasm resources based on field robot and semantic segmentation
- Ningyuan Yang,
- Zhenyu Huang,
- Yong He,
- Wenfei Xiao,
- Hong Yu,
- Lihua Qian,
- Yixin Xu,
- Yimin Tao,
- Ping Lyu,
- Xiaohan Lyu,
- Xuping Feng
Computers and Electronics in Agriculture (COEA), Volume 226, Issue C, https://doi.org/10.1016/j.compag.2024.109464
Highlights

Semantic segmentation of strawberries using the modified Segment Anything Model.
Class weight resolves severe sample category imbalance.
CLAHE algorithm reduces the natural light impact in agricultural image processing.
...
Abstract
Strawberry holds significant economic value, but the laborious and time-consuming process of evaluating phenotypic traits in numerous germplasm resources during breeding poses a challenge. Prior studies relied on manual image collection within a ...
Total Citations: 0
research-article
October 2024
VSG³A²: A Genetic Algorithm-Based Virtual Sample Generation Approach Using Information Gain and Acceptance-Rejection Sampling
IEEE Transactions on Evolutionary Computation (TEC), Volume 28, Issue 5, Pages 1514–1528, https://doi.org/10.1109/TEVC.2023.3298703
Virtual sample generation (VSG) is an important technology for dealing with small sample learning in some industries. Using evolutionary computation algorithms to solve VSG is a promising way. However, two issues remain unaddressed in the existing VSG ...
Total Citations: 0
erratum
October 2024
Corrigendum to “Mask-guided discriminative feature network for occluded person re-identification” [J. Vis. Commun. Image Represent. 101 (2024) 104178]
- Fujin Zhong,
- Yunhe Wang,
- Hong Yu,
- Jun Hu,
- Yan Yang
Journal of Visual Communication and Image Representation (JVCIR), Volume 104, Issue C, https://doi.org/10.1016/j.jvcir.2024.104301
Total Citations: 0
Article
September 2024
Alignment-Enhanced Network for Temporal Language Grounding in Videos
- Hong Yu,
- Yu Zhang,
- Yuanqiu Liu,
- Hui Li,
- Han Liu
Artificial Neural Networks and Machine Learning – ICANN 2024, Pages 177–192, https://doi.org/10.1007/978-3-031-72338-4_13
Abstract
Temporal language grounding in videos aims to ground one video segment in an untrimmed video based on a given sentence query. The main challenge in this task lies in how to align the video and textual modalities effectively. Most existing methods ...
Total Citations: 0
research-article
September 2024
STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation
- Feng Hao,
- Fujin Zhong,
- Hong Yu,
- Jun Hu,
- Yan Yang
Image and Vision Computing (IAVC), Volume 149, Issue C, https://doi.org/10.1016/j.imavis.2024.105142
Abstract
Existing two-stage methods for 3D Human Pose Estimation often use 2D poses as input, which are then lifted to obtain 3D representations. This typically involves frame-by-frame estimation, whether in 2D or 3D, resulting in high computational ...
Highlights

TDFR recovers dense temporal frames from sparse ones provided by a 2D pose estimator.
TDFR facilitates 10x increase in estimation speed while maintaining accuracy.
STAF module adaptively fuses through both spatial and temporal ...
Total Citations: 0
research-article
October 2024
Technological Dialogue and Intelligent Generation: An Empirical Study of Primary Chinese Teachers Using Generative AI for Lesson Preparation
- Cuitian Huang,
- Hong Yu
IECT '24: Proceedings of the 2024 International Conference on Intelligent Education and Computer Technology, Pages 606–611, https://doi.org/10.1145/3687311.3687419

The purpose of this study is to explore the application of generative artificial intelligence (AI) in Chinese literacy teaching in the first grade of primary school and its influence on teachers' practical knowledge construction and teaching innovation. ...
Total Citations: 0
Total Downloads: 43
Last 12 Months: 43
Last 6 weeks: 14
research-article
June 2024
Dynamic Properties and Chaos Control of a High Dimensional Double Rotor Model
Automatic Control and Computer Sciences (ACCS), Volume 58, Issue 3, Pages 227–236, https://doi.org/10.3103/S0146411624700123
Abstract
In this paper, a high dimensional double rotor model is proposed. We establish its dynamic equations, and simplify it into a four-dimensional mapping form. The bifurcations of the double rotor mapping under different control parameters are ...
Total Citations: 0
research-article
June 2024
Correlation-Guided Semantic Consistency Network for Visible-Infrared Person Re-Identification
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 6, Pages 4503–4515, https://doi.org/10.1109/TCSVT.2023.3340225
Visible-infrared person re-identification (VI-ReID) has raised more attention in night-time surveillance applications due to the struggle to capture valid appearance information under poor illumination conditions via visible cameras. Existing works ...
Total Citations: 4
Article
July 2024
Lung Cancer Risk Prediction Model Trained with Multi-source Data
- Shijie Sun,
- Hanyue Liu,
- Ye Wang,
- Hong Yu
Rough Sets, Pages 280–294, https://doi.org/10.1007/978-3-031-65668-2_19
Abstract
Recent research on lung cancer risk prediction models requires the data used for prediction to be the same as the data used for training, whether based on single-source data or multi-source data. Both of them either cannot fully use collected multi-source data to ...
Total Citations: 0
research-article
May 2024
Mask-guided discriminative feature network for occluded person re-identification
- Fujin Zhong,
- Yunhe Wang,
- Hong Yu,
- Jun Hu,
- Yan Yang
Journal of Visual Communication and Image Representation (JVCIR), Volume 101, Issue C, https://doi.org/10.1016/j.jvcir.2024.104178
Abstract
In recent years, although research on person re-identification (ReID) has made significant progress, occluded person ReID remains a major challenge. In real-world scenes, persons are often occluded by various obstacles such as vehicles, umbrellas,...
Highlights

Designed Mask-guided Discriminative Feature Enhancement and Fusion (MDFEF) module to balance global and local information, suppressing occlusion noise.
Proposed end-to-end Mask-guided Discriminative Feature Network (MDFNet) for ...
Total Citations: 0
research-article
May 2024
Knowledge graph embedding based on dynamic adaptive atrous convolution and attention mechanism for link prediction
Information Processing and Management: an International Journal (IPRM), Volume 61, Issue 3, https://doi.org/10.1016/j.ipm.2024.103642
Abstract
Knowledge graph embedding (KGE) is essential for various applications, particularly in link prediction and other downstream tasks. While existing convolutional neural network (CNN)-based methods have been effective, they face challenges in ...
Highlights

We construct a new dynamic adaptive atrous convolutional neural network.
Convolutional kernel can dynamically learn multidimensional attentions to inputs.
Model can expand receptive field and avoid losses from pooling and ...
Total Citations: 3
research-article
May 2024
A multi-granularity hierarchical network for long- and short-term forecasting on multivariate time series data
Applied Soft Computing (APSC), Volume 157, Issue C, https://doi.org/10.1016/j.asoc.2024.111537
Abstract
Multivariate time series forecasting is a significant research problem in many fields such as economics, finance and transportation, where simultaneous long- and short-term forecasting is required. However, current techniques are typically ...
Highlights

It is for long- and short-term forecasting on multivariate time series data.
It is a multi-granular hierarchy network using idea of granularity division.
The external relationships and internal correlation are introduced and mined.
Total Citations: 1
research-article
March 2024
Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Volume 8, Issue 1, Article No.: 31, Pages 1–32, https://doi.org/10.1145/3643540

Advances in large language models (LLMs) have empowered a variety of applications. However, there is still a significant gap in research when it comes to understanding and enhancing the capabilities of LLMs in the field of mental health. In this work, we ...
Total Citations: 29
Total Downloads: 3,766
Last 12 Months: 3,766
Last 6 weeks: 485
research-article
February 2024
Depression detection via capsule networks with contrastive learning
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence, Article No.: 2480, Pages 22231–22239, https://doi.org/10.1609/aaai.v38i20.30228

Depression detection is a challenging and crucial task in psychological illness diagnosis. Utilizing online user posts to predict whether a user suffers from depression seems an effective and promising direction. However, existing methods suffer from ...
Total Citations: 0
