Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
An fMRI-based auditory decoding framework combined with convolutional neural network for predicting the semantics of real-life sounds from brain activity
AbstractSemantic decoding, understood as predicting the semantic information carried by stimuli presented to subjects based on neural signals, is an active area of research. Previous studies have mainly focused on the visual perception process, with ...
- research-articleNovember 2024
A systematic network pruning framework using ADMM-type algorithm: A systematic network pruning framework using ADMM-type algorithm
AbstractRegularization is a common technique to prune neural networks which removes some weights with acceptable accuracy. Moreover, the alternating direction method of multipliers (ADMM) can quickly solve regularization problems. However, for learning ...
- research-articleNovember 2024
DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays
ACM Transactions on Graphics (TOG), Volume 43, Issue 6Article No.: 234, Pages 1–19https://doi.org/10.1145/3687897Autostereoscopic display technology, despite decades of development, has not achieved extensive application, primarily due to the daunting challenge of three-dimensional (3D) content creation for non-specialists. The emergence of Radiance Field as an ...
- research-articleOctober 2024
An fMRI-based visual decoding framework combined with two-stage learning and asynchronous iterative update strategy
Pattern Analysis & Applications (PAAS), Volume 27, Issue 4https://doi.org/10.1007/s10044-024-01347-zAbstractReconstructing visual stimulus information from evoked brain activity is a significant task of visual decoding of the human brain. However, due to the limited size of published functional magnetic resonance imaging (fMRI) datasets, it is difficult ...
- research-articleAugust 2024
TFRS: A task-level feature rectification and separation method for few-shot video action recognition
AbstractFew-shot video action recognition (FS-VAR) is a challenging task that requires models to have significant expressive power in order to identify previously unseen classes using only a few labeled examples. However, due to the limited number of ...
Highlights- We rectifies support sample by utilizing information from similar base prototypes.
- We remove the projection onto the shared centroid.
- Our method can be applied to various established FS-VAR frameworks.
- We emphasize the ...
-
- research-articleFebruary 2024
A real-time hierarchical control method for safe human–robot coexistence
Robotics and Computer-Integrated Manufacturing (RCIM), Volume 86, Issue Chttps://doi.org/10.1016/j.rcim.2023.102666AbstractIn modern industrial environments, robots are expected to work close to human operators collaborating with them in completing various tasks, e.g., collaborative assembly or delivery of objects. However, it is difficult to take task optimization ...
Highlights- Hierarchical control method ensure safe human–robot Coexistence.
- Cast the Model Predictive Control as a Bayesian Inference problem
- Safety-critical controller with Stochastic Control Barrier Function to ensure safety
- research-articleFebruary 2024
A multimodal fusion-based deep learning framework combined with local-global contextual TCNs for continuous emotion recognition from videos
Applied Intelligence (KLU-APIN), Volume 54, Issue 4Pages 3040–3057https://doi.org/10.1007/s10489-024-05329-wAbstractContinuous emotion recognition plays a crucial role in developing friendly and natural human-computer interaction applications. However, there exist two significant challenges unresolved in this field: how to effectively fuse complementary ...
- research-articleFebruary 2024
Semantic-guided spatio-temporal attention for few-shot action recognition
Applied Intelligence (KLU-APIN), Volume 54, Issue 3Pages 2458–2471https://doi.org/10.1007/s10489-024-05294-4AbstractFew-shot action recognition is a challenging problem aimed at learning a model capable of adapting to recognize new categories using only a few labeled videos. Recently, some works use attention mechanisms to focus on relevant regions to obtain ...
- research-articleFebruary 2024
GDB: Gated Convolutions-based Document Binarization
AbstractDocument binarization is a crucial pre-processing step for various document analysis tasks. However, existing methods fail to accurately capture stroke edges, primarily due to the inherent limitations of vanilla convolutions and the absence of ...
Highlights- We propose a novel end-to-end document binarization method using gated convolutions.
- We solve the problem of imprecise stroke edge extraction by gated convolutions.
- We visualize the gating values to explain the effectiveness of ...
- research-articleOctober 2023
DocDiff: Document Enhancement via Residual Diffusion Models
- Zongyuan Yang,
- Baolin Liu,
- Yongping Xxiong,
- Lan Yi,
- Guibin Wu,
- Xiaojun Tang,
- Ziqi Liu,
- Junjie Zhou,
- Xing Zhang
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 2795–2806https://doi.org/10.1145/3581783.3611730Removing degradation from document images not only improves their visual quality and readability, but also enhances the performance of numerous automated document analysis and recognition tasks. However, existing regression-based methods optimized for ...
- research-articleOctober 2023
PS-SA: An Efficient Self-Attention via Progressive Sampling for User Behavior Sequence Modeling
- Jiacen Hu,
- Zhangming Chan,
- Yu Zhang,
- Shuguang Han,
- Siyuan Lou,
- Baolin Liu,
- Han Zhu,
- Yuning Jiang,
- Jian Xu,
- Bo Zheng
CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge ManagementPages 4639–4645https://doi.org/10.1145/3583780.3615495As the self-attention mechanism offers powerful capabilities for capturing sequential relationships, it has become increasingly popular to use it for modeling user behavior sequences in recommender systems. However, the self-attention mechanism has a ...
- research-articleSeptember 2023
Ghost key distribution under mutual authentication mechanism
Information Sciences: an International Journal (ISCI), Volume 640, Issue Chttps://doi.org/10.1016/j.ins.2023.119025AbstractMost existing ghost-imaging-based cryptographic key-distribution protocols involve one-way authentication, and the lack of a mutual authentication mechanism can easily lead to the tampering of key information. In this study, we ...
- research-articleAugust 2023
Construction of the brain-inspired computing model verified by spatiotemporal correspondence between the hierarchical computation of the model and the complex multi-stage processing of the human brain during facial expression recognition
Applied Intelligence (KLU-APIN), Volume 53, Issue 21Pages 26286–26295https://doi.org/10.1007/s10489-023-04802-2AbstractHuman brain can recognize facial expressions in a short period of time. The complex multi-stage processing of neural activities in facial expression recognition is involved. Deep convolutional neural network (DCNN) has excellent performance ...
- research-articleAugust 2023
Capturing Conversion Rate Fluctuation during Sales Promotions: A Novel Historical Data Reuse Approach
- Zhangming Chan,
- Yu Zhang,
- Shuguang Han,
- Yong Bai,
- Xiang-Rong Sheng,
- Siyuan Lou,
- Jiacen Hu,
- Baolin Liu,
- Yuning Jiang,
- Jian Xu,
- Bo Zheng
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3774–3784https://doi.org/10.1145/3580305.3599788Conversion rate (CVR) prediction is one of the core components in online recommender systems, and various approaches have been proposed to obtain accurate and well-calibrated CVR estimation. However, we observe that a well-trained CVR prediction model ...
- research-articleJune 2023
A multimodal fusion-based deep learning framework combined with keyframe extraction and spatial and channel attention for group emotion recognition from videos
Pattern Analysis & Applications (PAAS), Volume 26, Issue 3Pages 1493–1503https://doi.org/10.1007/s10044-023-01178-4AbstractVideo-based group emotion recognition is an important research area in computer vision and is of great significance for the intelligent understanding of videos and human–computer interactions. Previous studies have adopted the traditional two-...
- research-articleDecember 2022
OTDE: optimal transport distribution enhancement for few-shot video recognition
Applied Intelligence (KLU-APIN), Volume 53, Issue 13Pages 17115–17127https://doi.org/10.1007/s10489-022-04369-4AbstractIn recent years, action recognition has become a subject of focus in the field of computer vision. Interest has emerged regarding the recognition of previously unseen classes given a few labeled examples; this is known as few-shot video ...
- research-articleOctober 2022
KDM: A knowledge-guided and data-driven method for few-shot video action recognition
AbstractFew Shot-Video Action Recognition (FS-VAR) has recently aroused great interest with the rise of meta-learning. The generalization ability of complex meta-learning is limited despite that it is effective and prevalent. In this work, we ...
- research-articleNovember 2021
High-integrity based cooperative file transmission at urban intersections using pure V2V communication
AbstractVehicular Ad Hoc Networks (VANETs) provide safety management and entertainment services in smart city scenarios. However, due to the highly time-varying network topology caused by vehicles’ movement and non-line-of-sight (NLOS) areas ...
- research-articleOctober 2021
Reconstruction of natural images from evoked brain activity with a dictionary-based invertible encoding procedure
Neurocomputing (NEUROC), Volume 456, Issue CPages 338–351https://doi.org/10.1016/j.neucom.2021.05.083AbstractStudies on visual encoding and reconstruction based on functional magnetic resonance imaging (fMRI) have inspired each other in recent years. However, as far as we know, there has not been any study that has achieved the reconstruction ...
- research-articleJanuary 2021
Visual encoding and decoding of the human brain based on shared features
IJCAI'20: Proceedings of the Twenty-Ninth International Joint Conference on Artificial IntelligenceArticle No.: 103, Pages 738–744Using a convolutional neural network to build visual encoding and decoding models of the human brain is a good starting point for the study on relationship between deep learning and human visual cognitive mechanism. However, related studies have not fully ...