Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
Prompt-based visual alignment for zero-shot policy transfer
- Haihan Gao,
- Rui Zhang,
- Qi Yi,
- Hantao Yao,
- Haochen Li,
- Jiaming Guo,
- Shaohui Peng,
- Yunkai Gao,
- QiCheng Wang,
- Xing Hu,
- Yuanbo Wen,
- Zihao Zhang,
- Zidong Du,
- Ling Li,
- Qi Guo,
- Yunji Chen
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 599, Pages 14954–14968Overfitting in RL has become one of the main obstacles to applications in reinforcement learning (RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain ...
- research-articleJanuary 2025
OCEAN-MBRL: offline conservative exploration for model-based offline reinforcement learning
- Fan Wu,
- Rui Zhang,
- Qi Yi,
- Yunkai Gao,
- Jiaming Guo,
- Shaohui Peng,
- Siming Lan,
- Husheng Han,
- Yansong Pan,
- Kaizhao Yuan,
- Pengwei Jin,
- Ruizhi Chen,
- Yunji Chen,
- Ling Li
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 1772, Pages 15897–15905https://doi.org/10.1609/aaai.v38i14.29520Model-based offline reinforcement learning (RL) algorithms have emerged as a promising paradigm for offline RL. These algorithms usually learn a dynamics model from a static dataset of transitions, use the model to generate synthetic trajectories, and ...
- research-articleMay 2024
Context shift reduction for offline meta-reinforcement learning
- Yunkai Gao,
- Rui Zhang,
- Jiaming Guo,
- Fan Wu,
- Qi Yi,
- Shaohui Peng,
- Siming Lan,
- Ruizhi Chen,
- Zidong Du,
- Xing Hu,
- Qi Guo,
- Ling Li,
- Yunji Chen
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 3505, Pages 80024–80043Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to enhance the agent's generalization ability on unseen tasks. However, the context shift problem arises due to the distribution discrepancy between the contexts used for ...
- research-articleMay 2024
Contrastive modules with temporal attention for multi-task reinforcement learning
- Siming Lan,
- Rui Zhang,
- Qi Yi,
- Jiaming Guo,
- Shaohui Peng,
- Yunkai Gao,
- Fan Wu,
- Ruizhi Chen,
- Zidong Du,
- Xing Hu,
- Xishan Zhang,
- Ling Li,
- Yunji Chen
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1585, Pages 36507–36523In the field of multi-task reinforcement learning, the modular principle, which involves specializing functionalities into different modules and combining them appropriately, has been widely adopted as a promising approach to prevent the negative ...
- research-articleJuly 2023
Online prototype alignment for few-shot policy transfer
- Qi Yi,
- Rui Zhang,
- Shaohui Peng,
- Jiaming Guo,
- Yunkai Gao,
- Kaizhao Yuan,
- Ruizhi Chen,
- Siming Lan,
- Xing Hu,
- Zidong Du,
- Xishan Zhang,
- Qi Guo,
- Yunji Chen
ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 1670, Pages 39968–39983Domain adaptation in reinforcement learning (RL) mainly deals with the changes of observation when transferring the policy to a new environment. Many traditional approaches of domain adaptation in RL manage to learn a mapping function between the source ...
- research-articleMay 2023
Serial combinational optimization method for double wishbone suspension’s pseudo damage improvement
Structural and Multidisciplinary Optimization (SPSMO), Volume 66, Issue 6https://doi.org/10.1007/s00158-023-03579-9AbstractThe durability of commercial vehicles is heavily dependent on the design of their suspension systems’ hard points. The design optimization of hard points involves a large number of design parameters. However, traditional surrogate model-assisted ...
- research-articleDecember 2021
A feasible identification method of uncertainty responses for vehicle structures
Structural and Multidisciplinary Optimization (SPSMO), Volume 64, Issue 6Pages 3861–3876https://doi.org/10.1007/s00158-021-03065-0AbstractMany unavoidable uncertainties in the engineering structure will affect its performance response. It is necessary to analyze the uncertain responses generated by uncertain factors in the vehicle design. Therefore, this study proposes a feasible ...
- articleMarch 2019
Robust topology optimization for multiple fiber-reinforced plastic (FRP) composites under loading uncertainties
Structural and Multidisciplinary Optimization (SPSMO), Volume 59, Issue 3Pages 695–711https://doi.org/10.1007/s00158-018-2175-0This study proposes a non-deterministic robust topology optimization of ply orientation for multiple fiber-reinforced plastic (FRP) materials, such as carbon fiber---reinforced plastic (CFRP) and glass fiber---reinforced plastic (GFRP) composites, under ...
- research-articleOctober 2015
Crashworthiness analysis and design of multi-cell hexagonal columns under multiple loading cases
Finite Elements in Analysis and Design (FEAD), Volume 104, Issue CPages 89–101https://doi.org/10.1016/j.finel.2015.06.004Multi-cell thin-walled structures have proven fairly effective in energy absorption and have been extensively used in vehicle engineering. However, the effects of multi-cell configurations and oblique loads on the crashworthiness performance have been ...
- articleMay 2013
Multiobjective reliability-based optimization for design of a vehicledoor
Finite Elements in Analysis and Design (FEAD), Volume 67Pages 13–21https://doi.org/10.1016/j.finel.2012.11.007Structural optimization for vehicle door signifies one of the key topics of research to continuously improve its performances. However, majority of the studies to date have not considered uncertainties whilst it has been known that a deterministic ...
- ArticleNovember 2009
Improving embedding efficiency via matrix embedding: a case study
Matrix embedding is proved an effective way to improve embedding efficiency of steganography, and usually, higher dimensional matrix will provide better embedding efficiency. However, the sender might suffer huge computational complexity when employing ...
- ArticleSeptember 2009
Employing Optimal Matrix for Efficient Matrix Embedding
IIH-MSP '09: Proceedings of the 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal ProcessingPages 161–165https://doi.org/10.1109/IIH-MSP.2009.59Embedding efficiency is a metric of steganographic security. Higher embedding efficiency is usually desired to get more secure steganography. Matrix embedding is proved an effective way to increase embedding efficiency of steganography, and meanwhile ...
- ArticleJune 2009
Constructing specific matrix for efficient matrix embedding
ICME'09: Proceedings of the 2009 IEEE international conference on Multimedia and ExpoPages 1006–1009In order to enhance the steganographic security, higher embedding efficiency (average number of secret data bits embedded per one embedding change) is desired. Matrix embedding is a well-known steganographic scheme that can improve the embedding ...
- ArticleApril 2009
Detecting LSB matching by characterizing the amplitude of histogram
ICASSP '09: Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal ProcessingPages 1505–1508https://doi.org/10.1109/ICASSP.2009.4959881In this paper, we present an improved method for detecting LSB matching steganography in gray-scale image. Our improvements focus on three aspects: (1) instead of using the amplitude of local extrema of the image's histogram in the previous work, we turn ...