Chenjia Bai
2020 – today
- 2025
- [j14]Tong Li, Chenjia Bai, Kang Xu, Chen Chu, Peican Zhu, Zhen Wang:
Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning. Neural Networks 181: 106852 (2025)
- 2024
- [j13]Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li:
Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning. Artif. Intell. 326: 104048 (2024)
- [j12]Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang:
Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning. Inf. Sci. 680: 121146 (2024)
- [j11]Xiaoyu Wen, Xudong Yu, Rui Yang, Haoyuan Chen, Chenjia Bai, Zhen Wang:
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness. J. Artif. Intell. Res. 81: 481-509 (2024)
- [j10]Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, Jing Jiang:
False Correlation Reduction for Offline Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 1199-1211 (2024)
- [j9]Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang:
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8762-8782 (2024)
- [j8]Chenjia Bai, Ting Xiao, Zhoufan Zhu, Lingxiao Wang, Fan Zhou, Animesh Garg, Bin He, Peng Liu, Zhaoran Wang:
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8954-8968 (2024)
- [c16]Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. AAAI 2024: 13954-13962
- [c15]Shirong Liu, Chenjia Bai, Zixian Guo, Hao Zhang, Gaurav Sharma, Yang Liu:
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning. ECAI 2024: 2306-2313
- [c14]Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li:
Constrained Ensemble Exploration for Unsupervised Skill Discovery. ICML 2024
- [c13]Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. ICML 2024
- [c12]Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang:
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning. ICML 2024
- [c11]Junjie Zhang, Chenjia Bai, Haoran He, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li:
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation. ICML 2024
- [c10]Sirui Zheng, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
How Does Goal Relabeling Improve Sample Efficiency? ICML 2024
- [c9]Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang, Bin Zhao, Mingguo Zhao, Xiu Li, Xuelong Li:
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. ICRA 2024: 11459-11466
- [i33]Haoran He, Chenjia Bai, Ling Pan, Weinan Zhang, Bin Zhao, Xuelong Li:
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning. CoRR abs/2402.14407 (2024)
- [i32]Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li:
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment. CoRR abs/2404.04920 (2024)
- [i31]Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang:
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning. CoRR abs/2404.06188 (2024)
- [i30]Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li:
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning. CoRR abs/2404.19292 (2024)
- [i29]Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li:
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning. CoRR abs/2404.19346 (2024)
- [i28]Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang:
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning. CoRR abs/2405.06192 (2024)
- [i27]Changhong Wang, Xudong Yu, Chenjia Bai, Qiaosheng Zhang, Zhen Wang:
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning. CoRR abs/2405.07223 (2024)
- [i26]Yang Zhang, Shixin Yang, Chenjia Bai, Fei Wu, Xiu Li, Zhen Wang, Xuelong Li:
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration. CoRR abs/2405.14314 (2024)
- [i25]Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. CoRR abs/2405.15369 (2024)
- [i24]Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li:
Constrained Ensemble Exploration for Unsupervised Skill Discovery. CoRR abs/2405.16030 (2024)
- [i23]Junjie Zhang, Chenjia Bai, Haoran He, Wenke Xia, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li:
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation. CoRR abs/2405.19586 (2024)
- [i22]Yang Zhang, Chenjia Bai, Bin Zhao, Junchi Yan, Xiu Li, Xuelong Li:
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models. CoRR abs/2406.15836 (2024)
- [i21]Shirong Liu, Chenjia Bai, Zixian Guo, Hao Zhang, Gaurav Sharma, Yang Liu:
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning. CoRR abs/2408.02165 (2024)
- [i20]Zhao Shan, Chenyou Fan, Shuang Qiu, Jiyuan Shi, Chenjia Bai:
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies. CoRR abs/2409.05622 (2024)
- [i19]Chenyou Fan, Chenjia Bai, Zhao Shan, Haoran He, Yang Zhang, Zhen Wang:
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner. CoRR abs/2409.19949 (2024)
- [i18]Xinyi Yuan, Zhiwei Shang, Zifan Wang, Chenkai Wang, Zhao Shan, Zhenchao Qi, Meixin Zhu, Chenjia Bai, Xuelong Li:
Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control. CoRR abs/2410.13586 (2024)
- [i17]Jiafei Lyu, Kang Xu, Jiacheng Xu, Mengbei Yan, Jingwen Yang, Zongzhang Zhang, Chenjia Bai, Zongqing Lu, Xiu Li:
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning. CoRR abs/2410.20750 (2024)
- 2023
- [j7]Chenjia Bai, Lingxiao Wang, Yixin Wang, Zhaoran Wang, Rui Zhao, Chenyao Bai, Peng Liu:
Addressing Hindsight Bias in Multigoal Reinforcement Learning. IEEE Trans. Cybern. 53(1): 392-405 (2023)
- [j6]Chenjia Bai, Peng Liu, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao, Lei Han, Zhaoran Wang:
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 34(8): 4776-4790 (2023)
- [j5]Xudong Yu, Chenjia Bai, Changhong Wang, Dengxiu Yu, C. L. Philip Chen, Zhen Wang:
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling. IEEE Trans. Syst. Man Cybern. Syst. 53(12): 7732-7743 (2023)
- [c8]Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li:
Behavior Contrastive Learning for Unsupervised Skill Discovery. ICML 2023: 39183-39204
- [c7]Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li:
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning. NeurIPS 2023
- [c6]Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. NeurIPS 2023
- [i16]Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li:
Behavior Contrastive Learning for Unsupervised Skill Discovery. CoRR abs/2305.04477 (2023)
- [i15]Kang Xu, Chenjia Bai, Shuang Qiu, Haoran He, Bin Zhao, Zhen Wang, Wei Li, Xuelong Li:
On the Value of Myopic Behavior in Policy Reuse. CoRR abs/2305.17623 (2023)
- [i14]Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. CoRR abs/2305.17625 (2023)
- [i13]Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li:
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning. CoRR abs/2305.18459 (2023)
- [i12]Haoran He, Chenjia Bai, Hang Lai, Lingxiao Wang, Weinan Zhang:
Privileged Knowledge Distillation for Sim-to-Real Policy Generalization. CoRR abs/2305.18464 (2023)
- [i11]Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang, Bin Zhao, Xiu Li, Xuelong Li:
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. CoRR abs/2308.09405 (2023)
- [i10]Xiaoyu Wen, Xudong Yu, Rui Yang, Chenjia Bai, Zhen Wang:
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness. CoRR abs/2309.16973 (2023)
- [i9]Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. CoRR abs/2312.12145 (2023)
- 2022
- [c5]Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhi-Hong Deng, Animesh Garg, Peng Liu, Zhaoran Wang:
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning. ICLR 2022
- [c4]Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. ICML 2022: 18168-18210
- [c3]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022
- [i8]Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhihong Deng, Animesh Garg, Peng Liu, Zhaoran Wang:
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning. CoRR abs/2202.11566 (2022)
- [i7]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. CoRR abs/2206.02829 (2022)
- [i6]Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. CoRR abs/2207.14800 (2022)
- 2021
- [c2]Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang:
Principled Exploration via Optimistic Bootstrapping and Backward Induction. ICML 2021: 577-587
- [c1]Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang:
Dynamic Bottleneck for Robust Self-Supervised Exploration. NeurIPS 2021: 17007-17020
- [i5]Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang:
Principled Exploration via Optimistic Bootstrapping and Backward Induction. CoRR abs/2105.06022 (2021)
- [i4]Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Jianye Hao, Zhaopeng Meng, Peng Liu:
Exploration in Deep Reinforcement Learning: A Comprehensive Survey. CoRR abs/2109.06668 (2021)
- [i3]Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang:
Dynamic Bottleneck for Robust Self-Supervised Exploration. CoRR abs/2110.10735 (2021)
- [i2]Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Zhaoran Wang, Jing Jiang:
SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning. CoRR abs/2110.12468 (2021)
- 2020
- [j4]Weiyi Ang, Chenjia Bai, Chao Cai, Yingnan Zhao, Peng Liu:
Survey on Sparse Reward in Deep Reinforcement Learning (深度强化学习中稀疏奖励问题研究综述). 计算机科学 (Computer Science) 47(3): 182-191 (2020)
- [j3]Yingnan Zhao, Peng Liu, Chenjia Bai, Wei Zhao, Xianglong Tang:
Obtaining accurate estimated action values in categorical distributional reinforcement learning. Knowl. Based Syst. 194: 105511 (2020)
- [j2]Peng Liu, Chenjia Bai, Yingnan Zhao, Chenyao Bai, Wei Zhao, Xianglong Tang:
Generating attentive goals for prioritized hindsight reinforcement learning. Knowl. Based Syst. 203: 106140 (2020)
- [i1]Chenjia Bai, Peng Liu, Zhaoran Wang, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao:
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. CoRR abs/2010.08755 (2020)
2010 – 2019
- 2019
- [j1]Chenjia Bai, Peng Liu, Wei Zhao, Xianglong Tang:
Guided goal generation for hindsight multi-goal reinforcement learning. Neurocomputing 359: 353-367 (2019)
last updated on 2024-12-12 21:59 CET by the dblp team
all metadata released as open data under CC0 1.0 license