More Web Proxy on the site http://driver.im/

default search action

combined dblp search
author search
venue search
publication search

ask others

Chenjia Bai

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/LiBXCZW25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/LiBXCZW25
Tong Li, Chenjia Bai, Kang Xu, Chen Chu, Peican Zhu, Zhen Wang:
Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning. Neural Networks 181: 106852 (2025)
2024
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ai/BaiWHYZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ai/BaiWHYZWL24
Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li:
Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning. Artif. Intell. 326: 104048 (2024)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/isci/YuBGWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/isci/YuBGWW24
Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang:
Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning. Inf. Sci. 680: 121146 (2024)
[j11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jair/WenYYCBW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jair/WenYYCBW24
Xiaoyu Wen, Xudong Yu, Rui Yang, Haoyuan Chen, Chenjia Bai, Zhen Wang:
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness. J. Artif. Intell. Res. 81: 481-509 (2024)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/DengFWYBZWJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/DengFWYBZWJ24
Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, Jing Jiang:
False Correlation Reduction for Offline Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 1199-1211 (2024)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/HaoYTBLMLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/HaoYTBLMLW24
Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang:
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8762-8782 (2024)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/BaiXZWZGHLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/BaiXZWZGHLW24
Chenjia Bai, Ting Xiao, Zhoufan Zhu, Lingxiao Wang, Fan Zhou, Animesh Garg, Bin He, Peng Liu, Zhaoran Wang:
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8954-8968 (2024)
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/0002WZHBYWPS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/0002WZHBYWPS24
Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. AAAI 2024: 13954-13962
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/ecai/LiuBGZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/LiuBGZ0024
Shirong Liu, Chenjia Bai, Zixian Guo, Hao Zhang, Gaurav Sharma, Yang Liu:
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning. ECAI 2024: 2306-2313
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BaiYZXCX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiYZXCX024
Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li:
Constrained Ensemble Exploration for Unsupervised Skill Discovery. ICML 2024
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LyuBYL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LyuBYL024
Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. ICML 2024
[c12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WenBXYZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WenBXYZ0024
Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang:
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning. ICML 2024
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhangBH000024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangBH000024
Junjie Zhang, Chenjia Bai, Haoran He, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li:
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation. ICML 2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhengBY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhengBY024
Sirui Zheng, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
How Does Goal Relabeling Improve Sample Efficiency? ICML 2024
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/ShiBH000Z0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/ShiBH000Z0024
Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang, Bin Zhao, Mingguo Zhao, Xiu Li, Xuelong Li:
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. ICRA 2024: 11459-11466
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-14407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-14407
Haoran He, Chenjia Bai, Ling Pan, Weinan Zhang, Bin Zhao, Xuelong Li:
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning. CoRR abs/2402.14407 (2024)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-04920
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-04920
Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li:
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment. CoRR abs/2404.04920 (2024)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06188
Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang:
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning. CoRR abs/2404.06188 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-19292
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-19292
Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li:
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning. CoRR abs/2404.19292 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-19346
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-19346
Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li:
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning. CoRR abs/2404.19346 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-06192
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-06192
Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang:
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning. CoRR abs/2405.06192 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-07223
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-07223
Changhong Wang, Xudong Yu, Chenjia Bai, Qiaosheng Zhang, Zhen Wang:
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning. CoRR abs/2405.07223 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14314
Yang Zhang, Shixin Yang, Chenjia Bai, Fei Wu, Xiu Li, Zhen Wang, Xuelong Li:
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration. CoRR abs/2405.14314 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-15369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-15369
Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. CoRR abs/2405.15369 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16030
Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li:
Constrained Ensemble Exploration for Unsupervised Skill Discovery. CoRR abs/2405.16030 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19586
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19586
Junjie Zhang, Chenjia Bai, Haoran He, Wenke Xia, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li:
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation. CoRR abs/2405.19586 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15836
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15836
Yang Zhang, Chenjia Bai, Bin Zhao, Junchi Yan, Xiu Li, Xuelong Li:
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models. CoRR abs/2406.15836 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-02165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02165
Shirong Liu, Chenjia Bai, Zixian Guo, Hao Zhang, Gaurav Sharma, Yang Liu:
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning. CoRR abs/2408.02165 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-05622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-05622
Zhao Shan, Chenyou Fan, Shuang Qiu, Jiyuan Shi, Chenjia Bai:
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies. CoRR abs/2409.05622 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-19949
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-19949
Chenyou Fan, Chenjia Bai, Zhao Shan, Haoran He, Yang Zhang, Zhen Wang:
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner. CoRR abs/2409.19949 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-13586
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-13586
Xinyi Yuan, Zhiwei Shang, Zifan Wang, Chenkai Wang, Zhao Shan, Zhenchao Qi, Meixin Zhu, Chenjia Bai, Xuelong Li:
Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control. CoRR abs/2410.13586 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-20750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-20750
Jiafei Lyu, Kang Xu, Jiacheng Xu, Mengbei Yan, Jingwen Yang, Zongzhang Zhang, Chenjia Bai, Zongqing Lu, Xiu Li:
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning. CoRR abs/2410.20750 (2024)
2023
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/tcyb/BaiWWWZBL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcyb/BaiWWWZBL23
Chenjia Bai, Lingxiao Wang, Yixin Wang, Zhaoran Wang, Rui Zhao, Chenyao Bai, Peng Liu:
Addressing Hindsight Bias in Multigoal Reinforcement Learning. IEEE Trans. Cybern. 53(1): 392-405 (2023)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/BaiLLWZHW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/BaiLLWZHW23
Chenjia Bai, Peng Liu, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao, Lei Han, Zhaoran Wang:
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 34(8): 4776-4790 (2023)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tsmc/YuBWYCW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/YuBWYCW23
Xudong Yu, Chenjia Bai, Changhong Wang, Dengxiu Yu, C. L. Philip Chen, Zhen Wang:
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling. IEEE Trans. Syst. Man Cybern. Syst. 53(12): 7732-7743 (2023)
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangBGLZW0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangBGLZW0023
Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li:
Behavior Contrastive Learning for Unsupervised Skill Discovery. ICML 2023: 39183-39204
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HeBXY0WZ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HeBXY0WZ023
Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li:
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning. NeurIPS 2023
[c6]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/XuBMWZW0L23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuBMWZW0L23
Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. NeurIPS 2023
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-04477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04477
Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li:
Behavior Contrastive Learning for Unsupervised Skill Discovery. CoRR abs/2305.04477 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17623
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17623
Kang Xu, Chenjia Bai, Shuang Qiu, Haoran He, Bin Zhao, Zhen Wang, Wei Li, Xuelong Li:
On the Value of Myopic Behavior in Policy Reuse. CoRR abs/2305.17623 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17625
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17625
Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. CoRR abs/2305.17625 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18459
Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li:
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning. CoRR abs/2305.18459 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18464
Haoran He, Chenjia Bai, Hang Lai, Lingxiao Wang, Weinan Zhang:
Privileged Knowledge Distillation for Sim-to-Real Policy Generalization. CoRR abs/2305.18464 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-09405
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-09405
Jiyuan Shi, Chenjia Bai, Haoran He, Lei Han, Dong Wang, Bin Zhao, Xiu Li, Xuelong Li:
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. CoRR abs/2308.09405 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-16973
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-16973
Xiaoyu Wen, Xudong Yu, Rui Yang, Chenjia Bai, Zhen Wang:
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness. CoRR abs/2309.16973 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12145
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12145
Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. CoRR abs/2312.12145 (2023)
2022
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Bai0YDG0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Bai0YDG0W22
Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhi-Hong Deng, Animesh Garg, Peng Liu, Zhaoran Wang:
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning. ICLR 2022
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/QiuWBYW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/QiuWBYW22
Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. ICML 2022: 18168-18210
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangBMWZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangBMWZH22
Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-11566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-11566
Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhihong Deng, Animesh Garg, Peng Liu, Zhaoran Wang:
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning. CoRR abs/2202.11566 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02829
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02829
Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. CoRR abs/2206.02829 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-14800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-14800
Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, Zhaoran Wang:
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. CoRR abs/2207.14800 (2022)
2021
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BaiWHHG0W21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiWHHG0W21
Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang:
Principled Exploration via Optimistic Bootstrapping and Backward Induction. ICML 2021: 577-587
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/BaiWHGHLW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiWHGHLW21
Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang:
Dynamic Bottleneck for Robust Self-Supervised Exploration. NeurIPS 2021: 17007-17020
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-06022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-06022
Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang:
Principled Exploration via Optimistic Bootstrapping and Backward Induction. CoRR abs/2105.06022 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-06668
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-06668
Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Jianye Hao, Zhaopeng Meng, Peng Liu:
Exploration in Deep Reinforcement Learning: A Comprehensive Survey. CoRR abs/2109.06668 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10735
Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang:
Dynamic Bottleneck for Robust Self-Supervised Exploration. CoRR abs/2110.10735 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-12468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-12468
Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Zhaoran Wang, Jing Jiang:
SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning. CoRR abs/2110.12468 (2021)
2020
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/jsjkx/AngBCZ020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jsjkx/AngBCZ020
Weiyi Ang, Chenjia Bai, Chao Cai, Yingnan Zhao, Peng Liu:
深度强化学习中稀疏奖励问题研究综述 (Survey on Sparse Reward in Deep Reinforcement Learning). 计算机科学 47(3): 182-191 (2020)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/kbs/ZhaoLBZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kbs/ZhaoLBZT20
Yingnan Zhao, Peng Liu, Chenjia Bai, Wei Zhao, Xianglong Tang:
Obtaining accurate estimated action values in categorical distributional reinforcement learning. Knowl. Based Syst. 194: 105511 (2020)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/kbs/LiuBZBZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kbs/LiuBZBZT20
Peng Liu, Chenjia Bai, Yingnan Zhao, Chenyao Bai, Wei Zhao, Xianglong Tang:
Generating attentive goals for prioritized hindsight reinforcement learning. Knowl. Based Syst. 203: 106140 (2020)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08755
Chenjia Bai, Peng Liu, Zhaoran Wang, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao:
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. CoRR abs/2010.08755 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/BaiLZT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/BaiLZT19
Chenjia Bai, Peng Liu, Wei Zhao, Xianglong Tang:
Guided goal generation for hindsight multi-goal reinforcement learning. Neurocomputing 359: 353-367 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.