I am a first-year Ph.D. student at the Gaoling School of Artificial Intelligence, Renmin University of China, fortunate to be co-advised by Prof. Zhicheng Dou and Prof. Ji-Rong Wen. Previously, I received M.Eng (2024) and B.Eng (2021) degrees in Information and Communication Engineering from Beijing University of Posts and Telecommunications (BUPT), advised by Prof. Weiran Xu. Before that, I was a research intern at the Alibaba Qwen Team and the Meituan NLP Center.
Currently, my research interests focus on:
- Alignment for Large Language Models: Foundation Modeling (Qwen2, Qwen2.5), Data Composition (DMT), Instruction Following (AUTOIF, IC-IFD)
- Large Language Model Reasoning: Mathematics (RFT, MuggleMath), Coding (DolphCoder, XCoder), Multimodal (AR-MCTS, We-Math), Scientific (CS-Bench), Tool-Integrated (DotaMath)
- Deep Search Agent: Deep Search & Research (Search-o1, WebThinker), Preference Alignment (DPA-RAG), Instruction Following (VIF-RAG), Reward Modeling (RAG-Critic), Knowledge Alignment (SKP, ChatKBQA), Modular Toolkit (FlashRAG), Emotion Recognition (InstructERC)
My long-term goal is to explore automated, scalable, and safe approaches that foster exceptional intelligence on the path to AGI.
Feel free to email me for any form of academic cooperation!