Welcome to Bin Zhang’s Personal Homepage!
I am a fourth-year Ph.D. student at the Institute of Automation, Chinese Academy of Sciences. My research interests include Reinforcement Learning, Multi-agent System, and Large Language Model (LLM) Agents. I am very fortunate to be advised by Prof. Guoliang Fan (范国梁) of Complex Systems Cognition and Decision Making Lab.
In my doctoral studies, I have been investigating how to enable autonomous agents to learn and make decisions in complex, multi-agent environments. This includes developing new reinforcement learning algorithms that allow agents to cooperate, compete, and adapt in the face of uncertainty and partial information. I have published more than 20 papers at the international AI conferences with total google scholar citations 150+ (You can also use google scholar badge ).
If you are interested with my experience or research. Plese feel free to contact with me! Wechat
🔥 News
- 2024.05: 🎉🎉 One first-author paper is accepted by ICML 2024!
- 2024.04: 🎉🎉 One paper is accepted by IJCAI 2024!
- 2024.03: 🎉🎉 One paper is accepted by IJCNN 2024!
- 2024.02: 🎉🎉 One first-author paper is accepted by ICLR 2024 Workshop on LLM Agents!
- 2024.02: 🎉🎉 One paper is accepted by ICLR 2024 Workshop on LLM Agents!
- 2023.12: 🎉🎉 Two papers are accepted by AAMAS 2024!
- 2023.12: 🎉🎉 One paper is accepted by ICASSP 2024!
- 2023.09: 🎉🎉 One paper is accepted by NeurIPS 2023!
- 2023.04: 🎉🎉 One first-author paper is accepted by IJCAI 2023!
📖 Educations
- 2020.09 - 2025.06 (expected), Ph.D in Institute of Automation, Chinese Academy of Sciences
- 2016.09 - 2020.06, B.S. in School of Control Science and Engineering, Shandong University
💬 Research Interest
- Reinforcement Learning
- Multi-agent Coordination
- LLM agents (Tool Learning, Text-to-SQL, Text-based Game)
📝 Publications
Tptu: Task planning and tool usage of large language model-based ai agents
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao
- We propose a structured framework tailored for LLM-based AI Agents and discuss the crucial capabilities necessary for tackling intricate problems.
Benchmarking the text-to-sql capability of large language models: A comprehensive evaluation
Bin Zhang, Yuxiao Ye, Guoqing Du, Xiaoru Hu, Zhishuai Li, Sun Yang, Chi Harold Liu, Rui Zhao, Ziyue Li, Hangyu Mao
- We evaluate LLMs through five Text-to-SQL related tasks, reveal performance differences and suggest task-specific optimization strategies.
ICML 2024
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems, Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang FanIJCAI 2024
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning, Yiqun Chen, Hangyu Mao, Jiaxin Mao, Shiguang Wu, Tianle Zhang, Bin Zhang, Wei Yang, Hongxing ChangICLR 2024 Workshop on LLM Agents
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach, Bin Zhang, Hangyu Mao, Jingqing Ruan, et al.ICLR 2024 Workshop on LLM Agents
Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems, Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, et al.IJCNN 2024
SGCD: Subgroup Contribution Decomposition for Multi-Agent Reinforcement Learning, Hao Chen, Bin Zhang, Guoliang Fan.AAMAS 2024
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning, Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, et al.AAMAS Extended Abstract 2024
From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL, Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan.ICASSP 2024
Adaptive Parameter Sharing for Multi-Agent Reinforcement Learning, Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, Guoliang FanNeurIPS 2023
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL, Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang FanIJCAI 2023
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning, Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang FanIJCNN 2023
SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning, Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang FanAAAI 2023
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning, Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang FanAAAI 2023
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism, Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, and Guoliang FanNeurIPS 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning, Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, and Guoliang FanICONIP 2022
Multi-Agent Hyper-Attention Policy Optimization, Bin Zhang, Zhiwei Xu, Yiqun Chen, Dapeng Li, Yunpeng Bai, Guoliang Fan, and Lijuan LiICONIP 2022
Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network, Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, and Guoliang FanIJCNN 2022
Cooperative Multi-agent Reinforcement Learning with Hypergraph Convolution, Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu LiuAAMAS 2022
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning, Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, and Guoliang FanICONIP 2022
Learning to Coordinate via Multiple Graph Neural Networks, Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, and Guoliang Fan
🧑🎨 Preprint
arXiv:2403.02951
Benchmarking the text-to-sql capability of large language models: A comprehensive evaluation, Bin Zhang, Yuxiao Ye, Guoqing Du, Xiaoru Hu, Zhishuai Li, Sun Yang, Chi Harold Liu, Rui Zhao, Ziyue Li, Hangyu MaoarXiv:2403.09732
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency, Zhishuai Li, Xiang Wang, Jingjing Zhao, Sun Yang, Guoqing Du, Xiaoru Hu, Bin Zhang, Yuxiao Ye, Ziyue Li, Rui Zhao, Hangyu Mao- Constructing Informative Subtask Representations for Multi-Agent Coordination, Guangchong Zhou, Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guoliang Fan
🎖 Honors and Awards
- 2023, Future Star Award, SenseTime Research (Intern Top 1)
- 2023, Merit Student, University of Chinese Academy of Sciences
- 2022, Climbing Scholarship, University of Chinese Academy of Sciences
- 2020, Outstanding Graduates, Shandong Province
- 2020, Weichai Power Scholarship, Shandong University
- 2017-2019, National Scholarship, Ministry of Education (2 times)
- 2017-2020, First-class Scholarship, Shandong University (3 times)
💻 Internships
- 2023.07 - 2024.04, SenseTime Research, China.