WENHAO LI'S PERSONAL WEBSITE

Wenhao Li

Post Doctoral Fellow
Institute of Robotics and Intelligent Manufacturing,
the Chinese University of Hong Kong, Shenzhen.

Former Researcher at Tencent AI Lab.
Ph.D. from East China Normal University,
advised by Prof. Aimin Zhou and Prof. Hongyuan Zha,
co-advised by A.P. Bo Jin and A.P. Xiangfeng Wang.

My primary research focus encompasses the resolution of intricate multi-agent collaboration dilemmas within diverse real-world decision-making tasks using multi-agent reinforcement learning (MARL) algorithms. My published works chiefly comprise two facets: methodologies and practical applications:

Concerning methodologies, I strive to devise MARL algorithms exuding superior robustness, scalability, and transferability. Additionally, I endeavor to incorporate generative models (GFlowNet, diffusion models, etc.) within MARL, aiming to augment the expressive capacity of neural network policies and bolster algorithmic competence in tackling high-dimensional problems.
In the realm of applications, I am dedicated to remodeling a variety of real-world issues from the perspective of stochastic games, encompassing image segmentation, multi-agent path-finding, and precision agriculture, utilizing MARL algorithms for resolutions. Concurrently, my interests also lie in the fusion of MARL with computational social science, formulating and solving quintessential social dilemmas in social sciences through a stochastic game lens.

High sample complexity and weak transferability capabilities impede the existing MARL implementations, akin to surmounting "two formidable mountains." The advent of large pre-trained language models (LLMs), embedded with human knowledge and exhibiting potent zero-shot generalization capability on novel tasks, illuminates a feasible trajectory for obliterating these obstacles. Consequently, my recent scholarly pursuits have centered around decision-making with large pre-trained models, with submitted manuscripts chiefly outlining the following technical approaches:

LLM + MARL (AI Agents): Through the integration of the vast human knowledge encapsulated within LLMs into the MARL paradigm, the colossal exploration space hitherto demanded by end-to-end learning can be significantly mitigated, thereby substantially enhancing the sample efficiency of algorithms.
In-context MARL: Evidence suggests that the formidable zero-shot generalization capability of LLMs partly stems from their in-context learning capabilities. By endowing the reinforcement learning paradigm with in-context learning faculties, I seek to bolster decision-making proficiencies across various specialized, cooperative tasks in diverse domains.

Curriculum vitae (EN)

Curriculum Vitae (CN)

2024
News: 1 paper accepted at IJCAI 2024, in close collaboration with Han Wang!
News: 1 tutorial accepted at AAMAS 2024, in close collaboration with Prof. Xiangfeng Wang, Dr. Junjie Sheng and Dr. Yun Hua!
News: 1 (single-author) paper accepted at ICLR 2024!

2023
News: 1 paper accepted at NeurIPS 2023, in close collaboration with Yue Lin!
News: 1 paper accepted at Journal of Machine Learning Research (long paper, 75 pages)!
News: 1 paper accepted at ICML 2023, 3 papers accepted at AAMAS 2023!

2022
News: I am awarded the fellowship of China Postdoctoral Science Foundation!
News: Congratulations to my co-advisor, A.P. Xiangfeng Wang, for winning the IEEE Signal Processing Society 2021 Best Paper Award for his paper "Multi-Agent Distributed Optimization via Inexact Consensus ADMM"!
News: 1 paper accepted at IJCAI 2022 & 1 paper accepted at DASFAA 2022!
News: 1 paper accepted at ICRA 2022 & 1 paper accepted at ICLR 2022!

Received Bachelor of Engineering degree in 2016 from Base Class of Computer Science, School of Information Science and Engineering, Lanzhou University, and received Master of Engineering degree in 2019 from Institute of Computer Science and Software Engineering, East China Normal University.