Preprints
Reward Translation via Reward Machine in Semi-Alignable MDPs
Hua, Yun; Li, Wenhao; Jin, Bo; Wang, Baoxiang; He, Xiaofeng; Zha, Hongyuan; Wang, Xiangfeng |
|
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects
Cheng, Yuheng; Zhang, Ceyao; Zhang, Zhengwen; Meng, Xiangrui; Hong, Sirui; Li, Wenhao; Wang, Zihao; Wang, Zekai; Yin, Feng; Zhao, Junhua; He, Xiuqiang; |
|
Complementary Information Mutual Learning for Multimodality Medical Image Segmentation
Shen, Chuyun; Li, Wenhao; Chen, Haoqing; Wang, Xiaoling; Zhu, Fengping; Li, Yunxin; Wang, Xiangfeng; Jin, Bo; |
|
Can Language Agents Be Alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym
Sheng, Junjie; Huang, Zixiao; Shen, Chuyun; Li, Wenhao; Hua, Yun; Jin, Bo; Zha, Hongyuan; Wang, Xiangfeng; |
|
Negotiated Reasoning: On Provably Addressing Relative Over-Generalization
Sheng, Junjie; Li, Wenhao; Jin, Bo; Zha, Hongyuan; Wang, Jun; Wang, Xiangfeng; |
|
Learning Roles with Emergent Social Value Orientations
Li, Wenhao; Wang, Xiangfeng; Jin, Bo; Lu, Jingyi; Zha, Hongyuan; |
|
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Li, Wenhao; Qiao, Dan; Wang, Baoxiang; Wang, Xiangfeng; Jin, Bo; Zha, Hongyuan; |
|
Publications
Efficient Planning with Latent Diffusion (ICLR 2024)
Li, Wenhao; |
|
Machine Learning-Driven Multi-Agent Pathfinding: An Overview (Operations Research Transactions 2023)
Wang, Xiangfeng; Li, Wenhao; |
|
Learning Structured Communication For Multi-Agent Reinforcement Learning (JAAMAS 2023)
Sheng, Junjie; Wang, Xiangfeng; Jin, Bo; Yan, Junchi; Li, Wenhao; Chang, Tsunghui; Wang, Jun; Zha, Hongyuan; |
|
Information Design in Multi-Agent Reinforcement Learning (NeurIPS 2023)
Lin, Yue; Li, Wenhao; Zha, Hongyuan; Wang, Baoxiang; |
|
Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation (BIBM 2023)
Shen, Chuyun; Li, Wenhao; Zhang, Ya; Wang, Xiangfeng; |
|
Hierarchical Diffusion for Offline Decision Making (ICML 2023)
Li, Wenhao; Wang, Xiangfeng; Jin, Bo; Zha, Hongyuan; |
|
Learning Optimal “Pigovian Tax” in Sequential Social Dilemmas (AAMAS 2023)
Hua, Yun; Gao, Shang; Li, Wenhao; Jin, Bo; Wang, Xiangfeng; Zha, Hongyuan; |
|
Model-Based Reinforcement Learning for Auto-Bidding in Display Advertising (AAMAS 2023)
Chen, Shuang; Xu, Qisen; Zhang, Liang; Jin, Yongbo; Li, Wenhao; Mo, Linjian; |
|
Diverse Policy Optimization for Structured Action Space (AAMAS 2023)
Li, Wenhao; Wang, Baoxiang; Yang, Shanchao; Zha, Hongyuan; |
|
F2A2: Flexible fully-decentralized approximate actor-critic for cooperative multi-agent reinforcement learning (JMLR 2023)
Li, Wenhao; Jin, Bo; Wang, Xiangfeng; Yan, Junchi; Zha, Hongyuan; |
|
Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration (FITEE 2023)
Li, Wenhao; Shen, Chuyun; Xu, Qisen; Hu, Bin; Jin, Bo; Cai, Haibing; Zhu, Fengping; Li, Yuxin; Wang, Xiangfeng; |
|
VMAgent: Scheduling Simulator for Reinforcement Learning (IJCAI 2022)
Sheng, Junjie; Cai, Shengliang; Cui, Haochuan; Li, Wenhao; Hua, Yun; Jin, Bo; Zhou, Wenli; Hu, Yiqiu; Zhu, Lei; Peng, Qian; |
|
Weighted Mean-Field Multi-Agent Reinforcement Learning via Reward Attribution Decomposition (DASFAA 2022)
Wu, Tingyu; Li, Wenhao; Jin, Bo; Zhang, Wei; Wang, Xiangfeng; |
|
Multi-Agent Path Finding with Prioritized Communication Learning (ICRA 2022)
Li, Wenhao; Chen, Hongjun; Jin, Bo; Tan, Wenzhe; Zha, Hongyuan; Wang, Xiangfeng; |
|
Dealing with Non-Stationarity in MARL via Trust Region Decomposition (ICLR 2022)
Li, Wenhao; Wang, Xiangfeng; Jin, Bo; Sheng, Junjie; Zha, Hongyuan; |
|
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem (KDD 2021)
Hua, Yun; Wang, Xiangfeng; Jin, Bo; Li, Wenhao; Yan, Junchi; He, Xiaofeng; Zha, Hongyuan; |
|
Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning (AAMAS 2021)
Li, Wenhao; Wang, Xiangfeng; Jin, Bo; Sheng, Junjie; Hua, Yun; Zha, Hongyuan; |
|
Structured Cooperative Reinforcement Learning with Time-varying Composite Action Space (TPAMI 2021)
Li, Wenhao; Wang, Xiangfeng; Jin, Bo; Luo, Dijun; Zha, Hongyuan; |
|
Iteratively-Refined Interactive 3d Medical Image Segmentation with Multi-Agent Reinforcement Learning (CVPR 2020)
Liao, Xuan; Li, Wenhao; Xu, Qisen; Wang, Xiangfeng; Jin, Bo; Zhang, Xiaoyun; Wang, Yanfeng; Zhang, Ya; |
|
SparseMAAC: Sparse attention for multi-agent reinforcement learning (DASFAA 2019)
Li, Wenhao; Jin, Bo; Wang, Xiangfeng; |
|
Distributed and parallel ADMM for structured nonconvex optimization problem (IEEE Transactions on Cybernetics 2019)
Wang, Xiangfeng; Yan, Junchi; Jin, Bo; Li, Wenhao; |
|