X. Wang
profile photo

Xiangfeng Wang

I am a professor at East China Normal University and Shenzhen Loop Area Institute. Before that, I received my Bachelor degree and PhD degree from Nanjing University in 2009 and 2014 respectively under the supervision of Professor Bingsheng He. During my PhD study, I was supported by the National Scholarship Council for joint PhD at the University of Minnesota under the supervision of Professor Zhi-Quan Luo.

Email  /  Google Scholar

Research Areas

  • Agent (Optimizaiton, RL, MARL, LLM-based) and Applications

  • AI for Math/Optimization/Game Theory
  • Awards

  • 2024, Shanghai Open Source Innovation Excellence Award

  • 2024, CSIAM Applied Mathematics Implementation Award

  • 2023, Technology Cooperation Annual Outstanding Partner Award (Huawei Cloud)

  • 2022, Nomination Award for Youth Science and Technology Award of ORSC

  • Publications

    Journal Papers:

    1. Learning Roles with Emergent Social Value Orientations
      Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, and Hongyuan Zha
      IEEE Transactions on Pattern Analysis and Machine Intelligence 2025 | paper

    2. Machine learning-driven multi-agent pathfinding: An overview《机器学习驱动的多智能体路径搜寻算法综述》
      Xiangfeng Wang, and Wenhao Li
      Operations Research Transactions 运筹学学报 2023 | paper

    3. F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
      Wenhao Li, Bo Jin, Xiangfeng Wang, Junchi Yan, and Hongyuan Zha
      Journal of Machine Learning Research 2023 | paper

    4. Multi Agent Deep Reinforcement Learning-based Urban Traffic Signal Management《面向城市交通信号优化的多智能体强化学习综述》
      Yun Hua, Xiangfeng Wang, and Bo Jin
      Operations Research Transactions 运筹学学报 2022 | paper

    5. Learning Structured Communication for Multi-agent Reinforcement Learning
      Junjie Sheng, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Tsung-Hui Chang, Jun Wang, and Hongyuan Zha
      Autonomous Agents and Multi-Agent Systems 2022 | paper

    6. Structured Cooperative Reinforcement Learning with Time-varying Composite Action Space
      Wenhao Li, Xiangfeng Wang, Bo Jin, Dijun Luo, and Hongyuan Zha
      IEEE Transactions on Pattern Analysis and Machine Intelligence 2022 | paper

    7. Perturbation Techniques for Convergence Analysis of Proximal Gradient Method and Other First-order Algorithms via Variational Analysis
      Xiangfeng Wang, Jane J. Ye, Xiaoming Yuan, Shangzhi Zeng, and Jin Zhang
      Set-Valued and Variational Analysis 2022 | paper

    8. Learning to Schedule Multi-NUMA Virtual Machines via Reinforcement Learning
      Junjie Sheng, Yiqiu Hu, Wenli Zhou, Lei Zhu, Bo Jin, Jun Wang, and Xiangfeng Wang
      Pattern Recognition 2022 | paper

    9. The O(1/n) Worst-case Convergence Rate of ADMM with Variable Penalty Parameters 《变惩罚参数交替方向法的O(1/n)迭代复杂度分析》
      Xingju Cai, Xiangfeng Wang, and Wenxing Zhang
      Numerical Mathematics A Journal of Chinese Universities 高等学校计算数学学报 2021 | Commemorate the 100th anniversary of the birth of Professor Xuchu He

    10. Distributed and Parallel ADMM for Structured Nonconvex Optimization Problem
      Xiangfeng Wang, Junchi Yan, Bo Jin, and wenhao Li
      IEEE Transactions on Cybernetics 2021 | paper

    11. The Distance Between Convex Sets with Minkowski Sum Structure: Application to Collision Detection
      Xiangfeng Wang, Junping Zhang, and Wenxing Zhang
      Computational Optimization and Applications 2020 | paper

    12. A Block Successive Upper Bound Minimization Method of Multipliers for Linearly Constrained Convex Optimization
      Mingyi Hong, Tsung-Hui Chang, Xiangfeng Wang, Meisam Razaviyayn, Shiqian Ma, and Zhi-Quan Luo
      Mathematics of Operations Research 2020 | paper

    13. Joint Active Learning with Feature Selection via CUR Matrix Decomposition
      Changsheng Li, Xiangfeng Wang, Weishan Dong, Junchi Yan, Qingshan Liu, and Hongyuan Zha
      IEEE Transactions on Pattern Analysis and Machine Intelligence 2019 | paper

    14. Dynamic Structure Embedded Online Multiple-Output Regression for Streaming Data
      Changsheng Li, Fan Wei, Weishan Dong, Xiangfeng Wang, Qingshan Liu, and Xin Zhang
      IEEE Transactions on Pattern Analysis and Machine Intelligence 2019 | paper

    15. Implementing the ADMM to Big Datasets: A Case Study of LASSO
      Hangrui Yue, Qingzhi Yang, Xiangfeng Wang, and Xiaoming Yuan
      SIAM Journal on Scientific Computing 2018 | paper

    16. On the Flexibility of Block Coordinate Descent for Large-Scale Optimization
      Xiangfeng Wang, Wenjie Zhang, Junchi Yan, Xiaoming Yuan, and Hongyuan Zha
      Neurocomputing 2018 | paper

    17. Iteration Complexity Analysis of Block Coordinate Descent Methods
      Mingyi Hong, Xiangfeng Wang, Meisam Razaviyayn, and Zhi-Quan Luo
      Mathematical Programming 2017 | paper

    18. Asynchronous Distributed ADMM for Large-Scale Optimization-Part I: Algorithm and Convergence Analysis
      Tsung-Hui Chang, Mingyi Hong, Wei-Cheng Liao, and Xiangfeng Wang
      IEEE Transactions on Signal Processing 2016 | paper

    19. Asynchronous Distributed ADMM for Large-Scale Optimization-Part II: Linear Convergence Analysis and Numerical Performances
      Tsung-Hui Chang, Wei-Cheng Liao, Mingyi Hong, and Xiangfeng Wang
      IEEE Transactions on Signal Processing 2016 | paper

    20. On the Convergence Rate of a Class of Proximal-Based Decomposition Methods for Monotone Variational Inequalities
      Xiangfeng Wang
      Journal of the Operations Research Society of China 2015 | paper

    21. Solving Multiple-Block Separable Convex Minimization Problems using Two-Block ADMM
      Xiangfeng Wang, Mingyi Hong, Shiqian Ma, and Zhi-Quan Luo
      Pacific Journal of Optimization 2015 | paper

    22. Multi-Agent Distributed Large-Scale Optimization by Inexact Consensus ADMM
      Tsung-Hui Chang, Mingyi Hong, and Xiangfeng Wang
      IEEE Transactions on Signal Processing 2015 | paper | IEEE Signal Processing Society Best Paper Award

    23. The Linearized Alternating Direction Method of Multipliers for Dantzig Selector
      Xiangfeng Wang, and Xiaoming Yuan
      SIAM Journal of Scientific Computing 2012 | paper

    Conference Papers:

    1. Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents
      Yun Hua, Haosheng Chen, Shiqin Wang, Wenhao Li, Xiangfeng Wang, and Jun Luo
      NeurIPS 2025 | paper

    2. LOPT: Learning Optimal Pigovian Tax in Sequential Social Dilemmas
      Yun Hua, Shang Gao, Wenhao Li, Haosheng Chen, Bo Jin, Xiangfeng Wang, Jun Luo, and Hongyuan Zha
      NeurIPS 2025 | paper

    3. DES-Gymnax: Fast Discrete Event Simulator in JAX
      Yun Hua, Jun Luo, and Xiangfeng Wang
      WSC (Winter Simulation Conference) 2025 | paper

    4. Reward Translation via Reward Machine in Semi-Alignable MDPs
      Yun Hua, Haosheng Chen, Wenhao Li, Bo Jin, Baoxiang Wang, Hongyuan Zha, and Xiangfeng Wang
      ICML 2025 | paper

    5. SkyRover: A Modular Simulator for Cross-Domain Pathfinding
      Wenhui Ma, Wenhao Li, Bo Jin, Changhong Lu, and Xiangfeng Wang
      IJCAI-Demo 2025 | paper | SkyRover

    6. Multi-Agent Credit Assignment with Pretrained Language Models
      Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Wei Yin, Hao Shen, Bo Jin, and Hongyuan Zha
      AISTATS 2025 | paper

    7. SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation
      Junjie Sheng, Yanqiu Lin, Jiehao Wu, Jianqi Shi, Yanhong Huang, Min Zhang, and Xiangfeng Wang
      ICSE 2025 | paper

    8. Long-tailed Diffusion Models with Oriented Calibration
      Tianjiao Zhang, Huangjie Zheng, Jiangchao Yao, Xiangfeng Wang, Mingyuan Zhou, Ya Zhang, and Yanfeng Wang
      ICLR 2024 | paper

    9. Hierarchical Diffusion for Offline Decision Making
      Wenhao Li, Xiangfeng Wang, Bo Jin, and Hongyuan Zha
      ICML 2023 | paper

    10. Learning Cooperative Oversubscription for Cloud by Chance-Constrained MARL
      Junjie Sheng, Lu Wang, Fangkai Yang, Bo Qiao, Hang Dong, Xiangfeng Wang, Bo Jin, Jun Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, and Dongmei Zhang
      WWW 2023 | paper

    11. VMAgent: A Practical Virtual Machine Scheduling Platform
      Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, and Xiangfeng Wang
      IJCAI-Demo 2022 | paper | VMAgent

    12. Multi-Agent Path Finding with Prioritized Communication Learning
      Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, and Xiangfeng Wang
      ICRA 2022 | paper

    13. Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition
      Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, and Hongyuan Zha
      ICLR 2022 | paper

    14. HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem
      Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, and Hongyuan Zha
      KDD 2021 | paper

    15. Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
      Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, and Hongyuan Zha
      AAMAS 2021 | paper

    16. Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning
      Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Ya Zhang, and Yanfeng Wang
      CVPR 2020 | paper

    17. A Fast Proximal Point Method for Computing Exact Wasserstein Distance
      Yujia Xie, Xiangfeng Wang, Ruijia Wang, and Hongyuan Zha
      UAI 2019 | paper

    18. Deep eXtreme Multi-label Learning
      Wenjie Zhang, Junchi Yan, Xiangfeng Wang, and Hongyuan Zha
      ICMR 2018 | paper

    Books and Book Chapters:

    1. Splitting Optimization: Theory, Methodology, and Applications
      Xiangfeng Wang, Xingju Cai, and Deren Han
      Springer Nature 2025 | paper

    2. 群体智能/Swarm Intelligence
      王祥丰, 金博
      教育科学出版社-人工智能与智能教育丛书 2021

    3. 人机协同/Human-Machine Synergy
      吴信东, 王祥丰, 金博, 于政, 吴明辉
      科学出版社 2022

    4. 人工智能伦理与安全(第4章-安全可信人工智能)
      王晓玲, 王祥丰, 金博
      清华大学出版社 2022

    5. Block-wise Alternating Direction Method of Multipliers with Gaussian Back Substitution for Multiple-block Convex Programming
      Xiaoling Fu, Bingsheng He, Xiangfeng Wang, and Xiaoming Yuan
      Splitting Algorithms, Modern Operator Theory, and Applications 2019 | chapter

    Collaboration Network

    1. Huawei 华为云/海思 | Agent for Scheduling/Plannning/Operators

    2. TCL | Agent

    3. Shanghai AI Laboratory | Multi-Agent Collaboration/AI for Social Science

    Welcome to use this website's source code, just add a link back to here.
    No. Visitor Since Feb 2022. Powered by w3.css