Hongru (Merlin) Wang

Research Associate @ EdinburghNLP and EdinburghAI

prof_pic.jpg

I am currently a research associate (postdoc) at University of Edinburgh, working closely with Prof. Amos Storkey and Prof. Jeff Z. Pan, working on theory of agent (mainly planning, memory, self-evolving). I received PhD degree from The Chinese University of Hong Kong under the supervision of Prof. Kam-Fai Wong (ACL Fellow). I spent wonderful time at BlenderLab at University of Illinois Urbana-Champaign during my Ph.D study. I work closely with Prof. Heng Ji, Prof. Irwin King and Prof. Mengdi Wang. Besides that, I am also co-founder and organizer of Nexus for IntelligeCE (NICE), which provides a platform to share and discuss recent progress in AI & NLP for our more than 150,000 fans at the internet.

My research focus revolves around Theory of Agent (ToA), which unifying internal reasoning and external acting (a.k.a., two major behaviors) of agent as two epistemically equivalent tools to model the internal world stored in the parametric space and external physical world. My long-term objective is to achieve the impossible triangle between safety (env), personalization (user) and autonomy (agent) to learn from interactions internally or externally. For further information, please see my CV (last update: 2026.05.06).

Learning of Agent — under-thinking & under-acting

Research Questions

  • What does an agent need to learn that can’t be compressed into parameters?
  • Will over-delegation erode internal reasoning capability over time?
  • How do reasoning, acting, environments, and time scale jointly?

Representative Works

Behavior of Agent — over-thinking & over-acting

Research Questions

  • Why do agents fail to recognize their own knowledge boundary?
  • When should an agent stop reasoning, stop acting, or stop both?
  • What makes a behavior miscalibrated rather than simply wrong?

Representative Works

Evaluation of Agent — safety, personalization, reward modeling

Research Questions

  • What does a correct answer hide about the process that produced it?
  • How should a reward model reason, not just score?
  • Can an agent be safe, personalized, and autonomous at once?

Representative Works

Agent Applications

:fire: I will be on the job market starting in Aug 2026 and am open to both academic faculty positions and industrial research roles. If you believe I might be a good fit for your institution or organization, I’d love to connect!

news

May 01, 2026 We have 3 papers accepted by ICML 2026, including Theory of Agent, Search-R2, and HistBench. Congrats to all authors, As agents enter the second half, I believe this is not only an engineering challenge, but also a scientific journey toward understanding intelligence itself. Feeling grateful, happy, and energized for the road ahead.
Apr 08, 2026 We have 8 papers accepted by ACL 2026: 7 Main and 1 Findings, including 3 corresponding-author and 1 first-author papers. More Details can be found here. Congrats to all co-authors!
Mar 20, 2026 Our initial efficient reasoning work: AdaCtrl is accepted by TMLR 2026. Congrats to all co-authors!
Jan 25, 2026 We have RM-R1 and PAPO accepted by ICLR 2026, Congratulations to all co-authors! It has been a truly memorable time at UIUC. :sparkles: :smile:
Dec 30, 2025 We have two surveys: The Landscape of Agentic Reinforcement Learning for LLMs: A Survey and A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence accepted by TMLR 2026, Congratulations to all co-authors! This is my first time leading such a large collaboration involving researchers from around the world.

selected preprints

  1. otc-po.jpg
    Hongru Wang, Cheng Qian, Wanjun Zhong, Xiusi Chen, Jiahao Qiu, Shijue Huang, Bowen Jin, Mengdi Wang, Kam-Fai Wong, and Heng Ji
    2025
  2. alita.jpg
    Jiahao Qiu, Xuan Qi, Tongcheng Zhang, Xinzhe Juan, Jiacheng Guo, Yifu Lu, Yimin Wang, Zixin Yao, Qihan Ren, Xun Jiang, Xing Zhou, Dongrui Liu, Ling Yang, Yue Wu, Kaixuan Huang, Shilong Liu, Hongru Wang, and Mengdi Wang
    2025
  3. Rui Wang*, Hongru Wang*, Boyang Xue*, Jianhui Pang, Shudong Liu, Yi Chen, Jiahao Qiu, Derek Fai Wong, Heng Ji, and Kam-Fai Wong
    2025

selected publications

  1. toa_intro.jpg
    Hongru Wang, Cheng Qian, Manling Li, Jiahao Qiu, Boyang Xue, Mengdi Wang, Heng Ji, Amos Storkey, and Kam-Fai Wong
    In ICML, 2025
  2. self-evo.jpg
    Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Qihan Ren, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Cheng Qian, Zhenhailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, and Mengdi Wang
    Transactions on Machine Learning Research, 2026
  3. Boyang Xue, Fei Mi, Qi Zhu, Hongru Wang, Rui Wang, Sheng Wang, Erxin Yu, Xuming Hu, and Kam-Fai Wong
    In ACL, 2025
  4. srlm.jpg
    Hongru Wang, Deng Cai, Wanjun Zhong, Shijue Huang, Jeff Z. Pan, Zeming Liu, and Kam-Fai Wong
    In ACL Findings, 2025
  5. Hongru Wang, Wenyu Huang, Yufei Wang, Yuanhao Xi, Jianqiao Lu, Huan Zhang, Nan Hu, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong
    In ACL Findings, 2025
  6. self-dc.jpg
    Hongru Wang*, Boyang Xue*, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen, and Kam-Fai Wong
    In NAACL, 2025 Blog
  7. Yu Zhao, Alessio Devoto, Giwon Hong, Xiaotang Du, Aryo Pradipta Gema, Hongru Wang, Xuanli He, Kam-Fai Wong, and Pasquale Minervini
    In NAACL, 2025 Abs
  8. tool_tut.jpg
    Hongru Wang, Yujia Qin, Yankai Lin, Jeff Z. Pan, and Kam-Fai Wong
    In SIGIR, Washington DC, USA, 2024 Abs
  9. knowledge_conflict_survey.jpg
    Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, and Wei Xu
    In EMNLP, 2024 Abs
  10. Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yunlong Feng, and Zhijiang Guo
    In NeurIPS, 2024
  11. appbench.jpg
    Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong
    In EMNLP, 2024 Abs
  12. cue-cot.jpg
    Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, and Kam-Fai Wong
    In EMNLP Findings, 2023
  13. Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, and Kam-Fai Wong
    In EMNLP Findings, 2023 Best Paper