Hongru (Merlin) Wang

Research Associate @ EdinburghNLP and EdinburghAI

prof_pic.jpg

I am currently a research associate (postdoc) at University of Edinburgh, working closely with Prof. Amos Storkey and Prof. Jeff Z. Pan, working on theory of agents (mainly planning, memory, self-evolving), world model-based RL.

I received PhD degree from The Chinese University of Hong Kong under the supervision of Prof. Kam-Fai Wong (ACL Fellow). I spent wonderful time at EdinburghNLP and BlenderLab at University of Edinburgh and University of Illinois Urbana-Champaign during my Ph.D study. I work closely with Prof. Jeff Z. Pan, Prof. Heng Ji, Prof. Irwin King and Prof. Mengdi Wang. I am co-founder and organizer of NLP Academic Exchange Platform (NICE), which provides a platform to share and discuss recent progress in AI & NLP.

My research focus revolves around Theory of Agent (ToA), which unifying internal reasoning and external acting (a.k.a., two major behaviors) of agent as two epistemically equivalent tools to model the internal world stored in the parametric space and external physical world. Where Theory of Mind (ToM) refers to the ability to attribute mental states (e.g., beliefs, intentions, knowledge) to oneself and others, enabling the prediction and interpretation of behavior, ToA characterizes an agent’s capacity to model not only external environments but also its own internal knowledge state to make decisions and complete the goal. My long-term objective is to achieve the impossible triangle between safety, personalization and autonomy of language agent to learn from interactions internally or externally. For further information, please see my CV (last update: 2026.01.30).

:loudspeaker: Mentorship: I have very close connections with CUHK, UoE, UIUC and Princeton. If you like my research or would like to copperate / visit, you can directly contact me via X or Wechat. My mentees always publish better paper than me :)

:fire: I will be on the job market starting in Mar 2026 and am open to both academic faculty positions and industrial research roles. If you believe I might be a good fit for your institution or organization, I’d love to connect!

news

May 01, 2026 We have 3 papers accepted by ICML 2026, including Theory of Agent, Search-R2, and HistBench. Congrats to all authors, As agents enter the second half, I believe this is not only an engineering challenge, but also a scientific journey toward understanding intelligence itself. Feeling grateful, happy, and energized for the road ahead.
Apr 08, 2026 We have 8 papers accepted by ACL 2026: 7 Main and 1 Findings, including 3 corresponding-author and 1 first-author papers. More Details can be found here. Congrats to all co-authors!
Mar 20, 2026 Our initial efficient reasoning work: AdaCtrl is accepted by TMLR 2026. Congrats to all co-authors!
Jan 25, 2026 We have RM-R1 and PAPO accepted by ICLR 2026, Congratulations to all co-authors! It has been a truly memorable time at UIUC. :sparkles: :smile:
Dec 30, 2025 We have two surveys: The Landscape of Agentic Reinforcement Learning for LLMs: A Survey and A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence accepted by TMLR 2026, Congratulations to all co-authors! This is my first time leading such a large collaboration involving researchers from around the world.

selected preprints

  1. Arxiv
    Acting Less is Reasoning More! Teaching Model to Act Efficiently
    Hongru Wang, Cheng Qian, Wanjun Zhong, Xiusi Chen, Jiahao Qiu, Shijue Huang, Bowen Jin, Mengdi Wang, Kam-Fai Wong, and Heng Ji
    2025
  2. Arxiv
    Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
    Jiahao Qiu, Xuan Qi, Tongcheng Zhang, Xinzhe Juan, Jiacheng Guo, Yifu Lu, Yimin Wang, Zixin Yao, Qihan Ren, Xun Jiang, Xing Zhou, Dongrui Liu, Ling Yang, Yue Wu, Kaixuan Huang, Shilong Liu, Hongru Wang, and Mengdi Wang
    2025
  3. Arxiv
    Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
    Rui Wang*, Hongru Wang*, Boyang Xue*, Jianhui Pang, Shudong Liu, Yi Chen, Jiahao Qiu, Derek Fai Wong, Heng Ji, and Kam-Fai Wong
    2025

selected publications

  1. Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
    Hongru Wang, Cheng Qian, Manling Li, Jiahao Qiu, Boyang Xue, Mengdi Wang, Heng Ji, Amos Storkey, and Kam-Fai Wong
    In ICML, 2025
  2. A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence
    Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Qihan Ren, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Cheng Qian, Zhenhailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, and Mengdi Wang
    Transactions on Machine Learning Research, 2026
  3. UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
    Boyang Xue, Fei Mi, Qi Zhu, Hongru Wang, Rui Wang, Sheng Wang, Erxin Yu, Xuming Hu, and Kam-Fai Wong
    In ACL, 2025
  4. Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst
    Hongru Wang, Deng Cai, Wanjun Zhong, Shijue Huang, Jeff Z. Pan, Zeming Liu, and Kam-Fai Wong
    In ACL Findings, 2025
  5. Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges
    Hongru Wang, Wenyu Huang, Yufei Wang, Yuanhao Xi, Jianqiao Lu, Huan Zhang, Nan Hu, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong
    In ACL Findings, 2025
  6. Oral
    Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions
    Hongru Wang*, Boyang Xue*, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen, and Kam-Fai Wong
    In NAACL, 2025
  7. Oral
    Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
    Yu Zhao, Alessio Devoto, Giwon Hong, Xiaotang Du, Aryo Pradipta Gema, Hongru Wang, Xuanli He, Kam-Fai Wong, and Pasquale Minervini
    In NAACL, 2025
  8. Tutorial
    Empowering Large Language Models: Tool Learning for Real-World Interaction
    Hongru Wang, Yujia Qin, Yankai Lin, Jeff Z. Pan, and Kam-Fai Wong
    In SIGIR, Washington DC, USA, 2024
  9. Knowledge Conflicts for LLMs: A Survey
    Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, and Wei Xu
    In EMNLP, 2024
  10. AutoPSV: Automated Process-Supervised Verifier
    Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yunlong Feng, and Zhijiang Guo
    In NeurIPS, 2024
  11. AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
    Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong
    In EMNLP, 2024
  12. Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
    Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, and Kam-Fai Wong
    In EMNLP Findings, 2023
  13. NILLI Best Paper @
    IDF
    Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues
    Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, and Kam-Fai Wong
    In EMNLP Findings, 2023