Hongru (Merlin) Wang

I am currently a research associate (postdoc) at the University of Edinburgh, working closely with Prof. Amos Storkey and Prof. Jeff Z. Pan on Theory of Agent (mainly planning, memory, and self-evolution). I received my PhD from The Chinese University of Hong Kong under the supervision of Prof. Kam-Fai Wong (ACL Fellow). During my PhD, I spent a wonderful time as a visiting student at BlenderLab at the University of Illinois Urbana-Champaign under the supervision of Prof. Heng Ji (ACL Fellow). I also work closely with Prof. Irwin King and Prof. Mengdi Wang. In addition, I am a co-founder and organizer of Nexus for IntelligeCE (NICE), which provides a platform to share and discuss recent progress in AI & NLP with our more than 150,000 followers online.

My research revolves around Theory of Agent (ToA), which unifies the internal reasoning and external acting of an agent (a.k.a. its two major behaviors) as two epistemically equivalent tools for modeling the internal world stored in the parametric space and the external physical world. My long-term objective is to achieve the impossible triangle of safety (env), personalization (user), and autonomy (agent), so that agents can learn from interactions internally or externally. For further information, please see my CV, Research Statement, and Teaching Statement (last update: 2026.05.06).

Learning of Agent — under-thinking & under-acting

Research Questions

What does an agent need to learn that can’t be compressed into parameters?
Will over-delegation erode internal reasoning capability over time?
How do reasoning, acting, environments, and time scale jointly?

Representative Works

ToolRL: Reward is All Tool Learning Needs (NeurIPS 2025)
From Word to World: Can Large Language Models be Implicit Text-based World Models? (ACL 2026)
A Survey of Self-Evolving Agents (TMLR 2026)

Behavior of Agent — over-thinking & over-acting

Research Questions

Why do agents fail to recognize their own knowledge boundary?
When should an agent stop reasoning, stop acting, or stop both?
What makes a behavior miscalibrated rather than simply wrong?

Representative Works

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting (TMLR 2026)
SMART: Self-Aware Agent for Tool Overuse Mitigation (ACL Findings 2025)
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions (NAACL 2025)

Evaluation of Agent — safety, personalization, reward modeling

Research Questions

What does a correct answer hide about the process that produced it?
How should a reward model reason, not just score?
Can an agent be safe, personalized, and autonomous at once?

Representative Works

SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs (EMNLP Findings 2025)
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction (EMNLP 2024)
ToolSpectrum: Towards Personalized Tool Utilization for Large Language Models (ACL Findings 2025)
RM-R1: Reward Modeling as Reasoning (ICLR 2026)

[Diagram: Agent Applications; Theory of Agent (under-think / under-act); Behavior (over-think / over-act); Evaluation (safety, personalization, reward)]

I will be on the job market starting in Aug 2026 and am open to both academic faculty positions and industrial research roles. If you believe I might be a good fit for your institution or organization, I’d love to connect!

news

Jul 20, 2026	I’m happy to share that my first paper as last author has been accepted by COLM 2026. Looking forward to releasing it ASAP.
May 01, 2026	We have 3 papers accepted by ICML 2026, including Theory of Agent, Search-R2, and HistBench. Congrats to all authors! As agents enter the second half, I believe this is not only an engineering challenge but also a scientific journey toward understanding intelligence itself. Feeling grateful, happy, and energized for the road ahead.
Apr 08, 2026	We have 8 papers accepted by ACL 2026: 7 Main and 1 Findings, including 3 corresponding-author papers and 1 first-author paper. More details can be found here. Congrats to all co-authors!
Mar 20, 2026	Our initial work on efficient reasoning, AdaCtrl, has been accepted by TMLR 2026. Congrats to all co-authors!
Jan 25, 2026	We have RM-R1 and PAPO accepted by ICLR 2026. Congratulations to all co-authors! It has been a truly memorable time at UIUC.

selected preprints

Acting Less is Reasoning More! Teaching Model to Act Efficiently Arxiv

Hongru Wang, Cheng Qian, Wanjun Zhong, Xiusi Chen, Jiahao Qiu, Shijue Huang, Bowen Jin, Mengdi Wang, Kam-Fai Wong, and Heng Ji

2025
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Arxiv

Jiahao Qiu, Xuan Qi, Tongcheng Zhang, Xinzhe Juan, Jiacheng Guo, Yifu Lu, Yimin Wang, Zixin Yao, Qihan Ren, Xun Jiang, Xing Zhou, Dongrui Liu, Ling Yang, Yue Wu, Kaixuan Huang, Shilong Liu, Hongru Wang, and Mengdi Wang

2025
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Arxiv

Rui Wang^*, Hongru Wang^*, Boyang Xue^*, Jianhui Pang, Shudong Liu, Yi Chen, Jiahao Qiu, Derek Fai Wong, Heng Ji, and Kam-Fai Wong

2025

selected publications

Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

Hongru Wang, Cheng Qian, Manling Li, Jiahao Qiu, Boyang Xue, Mengdi Wang, Heng Ji, Amos Storkey, and Kam-Fai Wong

In ICML, 2026
A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence

Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Qihan Ren, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Cheng Qian, Zhenhailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, and Mengdi Wang

Transactions on Machine Learning Research, 2026
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Boyang Xue, Fei Mi, Qi Zhu, Hongru Wang^†, Rui Wang, Sheng Wang, Erxin Yu, Xuming Hu, and Kam-Fai Wong

In ACL, 2025
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst

Hongru Wang, Deng Cai, Wanjun Zhong, Shijue Huang, Jeff Z. Pan, Zeming Liu, and Kam-Fai Wong

In ACL Findings, 2025
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges

Hongru Wang, Wenyu Huang, Yufei Wang, Yuanhao Xi, Jianqiao Lu, Huan Zhang, Nan Hu, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong

In ACL Findings, 2025
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions Oral

Hongru Wang^*, Boyang Xue^*, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen, and Kam-Fai Wong

In NAACL, 2025
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Oral

Yu Zhao, Alessio Devoto, Giwon Hong, Xiaotang Du, Aryo Pradipta Gema, Hongru Wang, Xuanli He, Kam-Fai Wong, and Pasquale Minervini

In NAACL, 2025

Large language models (LLMs) can store a significant amount of factual knowledge in their parameters. However, their parametric knowledge may conflict with the information provided in the context. This phenomenon, known as context-memory knowledge conflicts, can lead to undesirable model behaviour, such as reliance on outdated or incorrect information. Analysing the internal activations of LLMs, we find that they can internally register the signals of knowledge conflict at mid-layers. Such signals allow us to detect whether a knowledge conflict occurs and use inference-time intervention strategies to resolve it. In this work, we propose SpARE, a training-free representation engineering method that uses pre-trained sparse auto-encoders (SAEs) to control the knowledge selection behaviour of LLMs. SpARE identifies the functional features that control the knowledge selection behaviours and applies them to edit the internal activations of LLMs at inference time. Our experimental results show that SpARE can effectively control the usage of either knowledge source to resolve knowledge conflict in open-domain question-answering tasks, surpassing existing representation engineering methods (+10%) as well as contrastive decoding methods (+15%).
Empowering Large Language Models: Tool Learning for Real-World Interaction Tutorial

Hongru Wang, Yujia Qin, Yankai Lin, Jeff Z. Pan, and Kam-Fai Wong

In SIGIR, Washington DC, USA, 2024

Since the advent of large language models (LLMs), the field of tool learning has remained very active in solving various practical tasks, including but not limited to information retrieval. This half-day tutorial presents the basic concepts of the field and an overview of recent advancements with several applications. Specifically, we start with the foundational components and architecture of tool learning (i.e., cognitive tools and physical tools), then categorize existing studies into tool-augmented learning and tool-oriented learning, and introduce various learning methods that equip LLMs with this capability. Furthermore, we provide several cases on when, what, and how to use tools in different applications. We end with some open challenges and several potential research directions for future studies. We believe this tutorial is suited both for researchers at different stages (introductory, intermediate, and advanced) and for industry practitioners who are interested in LLMs and tool learning.
Knowledge Conflicts for LLMs: A Survey

Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, and Wei Xu

In EMNLP, 2024

This survey provides an in-depth analysis of knowledge conflicts for large language models (LLMs), highlighting the complex challenges they encounter when blending contextual and parametric knowledge. Our focus is on three categories of knowledge conflicts: context-memory, inter-context, and intra-memory conflict. These conflicts can significantly impact the trustworthiness and performance of LLMs, especially in real-world applications where noise and misinformation are common. By categorizing these conflicts, exploring the causes, examining the behaviors of LLMs under such conflicts, and reviewing available solutions, this survey aims to shed light on strategies for improving the robustness of LLMs, thereby serving as a valuable resource for advancing research in this evolving area.
AutoPSV: Automated Process-Supervised Verifier

Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yunlong Feng, and Zhijiang Guo

In NeurIPS, 2024
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction

Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong

In EMNLP, 2024

Large Language Models (LLMs) can interact with the real world by connecting with versatile external APIs, resulting in better problem-solving and task automation capabilities. Previous research either focuses primarily on APIs with limited arguments from a single source or overlooks the complex dependency relationships between different APIs. However, it is essential to utilize multiple APIs collaboratively from various sources, especially for complex user instructions. In this paper, we introduce AppBench, the first benchmark to evaluate LLMs’ ability to plan and execute multiple APIs from various sources in order to complete the user’s task. Specifically, we consider two significant challenges in using multiple APIs: 1) graph structures: some APIs can be executed independently while others need to be executed one by one, resulting in a graph-like execution order; and 2) permission constraints: which source is authorized to execute the API call. Experimental results on 9 distinct LLMs (e.g., GPT-4o achieves only a 2.0% success rate on the most complex instructions) reveal that existing state-of-the-art LLMs still cannot perform well in this setting, even with the help of in-context learning and finetuning. Our code and data are publicly available at https://github.com/ruleGreen/AppBench.
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs

Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, and Kam-Fai Wong

In EMNLP Findings, 2023
Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues Best Paper

Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, and Kam-Fai Wong

In EMNLP Findings, 2023

Hongru received the Best Paper Award at the 2023 International Doctoral Forum.