Hongru (Merlin) Wang

I am currently a final-year Ph.D. candidate under the supervision of Prof. Kam-Fai Wong at the Department of Systems Engineering and Engineering Management of The Chinese University of Hong Kong.

I received my Bachelor’s degree from Communication University of China and my Master’s degree from The Chinese University of Hong Kong. During my Ph.D. study, I spent a wonderful time at EdinburghNLP at the University of Edinburgh and at BlenderLab at the University of Illinois Urbana-Champaign, and I work closely with Prof. Jeff Z. Pan, Prof. Heng Ji, and Prof. Mengdi Wang. I am a co-founder and organizer of the NLP Academic Exchange Platform (NICE), which provides a platform to share and discuss recent progress in AI & NLP.

My research revolves around the Theory of Agent (ToA), which unifies internal reasoning and external acting (the two major behaviors of an agent) as two epistemically equivalent tools for modeling the internal world stored in the parametric space and the external physical world. Whereas Theory of Mind (ToM) refers to the ability to attribute mental states (e.g., beliefs, intentions, knowledge) to oneself and others, enabling the prediction and interpretation of behavior, ToA characterizes an agent’s capacity to model not only external environments but also its own internal knowledge state in order to make decisions and complete its goals. My long-term objective is to achieve all three corners of the impossible triangle of safety, personalization, and autonomy in language agents that learn from interactions, whether internal or external. For further information, please see my CV (last update: 2025.05.30).

I will be on the job market starting in Fall 2025 and am open to both academic faculty positions and industrial research roles. If you believe I might be a good fit for your institution or organization, I’d love to connect!

news

May 25, 2025	We have 9 papers accepted by ACL 2025: 3 Main and 6 Findings, including 1 corresponding-author paper, 2 first-author papers, and 3 (co-)first-author papers.
Apr 22, 2025	We are so excited to introduce OTC-PO and ToolRL. We believe OTC-PO will be a foundation for agentic RL, much as ReAct has been for agents.
Jan 20, 2025	We have 3 papers accepted by NAACL 2025, including one first-author work: Self-DC, which enables language agents to decide when to rely on internal knowledge and when to call external tools.
Dec 30, 2024	Started my visit to the BLENDER Lab at the University of Illinois Urbana-Champaign, hosted by Prof. Heng Ji. It is also the final period of my Ph.D. study.
Sep 30, 2024	We have 1 paper accepted by NeurIPS 2024 and 1 paper accepted by the MINT Workshop@NeurIPS 2024, on knowledge conflicts and process reward models. Congratulations to all co-authors. This is my first paper at a top ML conference.

selected preprints

Arxiv

Acting Less is Reasoning More! Teaching Model to Act Efficiently

Hongru Wang, Cheng Qian, Wanjun Zhong, Xiusi Chen, Jiahao Qiu, Shijue Huang, Bowen Jin, Mengdi Wang, Kam-Fai Wong, and Heng Ji

2025

HTML
Arxiv

Toward a Theory of Agents as Tool-Use Decision-Makers

Hongru Wang, Cheng Qian, Manling Li, Jiahao Qiu, Boyang Xue, Mengdi Wang, Heng Ji, and Kam-Fai Wong

2025

HTML
Arxiv

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Jiahao Qiu, Xuan Qi, Tongcheng Zhang, Xinzhe Juan, Jiacheng Guo, Yifu Lu, Yimin Wang, Zixin Yao, Qihan Ren, Xun Jiang, Xing Zhou, Dongrui Liu, Ling Yang, Yue Wu, Kaixuan Huang, Shilong Liu, Hongru Wang, and Mengdi Wang

2025

HTML
Arxiv

RM-R1: Reward Modeling as Reasoning

Xiusi Chen, Gaotang Li, Ziqi Wang, Bowen Jin, Cheng Qian, Yu Wang, Hongru Wang, Yu Zhang, Denghui Zhang, Tong Zhang, Hanghang Tong, and Heng Ji

2025

HTML
Arxiv

ToolRL: Reward is All Tool Learning Needs

Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek Hakkani-Tür, Gokhan Tur, and Heng Ji

2025

HTML
Arxiv

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

Shijue Huang^*, Hongru Wang^*, Wanjun Zhong, Zhaochen Su, Jiazhan Feng, Bowen Cao, and Yi R. Fung

2025

HTML
Arxiv

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Rui Wang^*, Hongru Wang^*, Boyang Xue^*, Jianhui Pang, Shudong Liu, Yi Chen, Jiahao Qiu, Derek Fai Wong, Heng Ji, and Kam-Fai Wong

2025

HTML

selected publications

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Boyang Xue, Fei Mi, Qi Zhu, Hongru Wang^†, Rui Wang, Sheng Wang, Erxin Yu, Xuming Hu, and Kam-Fai Wong

In ACL, 2025

HTML
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst

Hongru Wang, Deng Cai, Wanjun Zhong, Shijue Huang, Jeff Z. Pan, Zeming Liu, and Kam-Fai Wong

In ACL Findings, 2025

HTML
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges

Hongru Wang, Wenyu Huang, Yufei Wang, Yuanhao Xi, Jianqiao Lu, Huan Zhang, Nan Hu, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong

In ACL Findings, 2025

HTML
Oral

Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions

Hongru Wang, Boyang Xue, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen, and Kam-Fai Wong

In NAACL, 2025

DOI HTML Blog
Oral

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Yu Zhao, Alessio Devoto, Giwon Hong, Xiaotang Du, Aryo Pradipta Gema, Hongru Wang, Xuanli He, Kam-Fai Wong, and Pasquale Minervini

In NAACL, 2025

Abs DOI HTML

Large language models (LLMs) can store a significant amount of factual knowledge in their parameters. However, their parametric knowledge may conflict with the information provided in the context. This phenomenon, known as context-memory knowledge conflicts, can lead to undesirable model behaviour, such as reliance on outdated or incorrect information. Analysing the internal activations of LLMs, we find that they can internally register the signals of knowledge conflict at mid-layers. Such signals allow us to detect whether a knowledge conflict occurs and use inference-time intervention strategies to resolve it. In this work, we propose SpARE, a training-free representation engineering method that uses pre-trained sparse auto-encoders (SAEs) to control the knowledge selection behaviour of LLMs. SpARE identifies the functional features that control the knowledge selection behaviours and applies them to edit the internal activations of LLMs at inference time. Our experimental results show that SpARE can effectively control the usage of either knowledge source to resolve knowledge conflict in open-domain question-answering tasks, surpassing existing representation engineering methods (+10%) as well as contrastive decoding methods (+15%).
Tutorial

Empowering Large Language Models: Tool Learning for Real-World Interaction

Hongru Wang, Yujia Qin, Yankai Lin, Jeff Z. Pan, and Kam-Fai Wong

In SIGIR, Washington DC, USA, 2024

Abs DOI HTML

Since the advent of large language models (LLMs), the field of tool learning has remained very active in solving various tasks in practice, including but not limited to information retrieval. This half-day tutorial provides basic concepts of this field and an overview of recent advancements with several applications. Specifically, we start with some foundational components and architecture of tool learning (i.e., cognitive tool and physical tool), then categorize existing studies in this field into tool-augmented learning and tool-oriented learning, and introduce various learning methods that give LLMs this kind of capability. Furthermore, we provide several cases about when, what, and how to use tools in different applications. We end with some open challenges and several potential research directions for future studies. We believe this tutorial is suited both for researchers at different stages (introductory, intermediate, and advanced) and for industry practitioners who are interested in LLMs and tool learning.
Knowledge Conflicts for LLMs: A Survey

Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, and Wei Xu

In EMNLP, 2024

Abs DOI HTML

This survey provides an in-depth analysis of knowledge conflicts for large language models (LLMs), highlighting the complex challenges they encounter when blending contextual and parametric knowledge. Our focus is on three categories of knowledge conflicts: context-memory, inter-context, and intra-memory conflict. These conflicts can significantly impact the trustworthiness and performance of LLMs, especially in real-world applications where noise and misinformation are common. By categorizing these conflicts, exploring the causes, examining the behaviors of LLMs under such conflicts, and reviewing available solutions, this survey aims to shed light on strategies for improving the robustness of LLMs, thereby serving as a valuable resource for advancing research in this evolving area.
AutoPSV: Automated Process-Supervised Verifier

Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yunlong Feng, and Zhijiang Guo

In NeurIPS, 2024

HTML
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction

Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, and Kam-Fai Wong

In EMNLP, 2024

Abs DOI HTML

Large Language Models (LLMs) can interact with the real world by connecting with versatile external APIs, resulting in better problem-solving and task automation capabilities. Previous research primarily either focuses on APIs with limited arguments from a single source or overlooks the complex dependency relationship between different APIs. However, it is essential to utilize multiple APIs collaboratively from various sources, especially for complex user instructions. In this paper, we introduce AppBench, the first benchmark to evaluate LLMs’ ability to plan and execute multiple APIs from various sources in order to complete the user’s task. Specifically, we consider two significant challenges in multiple APIs: 1) graph structures: some APIs can be executed independently while others need to be executed one by one, resulting in graph-like execution order; and 2) permission constraints: which source is authorized to execute the API call. We report experimental results on 9 distinct LLMs; e.g., GPT-4o achieves only a 2.0% success rate on the most complex instructions, revealing that the existing state-of-the-art LLMs still cannot perform well in this situation even with the help of in-context learning and finetuning. Our code and data are publicly available at https://github.com/ruleGreen/AppBench.
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs

Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, and Kam-Fai Wong

In EMNLP Findings, 2023

DOI HTML
NILLI Best Paper @
IDF

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues

Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, and Kam-Fai Wong

In EMNLP Findings, 2023

Best Paper DOI HTML

Hongru received the Best Paper Award at the 2023 International Doctoral Forum.