Hongru (Merlin) Wang

I am currently final-year Ph.D. candidate under the supervision of Prof. Kam-Fai Wong at Department of System Engineering and Engineering Management of The Chinese University of Hong Kong.
I received Bachelor’s degree and Master’s degree from Communication University of China and The Chinese University of Hong Kong respectively. I spent wonderful time at EdinburghNLP and BlenderLab at University of Edinburgh and University of Illinois Urbana-Champaign during my Ph.D study, and I work closely with Prof. Jeff Z. Pan, Prof. Heng Ji, and Prof. Mengdi Wang. I am co-founder and organizer of NLP Academic Exchange Platform (NICE), which provides a platform to share and discuss recent progress in AI & NLP.
My research focus revolves around Theory of Agent (ToA), which unifying internal reasoning and external acting (a.k.a., two major behaviors) of agent as two epistemically equivalent tools to model the internal world stored in the parametric space and external physical world. Where Theory of Mind (ToM) refers to the ability to attribute mental states (e.g., beliefs, intentions, knowledge) to oneself and others, enabling the prediction and interpretation of behavior, ToA characterizes an agent’s capacity to model not only external environments but also its own internal knowledge state to make decisions and complete the goal. My long-term objective is to achieve the impossible triangle between safety, personalization and autonomy of language agent to learn from interactions internally or externally. For further information, please see my CV (last update: 2025.05.30).
I will be on the job market starting in Fall 2025 and am open to both academic faculty positions and industrial research roles. If you believe I might be a good fit for your institution or organization, I’d love to connect!
news
May 25, 2025 | We have 9 papers accepted by ACL 2025: 3 Main and 6 Findings, including 1 corresponding author, 2 first-author papers and 3 (co)first-author papers ![]() ![]() |
---|---|
Apr 22, 2025 | We are so exited to introduce OTC-PO and ToolRL. We believe OTC-PO will be the foundation of agentic RL like the ReAct of Agent. |
Jan 20, 2025 | We have 3 papers are accepted by NAACL 2025, including one first author work: Self-DC that empower language agent when to rely on internal knowledge and when to call external tools. |
Dec 30, 2024 | Start my visiting at BLENDER Lab at University of Illinois Urbana-Champaign hosted by Prof. Heng Ji. It is also the final period of my Ph.D. study. |
Sep 30, 2024 | We have 1 paper accepted by NeurIPS 2024 and 1 paper accepted by MINT Workshop@NeurIPS 2024 about knowledge conflict and process reward model. Congratulations to all co-authors. This is my first paper at ML top conferences. |
selected preprints
- Arxiv
- Arxiv
- ArxivAlita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution2025
- Arxiv
- Arxiv
- Arxiv
- ArxivHarnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models2025
selected publications
- UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language ModelsIn ACL, 2025
- Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning CatalystIn ACL Findings, 2025
- Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and ChallengesIn ACL Findings, 2025
-
- NILLI Best Paper @
IDFLarge Language Models as Source Planner for Personalized Knowledge-grounded DialoguesIn EMNLP Findings, 2023