May 25, 2025 | We have 9 papers accepted by ACL 2025: 3 Main and 6 Findings, including 1 corresponding author, 2 first-author papers and 3 (co)first-author papers |
Apr 22, 2025 | We are so exited to introduce OTC-PO and ToolRL. We believe OTC-PO will be the foundation of agentic RL like the ReAct of Agent. |
Jan 20, 2025 | We have 3 papers are accepted by NAACL 2025, including one first author work: Self-DC that empower language agent when to rely on internal knowledge and when to call external tools. |
Dec 30, 2024 | Start my visiting at BLENDER Lab at University of Illinois Urbana-Champaign hosted by Prof. Heng Ji. It is also the final period of my Ph.D. study. |
Sep 30, 2024 | We have 1 paper accepted by NeurIPS 2024 and 1 paper accepted by MINT Workshop@NeurIPS 2024 about knowledge conflict and process reward model. Congratulations to all co-authors. This is my first paper at ML top conferences. |
Nov 07, 2015 | A long announcement with details |