Reward Reasoning Model

Published in NeurIPS, Poster, 2025

Research Internship (Feb. 2025 – May 2025), Microsoft Research Asia, Beijing

Advisor: Li Dong

Recommended citation: Jiaxin Guo*, Zewen Chi*, Li Dong*, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei. (2025). "Reward Reasoning Model." NeurIPS 2025, Poster.