Reward Reasoning Model
Published in NeurIPS, Poster, 2025
Research Internship (Feb. 2025 – May 2025), Microsoft Research Asia, Beijing
Advisor: Li Dong
Recommended citation: Jiaxin Guo*, Zewen Chi*, Li Dong*, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei. (2025). "Reward Reasoning Model." NeurIPS 2025, Poster.
Download Paper