Generalizable Process Reward Models via Formally Verified Training Data

Publication
arXiv preprint arXiv:2505.15960
Ryo Kamoi
Ryo Kamoi

Ryo Kamoi is a PhD student at Penn State University (2023-). His research interests lie in large language models (LLMs), with a particular focus on the reasoning capabilities and self-improvement of LLMs.