selected

AAAR-1.0: Assessing AI's Potential to Assist Research
Efficient PRM Training Data Synthesis via Formal Verification
Evaluating LLMs at Detecting Errors in LLM Responses