AAAR-1.0: Assessing AI's Potential to Assist Research
Evaluating LLMs at Detecting Errors in LLM Responses