NLP

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
AAAR-1.0: Assessing AI's Potential to Assist Research
Evaluating LLMs at Detecting Errors in LLM Responses
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data
Fair Abstractive Summarization of Diverse Perspectives