1

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Evaluating LLMs at Detecting Errors in LLM Responses
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data
Fair Abstractive Summarization of Diverse Perspectives
Alternative methods for fast and stable GAN