Projects

Table of Contents

Detecting and Correcting Mistakes in LLM Responses

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs (TACL 2024)

Evaluating LLMs at Detecting Errors in LLM Responses (COLM 2024)

Fact Verification

WiCE: Real-World Entailment for Claims in Wikipedia (EMNLP 2023)

Shortcomings of Question Answering Based Factuality Frameworks for Error Localization (EACL 2023)