Ryo Kamoi
Ryo Kamoi
Publications
Education
Work Experience
CV
Projects
Blog
Light
Dark
Automatic
NLP
GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Sarkar Snigdha Sarathi Das
,
Ryo Kamoi
,
Bo Pang
,
Yusen Zhang
,
Caiming Xiong
,
Rui Zhang
PDF
Cite
Code
VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information
Ryo Kamoi
,
Yusen Zhang
,
Sarkar Snigdha Sarathi Das
,
Ranran Haoran Zhang
,
Rui Zhang
PDF
Cite
Dataset
Website
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
,
Yusen Zhang
,
Nan Zhang
,
Jiawei Han
,
Rui Zhang
PDF
Cite
Video
Slides
Paper List
AAAR-1.0: Assessing AI's Potential to Assist Research
Renze Lou
,
Hanzi Xu
,
Sijia Wang
,
Jiangshu Du
,
Ryo Kamoi
,
Xiaoxin Lu
,
Jian Xie
,
Yuxuan Sun
,
Yusen Zhang
,
Jihyun Janice Ahn
,
Hongchao Fang
,
Zhuoyang Zou
,
Wenchao Ma
,
Xi Li
,
Kai Zhang
,
Congying Xia
,
Lifu Huang
,
Wenpeng Yin
PDF
Cite
Evaluating LLMs at Detecting Errors in LLM Responses
Ryo Kamoi
,
Sarkar Snigdha Sarathi Das
,
Renze Lou
,
Jihyun Janice Ahn
,
Yilun Zhao
,
Xiaoxin Lu
,
Nan Zhang
,
Yusen Zhang
,
Ranran Haoran Zhang
,
Sujeeth Reddy Vummanthala
,
Salika Dave
,
Shaobo Qin
,
Arman Cohan
,
Wenpeng Yin
,
Rui Zhang
PDF
Cite
Code
Dataset
Poster
Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation
Jihyun Janice Ahn
,
Ryo Kamoi
,
Lu Cheng
,
Rui Zhang
,
Wenpeng Yin
PDF
Cite
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data
Yilun Zhao
,
Yitao Long
,
Hongjun Liu
,
Linyong Nan
,
Lyuhao Chen
,
Ryo Kamoi
,
Yixin Liu
,
Xiangru Tang
,
Rui Zhang
,
Arman Cohan
PDF
Cite
Fair Abstractive Summarization of Diverse Perspectives
Yusen Zhang
,
Nan Zhang
,
Yixin Liu
,
Alexander Fabbri
,
Junru Liu
,
Ryo Kamoi
,
Xiaoxin Lu
,
Caiming Xiong
,
Jieyu Zhao
,
Dragomir Radev
,
Kathleen McKeown
,
Rui Zhang
PDF
Cite
WiCE: Real-World Entailment for Claims in Wikipedia
Models for textual entailment have increasingly been applied to settings like fact-checking, presupposition verification in question …
Ryo Kamoi
,
Tanya Goyal
,
Juan Diego Rodriguez
,
Greg Durrett
PDF
Cite
Dataset
Slides
Shortcomings of Question Answering Based Factuality Frameworks for Error Localization
Despite recent progress in abstractive summarization, models often generate summaries with factual errors. Numerous approaches to …
Ryo Kamoi
,
Tanya Goyal
,
Greg Durrett
PDF
Cite
Dataset
Video
Poster
Cite
×