Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9, 2024 • 6
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers Paper • 2401.04695 • Published Jan 9, 2024 • 11
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method Paper • 2206.14796 • Published Jun 29, 2022 • 1
RED-ACE: Robust Error Detection for ASR using Confidence Embeddings Paper • 2203.07172 • Published Mar 14, 2022 • 1
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models Paper • 2305.11171 • Published May 18, 2023 • 2
What You See is What You Read? Improving Text-Image Alignment Evaluation Paper • 2305.10400 • Published May 17, 2023 • 2