Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation (Paper • 2505.16415 • Published May 22, 2025)
Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions (Paper • 2405.03205 • Published May 6, 2024)
🔍 Interpretability & Analysis of LMs (Collection): Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated 14 days ago