Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models Paper • 2401.01301 • Published Jan 2, 2024
Do Language Models Know When They're Hallucinating References? Paper • 2305.18248 • Published May 29, 2023
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations Paper • 2504.10481 • Published Apr 14 • 84