From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review • Paper • 2504.19678 • Published Apr 28
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research • Paper • 2507.13300 • Published Jul 17