Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors Paper • 2503.22388 • Published Mar 28 • 1
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset Paper • 2402.04588 • Published Feb 7, 2024 • 2
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization Paper • 2402.11453 • Published Feb 18, 2024
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 878