BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated Jan 16 • 20
Reinforcement Learning Foundations for Deep Research Systems: A Survey Paper • 2509.06733 • Published 6 days ago • 29
Reverse-Engineered Reasoning for Open-Ended Generation Paper • 2509.06160 • Published 6 days ago • 139
Set Block Decoding is a Language Model Inference Accelerator Paper • 2509.04185 • Published 10 days ago • 48
Towards a Unified View of Large Language Model Post-Training Paper • 2509.04419 • Published 9 days ago • 68
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published 23 days ago • 146
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 4 • 13
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published Aug 6 • 125
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 345
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 65
Article The NLP Course is becoming the LLM Course! By burtenshaw and 9 others • Apr 3 • 99
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 11 days ago • 44
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning Paper • 2503.05379 • Published Mar 7 • 38