Reasoning-01 - a mgkwill Collection

updated May 29

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 55

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

Paper • 2505.21191 • Published May 27 • 3

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 288

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 79

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 53