Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mgkwill 's Collections
Read later
OpenSci
Reasoning-01
chat-models-candidates

Reasoning-01

updated May 29
Upvote
-

  • Skywork Open Reasoner 1 Technical Report

    Paper • 2505.22312 • Published May 28 • 55

  • Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

    Paper • 2505.21191 • Published May 27 • 3

  • Absolute Zero: Reinforced Self-play Reasoning with Zero Data

    Paper • 2505.03335 • Published May 6 • 184

  • Qwen3 Technical Report

    Paper • 2505.09388 • Published May 14 • 288

  • MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

    Paper • 2505.07608 • Published May 12 • 82

  • RM-R1: Reward Modeling as Reasoning

    Paper • 2505.02387 • Published May 5 • 79

  • Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

    Paper • 2503.16219 • Published Mar 20 • 53
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs