mradermacher/GCIRS-Reasoning-1.5B-R1-GGUF Reinforcement Learning • 2B • Updated 28 days ago • 2.28k • 1