umangkaushik
ubermenchh
AI & ML interests: None yet
Models (33)

ubermenchh/Qwen2.5-3B-open-r1-math • Text Generation • 3B • Updated • 3

ubermenchh/Qwen2.5-3B-open-r1-math-lora • Updated

ubermenchh/Qwen2.5-3B-openr1-math • Text Generation • Updated • 2

ubermenchh/Qwen2.5-0.5B-openr1-math • Updated

ubermenchh/llama3.1-8B-gsm8k-grpo • 8B • Updated • 2

ubermenchh/SmolLM2-SFT-sarvam-samvaad • Text Generation • 0.2B • Updated • 2

ubermenchh/SmolLM2-360M-r1-grpo-countdown • Updated

ubermenchh/SmolLM2-DPO-ultrafeedback-binarized-preferences • Text Generation • 0.1B • Updated • 6

ubermenchh/SmolLM2-DPO • Text Generation • 0.1B • Updated • 3

ubermenchh/SmolLM2-FT-the-smol-stack • Text Generation • 0.1B • Updated • 3