Papers
AI & ML interests
R3 Model is all you need
Recent Activity
View all activity
models
66

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-14B-LoRA-4k
Text Generation
•
Updated
•
14

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-8B-14k
Text Generation
•
Updated
•
13

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-4B-14k
Text Generation
•
Updated
•
13

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-4k
15B
•
Updated
•
7

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-14k
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-14k
Text Generation
•
15B
•
Updated
•
11

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-4k
Text Generation
•
15B
•
Updated
•
11

rubricreward/R3-Phi-4-reasoning-plus-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-14B-LoRA-14k
15B
•
Updated
•
15

rubricreward/R3-Qwen3-8B-LoRA-14k
Text Generation
•
8B
•
Updated
•
11
•
2
datasets
152
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
2.57M
•
11
rubricreward/PolyGuardMix-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
2.62M
•
12
rubricreward/PolyGuardMix-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
2.63M
•
17
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
2.88M
•
8
rubricreward/PolyGuardMix-tgt_prompt_en_thinking
Viewer
•
Updated
•
2.92M
•
14
rubricreward/PolyGuardMix-en_prompt_en_thinking
Viewer
•
Updated
•
2.92M
•
9
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
21.1k
•
54
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
38.5k
•
40
rubricreward/HelpSteer3-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
21.1k
•
50
rubricreward/HelpSteer3-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
21.5k
•
47