TAUR-dev/M-0827_rl_reflect_countdown__0epoch_3and4args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 5 days ago • 43
TAUR-dev/M-0827_rl_reflect_countdown__0epoch_4args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 6 days ago • 16
TAUR-dev/M-0827_rl_reflect_countdown__2epoch_3args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 6 days ago • 12
TAUR-dev/M-0827_rl_reflect_countdown__0epoch_3args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 6 days ago • 12
TAUR-dev/M-0827_rl_reflect_countdown__4epoch_4args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 6 days ago • 8
TAUR-dev/M-0827_rl_reflect_countdown__2epoch_3and4args__grpo_minibs32_lr1e-6_rolloutn16-rl 2B • Updated 6 days ago • 38
TAUR-dev/D-ExpTracker__jack_experiments__all_stages_tacc__v1 Viewer • Updated 10 minutes ago • 355 • 262
TAUR-dev/D-EVAL__standard_eval_v3__sft_train__8_30_25__cd3arg-eval_0 Viewer • Updated about 2 hours ago • 8k
TAUR-dev/reflection_csqa_sft_num_correct_1.1.2.1.2.3_num_incorrect_0.1.0.2.1.0 Viewer • Updated about 7 hours ago • 25
TAUR-dev/reflection_longmult3dig_sft_num_correct_1.1.2.1.2.3_num_incorrect_0.1.0.2.1.0 Viewer • Updated about 7 hours ago • 21.2k
TAUR-dev/reflection_gsm8k_sft_num_correct_1.1.2.1.2.3_num_incorrect_0.1.0.2.1.0 Viewer • Updated about 8 hours ago • 26.5k