stellalisy
/

rethink_rlvr_reproduce-format-qwen2.5_math_7b-lr5e-7-kl0.00-step50

Text Generation

text-generation-inference

Model card Files Files and versions

rethink_rlvr_reproduce-format-qwen2.5_math_7b-lr5e-7-kl0.00-step50

Ctrl+K

Ctrl+K

1 contributor

History: 3 commits

stellalisy's picture

Upload tokenizer

afa727d verified about 2 months ago