yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k 4B • Updated Aug 9 • 12
yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k 4B • Updated Aug 9 • 12
yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k 4B • Updated Aug 9 • 11
yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k 4B • Updated Aug 9 • 11