when i asked the model: who are you ?it answers Qwen..
It should
有趣
论文中都说了 SFT 数据是 DeepSeek V3 以及 Qwen3 MoE 构造的
· Sign up or log in to comment