Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yang31210999 's Collections
Qwen-3-4B-2507-GPTOSS-Distillation-0810
PTQTP
LLM-Neo

Qwen-3-4B-2507-GPTOSS-Distillation-0810

updated Aug 9

Qwen-3-4B-2507 use data from IIGroup/s1K-1.1-gpt-oss-20b to distill.

Upvote
-

  • yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k

    4B • Updated Aug 9 • 12

  • yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k

    4B • Updated Aug 9 • 12

  • yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k

    4B • Updated Aug 9 • 11

  • yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k

    4B • Updated Aug 9 • 11
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs