Hugging Face

xz

mxz
AI & ML interests

NLP ML RL

Organizations

None yet

models 7

mxz/qwen-R1-3B

3B • Updated Mar 4 • 4

mxz/qwen-R1-1.5B

2B • Updated Mar 4 • 4

mxz/qwen-R1-0.5b

0.5B • Updated Mar 3 • 4

mxz/llama3-8b-dpo

Text Generation • 8B • Updated Jul 28, 2024 • 3

mxz/llama3-8b-ppo

Text Generation • 8B • Updated Jul 28, 2024 • 3

mxz/llama3-8b-sft

Text Generation • 8B • Updated Jul 28, 2024 • 3

mxz/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 17, 2024

datasets 4

mxz/awesome-dpo

Viewer • Updated Jul 28, 2024 • 302k • 1

mxz/CValues

Viewer • Updated Jul 26, 2024 • 146k • 1

mxz/CValues_DPO

Viewer • Updated Jul 26, 2024 • 146k • 11

mxz/alpaca_en_zh_ruozhiba_gpt4-data

Viewer • Updated Jul 26, 2024 • 190k • 6