Hugging Face
Yifan
YYF42
2 followers · 3 following
AI & ML interests
None yet
Organizations
Papers 1
arxiv:2310.08659
models 10
YYF42/Qwen-2.5-1.5B-Simple-RL-3epoch-newprompt • 2B • Updated Mar 3 • 3
YYF42/Qwen-2.5-1.5B-Simple-RL-3epoch • Text Generation • 2B • Updated Mar 3 • 4
YYF42/Qwen-2.5-1.5B-Simple-RL-16response • Text Generation • 2B • Updated Mar 3 • 4
YYF42/Qwen-2.5-1.5B-Simple-RL-4response • Text Generation • 2B • Updated Mar 2 • 5
YYF42/Qwen-2.5-1.5B-Simple-RL-2response • Text Generation • 2B • Updated Mar 2 • 4
YYF42/Qwen-2.5-1.5B-Simple-RL-baseline3 • Text Generation • 2B • Updated Mar 2 • 4
YYF42/Qwen-2.5-1.5B-Simple-RL-baseline2 • Text Generation • 2B • Updated Mar 1 • 4
YYF42/Qwen-2.5-1.5B-Simple-RL • 2B • Updated Feb 28 • 3
YYF42/Qwen2.5-1.5B-Open-R1-GRPO • Updated Feb 24
YYF42/Qwen-2.5-7B-Simple-RL • Updated Feb 24
datasets 0
None public yet