ZiYi Yang's picture

4 14 8

ZiYi Yang

AALF

·

https://github.com/yangzy39

yangzy39

AI & ML interests

None yet

Recent Activity

authored a paper 12 days ago

ThinkSwitcher: When to Think Hard, When to Think Fast

authored a paper 12 days ago

Mutual-Taught for Co-adapting Policy and Reward Models

authored a paper 12 days ago

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

View all activity

Organizations

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview 6 months ago

merging configs

#10 opened 6 months ago by

New activity in AALF/gemma-2-27b-it-SimPO-37K 12 months ago

Difference between this and the other (100 steps) model?

#1 opened about 1 year ago by

New activity in AALF/gemma-2-27b-it-SimPO-37K about 1 year ago

Difference between this and the other (100 steps) model?

#1 opened about 1 year ago by

Difference between this and the other (100 steps) model?

#1 opened about 1 year ago by